Explainable Model-specific Algorithm Selection for Multi-Label Classification

被引:0
作者
Kostovska, Ana [1 ]
Doerr, Carola [1 ]
Dzeroski, Saso [1 ]
Kocev, Dragi [1 ]
Panov, Pance [1 ]
Eftimov, Tome [1 ]
机构
[1] Jozef Stefan Inst, Knowledge Technol Dept, Ljubljana, Slovenia
来源
2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI) | 2022年
关键词
automated algorithm selection; multi-label classification; XAI;
D O I
10.1109/SSCI51031.2022.10022177
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label classification (MLC) is an ML task of predictive modeling in which a data instance can simultaneously belong to multiple classes. MLC is increasingly gaining interest in different application domains such as text mining, computer vision, and bioinformatics. Several MLC algorithms have been proposed in the literature, resulting in a meta-optimization problem that the user needs to address: which MLC approach to select for a given dataset? To address this algorithm selection problem, we investigate in this work the quality of an automated approach that uses characteristics of the datasets - so-called features - and a trained algorithm selector to choose which algorithm to apply for a given task. For our empirical evaluation, we use a portfolio of 38 datasets. We consider eight MLC algorithms, whose quality we evaluate using six different performance metrics. We show that our automated algorithm selector outperforms any of the single MLC algorithms, and this is for all evaluated performance measures. Our selection approach is explainable, a characteristic that we exploit to investigate which meta-features have the largest influence on the decisions made by the algorithm selector. Finally, we also quantify the importance of the most significant metafeatures for various domains.
引用
收藏
页码:39 / 46
页数:8
相关论文
共 40 条
[1]   On learning algorithm selection for classification [J].
Ali, S ;
Smith, KA .
APPLIED SOFT COMPUTING, 2006, 6 (02) :119-138
[2]  
Ana Kostovska, 2022, Zenodo, DOI 10.5281/ZENODO.6829671
[3]   ASlib: A benchmark library for algorithm selection [J].
Bischl, Bernd ;
Kerschke, Pascal ;
Kotthoff, Lars ;
Lindauer, Marius ;
Malitsky, Yuri ;
Frechette, Alexandre ;
Hoos, Holger ;
Hutter, Frank ;
Leyton-Brown, Kevin ;
Tierney, Kevin ;
Vanschoren, Joaquin .
ARTIFICIAL INTELLIGENCE, 2016, 237 :41-58
[4]   Comprehensive comparative study of multi-label classification methods [J].
Bogatinovski, Jasmin ;
Todorovski, Ljupco ;
Dzeroski, Saso ;
Kocev, Dragi .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 203
[5]   Explaining the performance of multilabel classification methods with data set properties [J].
Bogatinovski, Jasmin ;
Todorovski, Ljupco ;
Dzeroski, Saso ;
Kocev, Dragi .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (09) :6080-6122
[6]  
Brazdil P., 2008, METALEARNING APPL DA
[7]   On active learning in multi-label classification [J].
Brinker, K .
FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, :206-213
[8]  
Chang WC, 2020, Arxiv, DOI arXiv:1905.02331
[9]   Multi-label text classification with latent word-wise label information [J].
Chen, Ziheng ;
Ren, Jiangtao .
APPLIED INTELLIGENCE, 2021, 51 (02) :966-979
[10]   Automatic selection of clustering algorithms using supervised graph embedding [J].
Cohen-Shapira, Noy ;
Rokach, Lior .
INFORMATION SCIENCES, 2021, 577 :824-851