Multi-label classification with label clusters

被引:0
作者
Gatto, Elaine Cecilia [1 ]
Ferrandin, Mauri [2 ]
Cerri, Ricardo [3 ]
机构
[1] Univ Fed Sao Carlos, Dept Comp Sci, BR-13565905 Sao Carlos, SP, Brazil
[2] Univ Fed Santa Catarina, Dept Control Automat & Comp Engn, BR-89036002 Blumenau, SC, Brazil
[3] Univ Sao Paulo, Inst Math & Comp Sci, Ave Trabalhador Sao Carlense,400 Ctr, BR-13566590 Sao Carlos, SP, Brazil
关键词
Multi-label correlations; Multi-label partitioning; Multi-label clustering; Multi-label classification; Multi-label learning; CLASSIFIERS; DEPENDENCE; ENSEMBLES;
D O I
10.1007/s10115-024-02270-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label classification is the task of simultaneously predicting a set of labels for an instance, with global and local being the two predominant approaches. The global approach trains a single classifier to handle all classes simultaneously, while the local approach breaks down the problem into multiple binary problems. Despite extensive research, effectively capturing label correlations remains a challenge in both methods. In this paper, we introduce an approach that clusters the label space to create hybrid partitions (disjoint correlated label clusters), striking a balance between global and local strategies while leveraging both advantages. Our approach consists of (i) clustering the label space based on correlations, (ii) generating and validating the resulting hybrid partitions, (iii) selecting the best partitions, and (iv) evaluating their performance. We also compare our approach against an oracle, exhaustive search, and random search to assess how closely our hybrid partitions approximate the best possible partitions. The oracle selects the best partition using the test set, while the exhaustive approach relies on validation data. Experiments conducted on multiple multi-label datasets demonstrate that our method, along with random partitions, achieves results that are superior or competitive compared to traditional global and local approaches, as well as the state-of-the-art Ensemble of Classifier Chains. These findings suggest that conventional methods may not fully capture label correlations, and clustering the label space offers a promising solution.
引用
收藏
页码:1741 / 1785
页数:45
相关论文
共 68 条
[51]   A survey of hierarchical classification across different application domains [J].
Silla, Carlos N., Jr. ;
Freitas, Alex A. .
DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 22 (1-2) :31-72
[52]  
Spivey MZ., 2008, J INTEGER SEQ, V11, P5
[53]   How Is a Data-Driven Approach Better than Random Choice in Label Space Division for Multi-Label Classification? [J].
Szymanski, Piotr ;
Kajdanowicz, Tomasz ;
Kersting, Kristian .
ENTROPY, 2016, 18 (08)
[54]   A Classification Model For Class Imbalance Dataset Using Genetic Programming [J].
Tahir, Mirza Amaad Ul Haq ;
Asghar, Sohail ;
Manzoor, Awais ;
Noor, Muhammad Asim .
IEEE ACCESS, 2019, 7 :71013-71037
[55]   Multi-label classification via label correlation and first order feature dependance in a data stream [J].
Tien Thanh Nguyen ;
Thi Thu Thuy Nguyen ;
Anh Vu Luong ;
Quoc Viet Hung Nguyen ;
Liew, Alan Wee-Chung ;
Stantic, Bela .
PATTERN RECOGNITION, 2019, 90 :35-51
[56]  
Tsoumakas G, 2008, P ECML PKDD 2008 WOR, P30
[57]  
Tsoumakas G, 2007, LECT NOTES ARTIF INT, V4701, P406
[58]   Decision trees for hierarchical multi-label classification [J].
Vens, Celine ;
Struyf, Jan ;
Schietgat, Leander ;
Dzeroski, Saso ;
Blockeel, Hendrik .
MACHINE LEARNING, 2008, 73 (02) :185-214
[59]   STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity [J].
Wang, Xiangeng ;
Zhu, Xiaolei ;
Ye, Mingzhi ;
Wang, Yanjing ;
Li, Cheng-Dong ;
Xiong, Yi ;
Wei, Dong-Qing .
FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2019, 7
[60]   ATC-NLSP: Prediction of the Classes of Anatomical Therapeutic Chemicals Using a Network-Based Label Space Partition Method [J].
Wang, Xiangeng ;
Wang, Yanjing ;
Xu, Zhenyu ;
Xiong, Yi ;
Wei, Dong-Qing .
FRONTIERS IN PHARMACOLOGY, 2019, 10