Multi-label classification with label clusters

被引:0
作者
Gatto, Elaine Cecilia [1 ]
Ferrandin, Mauri [2 ]
Cerri, Ricardo [3 ]
机构
[1] Univ Fed Sao Carlos, Dept Comp Sci, BR-13565905 Sao Carlos, SP, Brazil
[2] Univ Fed Santa Catarina, Dept Control Automat & Comp Engn, BR-89036002 Blumenau, SC, Brazil
[3] Univ Sao Paulo, Inst Math & Comp Sci, Ave Trabalhador Sao Carlense,400 Ctr, BR-13566590 Sao Carlos, SP, Brazil
关键词
Multi-label correlations; Multi-label partitioning; Multi-label clustering; Multi-label classification; Multi-label learning; CLASSIFIERS; DEPENDENCE; ENSEMBLES;
D O I
10.1007/s10115-024-02270-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label classification is the task of simultaneously predicting a set of labels for an instance, with global and local being the two predominant approaches. The global approach trains a single classifier to handle all classes simultaneously, while the local approach breaks down the problem into multiple binary problems. Despite extensive research, effectively capturing label correlations remains a challenge in both methods. In this paper, we introduce an approach that clusters the label space to create hybrid partitions (disjoint correlated label clusters), striking a balance between global and local strategies while leveraging both advantages. Our approach consists of (i) clustering the label space based on correlations, (ii) generating and validating the resulting hybrid partitions, (iii) selecting the best partitions, and (iv) evaluating their performance. We also compare our approach against an oracle, exhaustive search, and random search to assess how closely our hybrid partitions approximate the best possible partitions. The oracle selects the best partition using the test set, while the exhaustive approach relies on validation data. Experiments conducted on multiple multi-label datasets demonstrate that our method, along with random partitions, achieves results that are superior or competitive compared to traditional global and local approaches, as well as the state-of-the-art Ensemble of Classifier Chains. These findings suggest that conventional methods may not fully capture label correlations, and clustering the label space offers a promising solution.
引用
收藏
页码:1741 / 1785
页数:45
相关论文
共 68 条
  • [51] A survey of hierarchical classification across different application domains
    Silla, Carlos N., Jr.
    Freitas, Alex A.
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 22 (1-2) : 31 - 72
  • [52] Spivey MZ., 2008, J INTEGER SEQ, V11, P5
  • [53] How Is a Data-Driven Approach Better than Random Choice in Label Space Division for Multi-Label Classification?
    Szymanski, Piotr
    Kajdanowicz, Tomasz
    Kersting, Kristian
    [J]. ENTROPY, 2016, 18 (08)
  • [54] A Classification Model For Class Imbalance Dataset Using Genetic Programming
    Tahir, Mirza Amaad Ul Haq
    Asghar, Sohail
    Manzoor, Awais
    Noor, Muhammad Asim
    [J]. IEEE ACCESS, 2019, 7 : 71013 - 71037
  • [55] Multi-label classification via label correlation and first order feature dependance in a data stream
    Tien Thanh Nguyen
    Thi Thu Thuy Nguyen
    Anh Vu Luong
    Quoc Viet Hung Nguyen
    Liew, Alan Wee-Chung
    Stantic, Bela
    [J]. PATTERN RECOGNITION, 2019, 90 : 35 - 51
  • [56] Tsoumakas G., 2008, Proc. ECML/PKDD 2008 Workshop on Mining Multidimensional Data (MMD'08), V21, P53
  • [57] Tsoumakas G, 2007, LECT NOTES ARTIF INT, V4701, P406
  • [58] Decision trees for hierarchical multi-label classification
    Vens, Celine
    Struyf, Jan
    Schietgat, Leander
    Dzeroski, Saso
    Blockeel, Hendrik
    [J]. MACHINE LEARNING, 2008, 73 (02) : 185 - 214
  • [59] STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity
    Wang, Xiangeng
    Zhu, Xiaolei
    Ye, Mingzhi
    Wang, Yanjing
    Li, Cheng-Dong
    Xiong, Yi
    Wei, Dong-Qing
    [J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2019, 7
  • [60] ATC-NLSP: Prediction of the Classes of Anatomical Therapeutic Chemicals Using a Network-Based Label Space Partition Method
    Wang, Xiangeng
    Wang, Yanjing
    Xu, Zhenyu
    Xiong, Yi
    Wei, Dong-Qing
    [J]. FRONTIERS IN PHARMACOLOGY, 2019, 10