Towards Class-Imbalance Aware Multi-Label Learning

被引:56
作者
Zhang, Min-Ling [1 ,2 ]
Li, Yu-Kun [1 ,3 ]
Yang, Hao [1 ,2 ]
Liu, Xu-Ying [1 ,2 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210096, Peoples R China
[2] Southeast Univ, Minist Educ, Key Lab Comp Network & Informat Integrat, Nanjing, Peoples R China
[3] Baidu Inc, Business Grp Nat Language Proc, Beijing 100085, Peoples R China
基金
美国国家科学基金会;
关键词
Training; Correlation; Predictive models; Labeling; Task analysis; Couplings; Technological innovation; Class-imbalance; cross-coupling aggregation (COCOA); machine learning; multi-label learning; CLASSIFICATION; CLASSIFIERS; ENSEMBLE;
D O I
10.1109/TCYB.2020.3027509
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label learning deals with training examples each represented by a single instance while associated with multiple class labels. Due to the exponential number of possible label sets to be considered by the predictive model, it is commonly assumed that label correlations should be well exploited to design an effective multi-label learning approach. On the other hand, class-imbalance stands as an intrinsic property of multi-label data which significantly affects the generalization performance of the multi-label predictive model. For each class label, the number of training examples with positive labeling assignment is generally much less than those with negative labeling assignment. To deal with the class-imbalance issue for multi-label learning, a simple yet effective class-imbalance aware learning strategy called cross-coupling aggregation (COCOA) is proposed in this article. Specifically, COCOA works by leveraging the exploitation of label correlations as well as the exploration of class-imbalance simultaneously. For each class label, a number of multiclass imbalance learners are induced by randomly coupling with other labels, whose predictions on the unseen instance are aggregated to determine the corresponding labeling relevancy. Extensive experiments on 18 benchmark datasets clearly validate the effectiveness of COCOA against state-of-the-art multi-label learning approaches especially in terms of imbalance-specific evaluation metrics.
引用
收藏
页码:4459 / 4471
页数:13
相关论文
共 50 条
  • [41] Binary relevance for multi-label learning: an overview
    Zhang, Min-Ling
    Li, Yu-Kun
    Liu, Xu-Ying
    Geng, Xin
    FRONTIERS OF COMPUTER SCIENCE, 2018, 12 (02) : 191 - 202
  • [42] Generating Counterfactual Instances for Explainable Class-Imbalance Learning
    Chen, Zhi
    Duan, Jiang
    Kang, Li
    Xu, Hongyan
    Chen, Rui
    Qiu, Guoping
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (03) : 1130 - 1144
  • [43] Effective Class-Imbalance Learning Based on SMOTE and Convolutional Neural Networks
    Joloudari, Javad Hassannataj
    Marefat, Abdolreza
    Nematollahi, Mohammad Ali
    Oyelere, Solomon Sunday
    Hussain, Sadiq
    APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [44] Dual Noise Elimination and Dynamic Label Correlation Guided Partial Multi-Label Learning
    Hu, Yan
    Fang, Xiaozhao
    Kang, Peipei
    Chen, Yonghao
    Fang, Yuting
    Xie, Shengli
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5641 - 5656
  • [45] A First Approach to Deal with Imbalance in Multi-label Datasets
    Charte, Francisco
    Rivera, Antonio
    Jose del Jesus, Maria
    Herrera, Francisco
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, 2013, 8073 : 150 - 160
  • [46] Discovering Latent Class Labels for Multi-Label Learning
    Huang, Jun
    Xu, Linchuan
    Wang, Jing
    Feng, Lei
    Yamanishi, Kenji
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3058 - 3064
  • [47] An Empirical Investigation to Overcome Class-imbalance in Inspection Reviews
    Singh, Maninder
    Walia, Gursimran S.
    Goswami, Anurag
    2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA SCIENCE (MLDS 2017), 2017, : 15 - 22
  • [48] Learning Common and Label-Specific Features for Multi-Label Classification With Missing Labels
    Li, Runxin
    Ouyang, Zexian
    Shang, Zhenhong
    Jia, Lianyin
    Li, Xiaowu
    IEEE ACCESS, 2024, 12 : 81170 - 81195
  • [49] A Novel Class-Imbalance Learning Approach for Both Within-Project and Cross-Project Defect Prediction
    Gong, Lina
    Jiang, Shujuan
    Bo, Lili
    Jiang, Li
    Qian, Junyan
    IEEE TRANSACTIONS ON RELIABILITY, 2020, 69 (01) : 40 - 54
  • [50] Towards the Learning of Weighted Multi-label Associative Classifiers
    Liu, Chunyang
    Chen, Ling
    Tsang, Ivor
    Yin, Hongzhi
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,