A Dual-branch Learning Model with Gradient-balanced Loss for Long-tailed Multi-label Text Classification

Cited by: 2
Authors
Yao, Yitong [1]
Zhang, Jing [1]
Zhang, Peng [1]
Sun, Yueheng [1]
Affiliations
[1] Tianjin Univ, Tianjin, Peoples R China
Keywords
Multi-label text classification; long-tailed learning; dual-branch structure; re-weighting loss function
DOI
10.1145/3597416
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Multi-label text classification has a wide range of applications in the real world. However, real-world data distributions are often imbalanced, which leads to serious long-tailed problems. In multi-label classification, because of the vast scale of datasets and the existence of label co-occurrence, effectively improving the prediction accuracy of tail labels without degrading overall precision is an important challenge. To address this issue, we propose a Dual-Branch Learning Model with Gradient-Balanced Loss (DBGB), built on the paradigm of existing pre-trained state-of-the-art (SOTA) multi-label classification models. Our model makes two main improvements for long-tailed data. First, on top of a shared text representation, a dual classifier processes two kinds of label distributions: the original data distribution, and a distribution in which head labels are under-sampled to strengthen prediction for tail labels. Second, the proposed gradient-balanced loss adaptively suppresses the negative gradient accumulation problem related to labels, especially tail labels. We perform extensive experiments on three multi-label text classification datasets. The results show that the proposed method achieves competitive overall performance compared with state-of-the-art multi-label classification methods, with significant improvement in tail-label accuracy.
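The abstract's second label distribution (head labels under-sampled) can be illustrated with a small sketch. This is not the authors' implementation: the function names `split_head_tail` and `undersample_head`, the frequency-rank split, and the `head_fraction` and `keep_prob` parameters are all assumptions made for illustration, and the paper may define head labels and the sampling rule differently.

```python
import random
from collections import Counter


def split_head_tail(label_sets, head_fraction=0.2):
    """Rank labels by frequency; treat the top `head_fraction` as head labels.

    Hypothetical helper: the cut-off rule is an assumption, not taken from the paper.
    """
    counts = Counter(label for labels in label_sets for label in labels)
    ranked = [label for label, _ in counts.most_common()]
    n_head = max(1, int(len(ranked) * head_fraction))
    return set(ranked[:n_head]), set(ranked[n_head:])


def undersample_head(label_sets, head, keep_prob=0.3, seed=0):
    """Pick sample indices for a second, tail-focused branch.

    Samples carrying at least one non-head label are always kept;
    samples with only head labels are kept with probability `keep_prob`.
    """
    rng = random.Random(seed)
    kept = []
    for i, labels in enumerate(label_sets):
        has_tail = any(label not in head for label in labels)
        if has_tail or rng.random() < keep_prob:
            kept.append(i)
    return kept


if __name__ == "__main__":
    data = [["a"], ["a"], ["a", "b"], ["a"], ["c"], ["a"]]
    head, tail = split_head_tail(data)
    print("head:", head, "tail:", tail)
    print("second-branch indices:", undersample_head(data, head))
```

In a dual-branch setup of this kind, one classifier would see every sample and the other only the indices returned by `undersample_head`, with both reading the same shared text representation. How the two branches' predictions are combined is not stated in the abstract and is left out here.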
Pages: 24