A Dual-branch Learning Model with Gradient-balanced Loss for Long-tailed Multi-label Text Classification

被引:2
|
作者
Yao, Yitong [1 ]
Zhang, Jing [1 ]
Zhang, Peng [1 ]
Sun, Yueheng [1 ]
机构
[1] Tianjin Univ, Tianjin, Peoples R China
关键词
Multi-label text classification; long-tailed learning; dual-branch structure; re-weighting loss function;
D O I
10.1145/3597416
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label text classification has awide range of applications in the realworld. However, the data distribution in the real world is often imbalanced, which leads to serious long-tailed problems. For multi-label classification, due to the vast scale of datasets and existence of label co-occurrence, how to effectively improve the prediction accuracy of tail labels without degrading the overall precision becomes an important challenge. To address this issue, we propose A Dual-Branch Learning Model with Gradient-Balanced Loss (DBGB) based on the paradigm of existing pre-trained multi-label classification SOTA models. Our model consists of two main long-tailed module improvements. First, with the shared text representation, the dual-classifier is leveraged to process two kinds of label distributions; one is the original data distribution and the other is the under-sampling distribution for head labels to strengthen the prediction for tail labels. Second, the proposed gradient-balanced loss can adaptively suppress the negative gradient accumulation problem related to labels, especially tail labels. We perform extensive experiments on three multi-label text classification datasets. The results show that the proposed method achieves competitive performance on overall prediction results compared to the state-of-the-art methods in solving the multi-label classification, with significant improvement on tail-label accuracy.
引用
收藏
页数:24
相关论文
共 43 条
  • [21] Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge
    Holste, Gregory
    Zhou, Yiliang
    Wang, Song
    Jaiswal, Ajay
    Lin, Mingquan
    Zhuge, Sherry
    Yang, Yuzhe
    Kim, Dongkyun
    Nguyen-Mau, Trong-Hieu
    Tran, Minh-Triet
    Jeong, Jaehyup
    Park, Wongi
    Ryu, Jongbin
    Hong, Feng
    Verma, Arsh
    Yamagishi, Yosuke
    Kim, Changhyun
    Seo, Hyeryeong
    Kang, Myungjoo
    Celi, Leo Anthony
    Lu, Zhiyong
    Summers, Ronald M.
    Shih, George
    Wang, Zhangyang
    Peng, Yifan
    MEDICAL IMAGE ANALYSIS, 2024, 97
  • [22] Learning to rank for multi-label text classification: Combining different sources of information
    Azarbonyad, Hosein
    Dehghani, Mostafa
    Marx, Maarten
    Kamps, Jaap
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (01) : 89 - 111
  • [23] LSPCL: Label-specific supervised prototype contrastive learning for multi-label text classification
    Wang, Gang
    Du, Yajun
    Jiang, Yurui
    KNOWLEDGE-BASED SYSTEMS, 2025, 309
  • [24] Label-Embedding Bi-directional Attentive Model for Multi-label Text Classification
    Liu, Naiyin
    Wang, Qianlong
    Ren, Jiangtao
    NEURAL PROCESSING LETTERS, 2021, 53 (01) : 375 - 389
  • [25] Label-Embedding Bi-directional Attentive Model for Multi-label Text Classification
    Naiyin Liu
    Qianlong Wang
    Jiangtao Ren
    Neural Processing Letters, 2021, 53 : 375 - 389
  • [26] Multi-Label Text Classification Model Based on Multi-Level Constraint Augmentation and Label Association Attention
    Wei, Xiao
    Huang, Jianbao
    Zhao, Rui
    Yu, Hang
    Xu, Zheng
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (01)
  • [27] Gate-Attention and Dual-End Enhancement Mechanism for Multi-Label Text Classification
    Cheng, Jieren
    Chen, Xiaolong
    Xu, Wenghang
    Hua, Shuai
    Tang, Zhu
    Sheng, Victor S.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (02): : 1779 - 1793
  • [28] A Hybrid BERT Model That Incorporates Label Semantics via Adjustive Attention for Multi-Label Text Classification
    Cai, Linkun
    Song, Yu
    Liu, Tao
    Zhang, Kunli
    IEEE ACCESS, 2020, 8 (08): : 152183 - 152192
  • [29] An R-Transformer_BiLSTM Model Based on Attention for Multi-label Text Classification
    Yan, Yaoyao
    Liu, Fang'ai
    Zhuang, Xuqiang
    Ju, Jie
    NEURAL PROCESSING LETTERS, 2023, 55 (02) : 1293 - 1316
  • [30] Latent Dirichlet Allocation complement in the vector space model for Multi-Label Text Classification
    Carrera-Trejo, Victor
    Sidorov, Grigori
    Miranda-Jimenez, Sabino
    Moreno Ibarra, Marco
    Cadena Martinez, Rodrigo
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2015, 6 (01): : 7 - 19