A Dual-branch Learning Model with Gradient-balanced Loss for Long-tailed Multi-label Text Classification

被引:2
|
作者
Yao, Yitong [1 ]
Zhang, Jing [1 ]
Zhang, Peng [1 ]
Sun, Yueheng [1 ]
机构
[1] Tianjin Univ, Tianjin, Peoples R China
关键词
Multi-label text classification; long-tailed learning; dual-branch structure; re-weighting loss function;
D O I
10.1145/3597416
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label text classification has awide range of applications in the realworld. However, the data distribution in the real world is often imbalanced, which leads to serious long-tailed problems. For multi-label classification, due to the vast scale of datasets and existence of label co-occurrence, how to effectively improve the prediction accuracy of tail labels without degrading the overall precision becomes an important challenge. To address this issue, we propose A Dual-Branch Learning Model with Gradient-Balanced Loss (DBGB) based on the paradigm of existing pre-trained multi-label classification SOTA models. Our model consists of two main long-tailed module improvements. First, with the shared text representation, the dual-classifier is leveraged to process two kinds of label distributions; one is the original data distribution and the other is the under-sampling distribution for head labels to strengthen the prediction for tail labels. Second, the proposed gradient-balanced loss can adaptively suppress the negative gradient accumulation problem related to labels, especially tail labels. We perform extensive experiments on three multi-label text classification datasets. The results show that the proposed method achieves competitive performance on overall prediction results compared to the state-of-the-art methods in solving the multi-label classification, with significant improvement on tail-label accuracy.
引用
收藏
页数:24
相关论文
共 43 条
  • [31] An R-Transformer_BiLSTM Model Based on Attention for Multi-label Text Classification
    Yaoyao Yan
    Fang’ai Liu
    Xuqiang Zhuang
    Jie Ju
    Neural Processing Letters, 2023, 55 : 1293 - 1316
  • [32] Long-Tailed Classification Based on Coarse-Grained Leading Forest and Multi-Center Loss
    Yang, Jinye
    Xu, Ji
    Wu, Di
    Tang, Jianhang
    Li, Shaobo
    Wang, Guoyin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [33] A Hybrid Model Based on Convolutional Neural Network and Long Short-Term Memory for Multi-label Text Classification
    Hamed Khataei Maragheh
    Farhad Soleimanian Gharehchopogh
    Kambiz Majidzadeh
    Amin Babazadeh Sangar
    Neural Processing Letters, 56
  • [34] A Hybrid Model Based on Convolutional Neural Network and Long Short-Term Memory for Multi-label Text Classification
    Maragheh, Hamed Khataei
    Gharehchopogh, Farhad Soleimanian
    Majidzadeh, Kambiz
    Sangar, Amin Babazadeh
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [35] Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification
    Zhang, Yu
    Shen, Zhihong
    Wu, Chieh-Han
    Xie, Boya
    Hao, Junheng
    Wang, Ye-Yi
    Wang, Kuansan
    Han, Jiawei
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 3162 - 3173
  • [36] Hierarchical Graph Transformer-Based Deep Learning Model for Large-Scale Multi-Label Text Classification
    Gong, Jibing
    Teng, Zhiyong
    Teng, Qi
    Zhang, Hekai
    Du, Linfeng
    Chen, Shuai
    Bhuiyan, Md Zakirul Alam
    Li, Jianhua
    Liu, Mingsheng
    Ma, Hongyuan
    IEEE ACCESS, 2020, 8 : 30885 - 30896
  • [37] Feature balanced re-enhanced network with multi-factor margin loss for long-tailed visual recognition
    Wang, Yaoyao
    Zhai, Junhai
    NEUROCOMPUTING, 2024, 610
  • [38] History-based attention in Seq2Seq model for multi-label text classification
    Xiao, Yaoqiang
    Li, Yi
    Yuan, Jin
    Guo, Songrui
    Xiao, Yi
    Li, Zhiyong
    KNOWLEDGE-BASED SYSTEMS, 2021, 224
  • [39] A multi-label social short text classification method based on contrastive learning and improved ml-KNN
    Tian, Gang
    Wang, Jiachang
    Wang, Rui
    Zhao, Guangxin
    He, Cheng
    EXPERT SYSTEMS, 2024, 41 (07)
  • [40] Multi-label text classification on unbalanced Twitter with monolingual model and hyperparameter optimization for hate speech and abusive language detection
    Alzahrani, Ahmad A.
    Bramantoro, Arif
    Permana, Asep
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2024, 11 (05): : 177 - 185