A Dual-branch Learning Model with Gradient-balanced Loss for Long-tailed Multi-label Text Classification

Cited by: 2

Authors:

Yao, Yitong [1]

Zhang, Jing [1]

Zhang, Peng [1]

Sun, Yueheng [1]

Affiliation:

[1] Tianjin Univ, Tianjin, Peoples R China

Source:

ACM TRANSACTIONS ON INFORMATION SYSTEMS | 2024 / Vol. 42 / No. 02

Keywords:

Multi-label text classification; long-tailed learning; dual-branch structure; re-weighting loss function

DOI:

10.1145/3597416

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Multi-label text classification has a wide range of applications in the real world. However, real-world data distributions are often imbalanced, which leads to serious long-tailed problems. For multi-label classification, the vast scale of the datasets and the existence of label co-occurrence make it an important challenge to improve the prediction accuracy of tail labels without degrading overall precision. To address this issue, we propose a Dual-Branch Learning Model with Gradient-Balanced Loss (DBGB), based on the paradigm of existing pre-trained multi-label classification SOTA models. Our model introduces two main improvements for long-tailed data. First, on top of a shared text representation, a dual classifier processes two kinds of label distributions: one is the original data distribution, and the other is a distribution in which head labels are under-sampled to strengthen the prediction of tail labels. Second, the proposed gradient-balanced loss adaptively suppresses the accumulation of negative gradients on labels, especially tail labels. We perform extensive experiments on three multi-label text classification datasets. The results show that the proposed method achieves competitive overall performance compared with state-of-the-art multi-label classification methods, with significant improvement in tail-label accuracy.
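
The abstract does not spell out how the under-sampled distribution is built or how the two classifiers are combined, so the following is only an illustrative sketch under stated assumptions, not the authors' DBGB implementation. It assumes that a label counts as a "head" label when its frequency reaches a hypothetical `head_threshold`, that samples carrying only head labels are kept with a hypothetical probability `keep_prob`, and that the scores of the two branches are mixed with a hypothetical weight `alpha`. The gradient-balanced loss is not sketched, because the abstract gives no formula for it.

```python
import random
from collections import Counter


def label_counts(dataset):
    """Count how many samples carry each label."""
    counts = Counter()
    for _text, labels in dataset:
        counts.update(labels)
    return counts


def undersample_head(dataset, head_threshold, keep_prob, seed=0):
    """Build a second view of the data for the re-balanced branch.

    Assumption (not from the paper): a label is a head label when it occurs
    in at least `head_threshold` samples, and a sample that carries only
    head labels is kept with probability `keep_prob`. Samples with at least
    one non-head label are always kept.
    """
    rng = random.Random(seed)
    counts = label_counts(dataset)
    head = {label for label, count in counts.items() if count >= head_threshold}
    view = []
    for text, labels in dataset:
        only_head = all(label in head for label in labels)
        if only_head and rng.random() >= keep_prob:
            continue  # drop this head-only sample from the re-balanced view
        view.append((text, labels))
    return head, view


def combine_scores(scores_original, scores_rebalanced, alpha=0.5):
    """Mix per-label scores of the two branches (assumed: weighted average)."""
    return [
        alpha * a + (1.0 - alpha) * b
        for a, b in zip(scores_original, scores_rebalanced)
    ]


# Toy data: "sports" is frequent, "chess" and "opera" are rare.
toy = [
    ("d1", {"sports"}),
    ("d2", {"sports"}),
    ("d3", {"sports", "chess"}),
    ("d4", {"sports"}),
    ("d5", {"opera"}),
]
head_labels, rebalanced = undersample_head(toy, head_threshold=3, keep_prob=0.0)
```

With `keep_prob=0.0` every head-only sample is dropped, so the re-balanced view retains only the samples that mention a rare label; in practice a value between 0 and 1 would trade head-label coverage against tail-label emphasis.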

Pages: 24

