Extreme Multi-Label Text Classification Based on Balance Function

被引:0
|
作者
Chen, Zhaohong [1 ]
Hong, Zhiyong [1 ]
Yu, Wenhua [1 ]
Zhang, Xin [1 ]
机构
[1] Faculty of Intelligent Manufacturing, Wuyi University, Guangdong, Jiangmen,529020, China
关键词
D O I
10.3778/j.issn.1002-8331.2209-0472
中图分类号
学科分类号
摘要
Extreme multi-label text classification is a challenging task in the field of natural language processing. In this task, there is a long-tailed distribution situation of labeled data. In this situation, model has a poor ability to learn tail labels classification, which results the overall classification effect is not good. In order to address the above problems, an extreme multi-label text classification method based on balance function is proposed. Firstly, the BERT pre-training model is used for word embedding. Further, the concatenated output of the multi-layer encoder in the pre-trained model is used as the text vector representation to obtain richer text semantic information and improves the model convergence speed. Finally, the balance function is used to assign different attenuation weights to the training losses of different prediction labels, which improves the learning ability of the method on tail label classification. The experimental results on Eurlex-4K and Wiki10- 31K datasets show that the evaluation indicators P@1, P@3 and P@5 respectively reach 86.95%, 74.12%, 61.43% and 88.57%, 77.46% and 67.90%. © The Author(s) 2024.
引用
收藏
页码:163 / 172
相关论文
共 50 条
  • [1] Deep Learning for Extreme Multi-label Text Classification
    Liu, Jingzhou
    Chang, Wei-Cheng
    Wu, Yuexin
    Yang, Yiming
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 115 - 124
  • [2] Correlation Networks for Extreme Multi-label Text Classification
    Xun, Guangxu
    Jha, Kishlay
    Sun, Jianhui
    Zhang, Aidong
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1074 - 1082
  • [3] Taming Pretrained Transformers for Extreme Multi-label Text Classification
    Chang, Wei-Cheng
    Yu, Hsiang-Fu
    Zhong, Kai
    Yang, Yiming
    Dhillon, Inderjit S.
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3163 - 3171
  • [4] MatchXML: An Efficient Text-Label Matching Framework for Extreme Multi-Label Text Classification
    Ye, Hui
    Sunderraman, Rajshekhar
    Ji, Shihao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (09) : 4781 - 4793
  • [5] Multi-Label Text Classification Based on DistilBERT and Label Correlation
    Wang, Xuyang
    Geng, Liuqing
    Zhang, Xin
    Computer Engineering and Applications, 2024, 60 (23) : 168 - 175
  • [6] An Empirical Study for Class Imbalance in Extreme Multi-label Text Classification
    Han, Sangwoo
    Lim, Chan
    Cha, Bonggeon
    Lee, Jongwuk
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2021), 2021, : 338 - 341
  • [7] Deep Learning Method with Attention for Extreme Multi-label Text Classification
    Chen, Si
    Wang, Liangguo
    Li, Wan
    Zhang, Kun
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 179 - 190
  • [8] Deep neural network for hierarchical extreme multi-label text classification
    Gargiulo, Francesco
    Silvestri, Stefano
    Ciampi, Mario
    De Pietro, Giuseppe
    APPLIED SOFT COMPUTING, 2019, 79 : 125 - 138
  • [9] Label prompt for multi-label text classification
    Song, Rui
    Liu, Zelong
    Chen, Xingbing
    An, Haining
    Zhang, Zhiqi
    Wang, Xiaoguang
    Xu, Hao
    APPLIED INTELLIGENCE, 2023, 53 (08) : 8761 - 8775
  • [10] Label prompt for multi-label text classification
    Rui Song
    Zelong Liu
    Xingbing Chen
    Haining An
    Zhiqi Zhang
    Xiaoguang Wang
    Hao Xu
    Applied Intelligence, 2023, 53 : 8761 - 8775