Extreme Multi-Label Text Classification Based on Balance Function

Cited by: 0
Authors
Chen, Zhaohong [1]
Hong, Zhiyong [1]
Yu, Wenhua [1]
Zhang, Xin [1]
Affiliation
[1] Faculty of Intelligent Manufacturing, Wuyi University, Guangdong, Jiangmen, 529020, China
DOI
10.3778/j.issn.1002-8331.2209-0472
Abstract
Extreme multi-label text classification is a challenging task in natural language processing. The labeled data in this task follow a long-tailed distribution, so models learn tail labels poorly, which degrades overall classification performance. To address this problem, an extreme multi-label text classification method based on a balance function is proposed. First, the pre-trained BERT model is used for word embedding. Next, the concatenated outputs of multiple encoder layers of the pre-trained model are used as the text vector representation, which captures richer semantic information and improves the model's convergence speed. Finally, the balance function assigns different attenuation weights to the training losses of different predicted labels, which improves the method's ability to learn tail-label classification. Experimental results show that P@1, P@3, and P@5 reach 86.95%, 74.12%, and 61.43% on Eurlex-4K and 88.57%, 77.46%, and 67.90% on Wiki10-31K, respectively. © The Author(s) 2024.
Pages: 163-172
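
The abstract reports results as P@1, P@3, and P@5. As a reading aid, here is a minimal sketch of how precision at k is commonly computed in extreme multi-label classification: for each document, take the k highest-scoring labels and measure the fraction that are true labels, then average over documents. The function names and the toy scores below are hypothetical illustrations, not taken from the paper, and the paper's own evaluation code may differ in details such as tie handling.

```python
from typing import List, Sequence, Set


def precision_at_k(scores: Sequence[float], relevant: Set[int], k: int) -> float:
    """Fraction of the k highest-scoring labels that are in the true label set."""
    if k <= 0:
        raise ValueError("k must be positive")
    # Rank label indices by descending score; ties keep the lower index first
    # because Python's sort is stable.
    ranked = sorted(range(len(scores)), key=lambda i: -scores[i])
    top_k = ranked[:k]
    return sum(1 for i in top_k if i in relevant) / k


def mean_precision_at_k(
    all_scores: List[Sequence[float]], all_relevant: List[Set[int]], k: int
) -> float:
    """Average P@k over a collection of documents."""
    values = [precision_at_k(s, r, k) for s, r in zip(all_scores, all_relevant)]
    return sum(values) / len(values)


# Toy data (hypothetical): 2 documents, 6 candidate labels each.
scores = [
    [0.9, 0.1, 0.8, 0.3, 0.7, 0.2],
    [0.2, 0.6, 0.1, 0.9, 0.4, 0.5],
]
relevant = [{0, 4}, {3, 5}]

for k in (1, 3, 5):
    print(f"P@{k} = {mean_precision_at_k(scores, relevant, k):.4f}")
```

Because each toy document has only two true labels, P@k necessarily falls as k grows past 2, which is one reason reported P@5 values are typically lower than P@1.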