Exploiting Dynamic and Fine-grained Semantic Scope for Extreme Multi-label Text Classification

被引:1
|
作者
Wang, Yuan [1 ,2 ]
Song, Huiling [1 ]
Huo, Peng [1 ]
Xu, Tao [1 ]
Yang, Jucheng [1 ]
Chen, Yarui [1 ]
Zhao, Tingting [1 ]
机构
[1] Tianjin Univ Sci & Technol, Tianjin 300457, Peoples R China
[2] Populat & Precis Hlth Care Ltd, Tianjin 300000, Peoples R China
来源
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II | 2022年 / 13552卷
基金
中国国家自然科学基金;
关键词
Extreme multi-label text classification; Semantic scope; A dual cooperative network; Data sparsity;
D O I
10.1007/978-3-031-17189-5_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extreme multi-label text classification (XMTC) refers to the problem of tagging a given text with the most relevant subset of labels from a large label set. A majority of labels only have a few training instances due to large label dimensionality in XMTC. To solve this data sparsity issue, most existing XMTC methods take advantage of fixed label clusters obtained in early stage to balance performance on tail labels and head labels. However, such label clusters provide static and coarse-grained semantic scope for every text, which ignores distinct characteristics of different texts and has difficulties modelling accurate semantics scope for texts with tail labels. In this paper, we propose a novel framework TReaderXML for XMTC, which adopts dynamic and fine-grained semantic scope from teacher knowledge for individual text to optimize text conditional prior category semantic ranges. TReaderXML dynamically obtains teacher knowledge for each text by similar texts and hierarchical label information in training sets to release the ability of distinctly fine-grained label-oriented semantic scope. Then, TReaderXML benefits from a novel dual cooperative network that firstly learns features of a text and its corresponding label-oriented semantic scope by parallel Encoding Module and Reading Module, secondly embeds two parts by Interaction Module to regularize the text's representation by dynamic and fine-grained label-oriented semantic scope, and finally find target labels by Prediction Module. Experimental results on three XMTC benchmark datasets show that our method achieves new state-of-the-art results and especially performs well for severely imbalanced and sparse datasets.
引用
收藏
页码:85 / 97
页数:13
相关论文
共 50 条
  • [1] Fine-grained local label correlation for multi-label classification
    Zhao, Tianna
    Zhang, Yuanjian
    Miao, Duoqian
    Pedrycz, Witold
    KNOWLEDGE-BASED SYSTEMS, 2025, 314
  • [2] Fine-Grained Text Classification Based on Label Augmentation
    Guo, Ruiqiang
    Yang, Shilong
    Jia, Xiaowen
    Wei, Qianqiang
    Computer Engineering and Applications, 60 (21): : 134 - 141
  • [3] Fine-Grained Emotion Detection in Suicide Notes: A Thresholding Approach to Multi-Label Classification
    Luyckx, Kim
    Vaassen, Frederik
    Peersman, Claudia
    Daelemans, Walter
    BIOMEDICAL INFORMATICS INSIGHTS, 2012, 5 : 61 - 69
  • [4] FM-ECG: A fine-grained multi-label framework for ECG image classification
    Du, Nan
    Cao, Qing
    Yu, Li
    Liu, Nathan
    Zhong, Erheng
    Liu, Zizhu
    Shen, Ying
    Chen, Kang
    INFORMATION SCIENCES, 2021, 549 : 164 - 177
  • [5] Fine-grained Multi-label Sexism Classification Using Semi-supervised Learning
    Abburi, Harika
    Parikh, Pulkit
    Chhaya, Niyati
    Varma, Vasudeva
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT II, 2020, 12343 : 531 - 547
  • [6] Evaluating Word Embeddings in Multi-label Classification Using Fine-grained Name Typing
    Yaghoobzadeh, Yadollah
    Kann, Katharina
    Schuetze, Hinrich
    REPRESENTATION LEARNING FOR NLP, 2018, : 101 - 106
  • [7] Deep Learning for Extreme Multi-label Text Classification
    Liu, Jingzhou
    Chang, Wei-Cheng
    Wu, Yuexin
    Yang, Yiming
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 115 - 124
  • [8] Correlation Networks for Extreme Multi-label Text Classification
    Xun, Guangxu
    Jha, Kishlay
    Sun, Jianhui
    Zhang, Aidong
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1074 - 1082
  • [9] Integrating Label Semantic Similarity Scores into Multi-label Text Classification
    Chen, Zihao
    Liu, Yang
    Cheng, Baitai
    Peng, Jing
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 234 - 245
  • [10] Multi-label Text Classification Method Based on Label Semantic Information
    Xiao L.
    Chen B.-L.
    Huang X.
    Liu H.-F.
    Jing L.-P.
    Yu J.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): : 1079 - 1089