Label-representative graph convolutional network for multi-label text classification

被引:21
作者
Huy-The Vu [1 ]
Minh-Tien Nguyen [2 ]
Van-Chien Nguyen [5 ]
Minh-Hieu Pham [6 ]
Van-Quyet Nguyen [3 ]
Van-Hau Nguyen [4 ]
机构
[1] Hung Yen Univ Technol & Educ, Dept Comp Sci, Hung Yen, Vietnam
[2] Hung Yen Univ Technol & Educ, Fac Informat & Technol, Hung Yen, Vietnam
[3] Hung Yen Univ Technol & Educ, Hung Yen, Vietnam
[4] Hung Yen Univ Technol & Educ, Fac Informat & Technol, AI Ctr, Hung Yen, Vietnam
[5] Hanoi Univ Sci & Technol, Comp Sci, Hanoi, Vietnam
[6] Foreign Trade Univ, Hanoi, Vietnam
关键词
Graph convolutional network; Multi-label classification; Correlation matrix; Label embedding; Label correlation;
D O I
10.1007/s10489-022-04106-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label text classification (MLTC) is the task that assigns each document to the most relevant subset of class labels. Previous works usually ignored the correlation and semantics of labels resulting in information loss. To deal with this problem, we propose a new model that explores label dependencies and semantics by using graph convolutional networks (GCN). Particularly, we introduce an efficient correlation matrix to model label correlation based on occurrence and co-occurrence probabilities. To enrich the semantic information of labels, we design a method to use external information from Wikipedia for label embeddings. Correlated label information learned from GCN is combined with fine-grained document representation generated from another sub-net for classification. Experimental results on three benchmark datasets show that our model outweighs prior state-of-the-art methods. Ablation studies also show several aspects of the proposed model. Our code is available at https://github.com/chiennv2000/LR-GCN.
引用
收藏
页码:14759 / 14774
页数:16
相关论文
共 36 条
[1]  
Adhikari Ashutosh, 2019, Docbert: Bert for document classification
[2]  
杨云峰, 1999, [西安公路交通大学学报, Journal of Xian Highway University], P67
[3]  
[Anonymous], 2019, CoRR, DOI DOI 10.48550/arXiv.1907.11692
[4]  
[Anonymous], 2013, P WORKSHOP ICLR 2013
[5]  
Biesialska M, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): STUDENT RESEARCH WORKSHOP, P271
[6]   A Hybrid BERT Model That Incorporates Label Semantics via Adjustive Attention for Multi-Label Text Classification [J].
Cai, Linkun ;
Song, Yu ;
Liu, Tao ;
Zhang, Kunli .
IEEE ACCESS, 2020, 8 :152183-152192
[7]  
Chai D, 2020, PR MACH LEARN RES, V119
[8]   Multi-Label Image Recognition with Graph Convolutional Networks [J].
Chen, Zhao-Min ;
Wei, Xiu-Shen ;
Wang, Peng ;
Guo, Yanwen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5172-5181
[9]   Hierarchical Graph Transformer-Based Deep Learning Model for Large-Scale Multi-Label Text Classification [J].
Gong, Jibing ;
Teng, Zhiyong ;
Teng, Qi ;
Zhang, Hekai ;
Du, Linfeng ;
Chen, Shuai ;
Bhuiyan, Md Zakirul Alam ;
Li, Jianhua ;
Liu, Mingsheng ;
Ma, Hongyuan .
IEEE ACCESS, 2020, 8 :30885-30896
[10]   Aspect-level sentiment analysis with aspect-specific context position information [J].
Huang, Bo ;
Guo, Ruyan ;
Zhu, Yimin ;
Fang, Zhijun ;
Zeng, Guohui ;
Liu, Jin ;
Wang, Yini ;
Fujita, Hamido ;
Shi, Zhicai .
KNOWLEDGE-BASED SYSTEMS, 2022, 243