Multi-label text classification based on semantic-sensitive graph convolutional network

被引:24
作者
Zeng, Delong [1 ]
Zha, Enze [1 ]
Kuang, Jiayi [1 ]
Shen, Ying [1 ]
机构
[1] Sun Yat Sen Univ, Gongchang Rd 66, Shenzhen 518107, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label; Text classification; Graph convolutional network;
D O I
10.1016/j.knosys.2023.111303
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-Label Text Classification (MLTC) is an important but challenging task in the field of natural language processing. In this paper, we propose a novel method, Semantic-sensitive Graph Convolutional Network (S-GCN), by simultaneously considering semantic and word-global associations. More specifically, we first leverage texts, words, and labels to construct a global graph, which helps mine the relevance between similar documents. Then we design and pre-train an encoder to initialize text nodes in the graph, from which the semantic features of documents are extracted. Next, we employ a graph convolutional network to classify text nodes, which can well fuse node information. Finally, we normalize the adjacency matrix and store hidden layer representations of word nodes, tackling the issue that conventional graph-based methods cannot predict texts that did not appear during training. We conduct experiments on three public datasets, AAPD, RMSC-V2, and Reuters-21578, and demonstrate the superiority of our model over the baselines on the MLTC task. Source code is available at https://github.com/sysu18364004/SGCN.
引用
收藏
页数:9
相关论文
共 62 条
[1]  
Adhikari A, 2019, Arxiv, DOI [arXiv:1904.08398, 10.48550/arXiv.1904.08398, DOI 10.48550/ARXIV.1904.08398]
[2]   A fast O(N lg N) time hybrid clustering algorithm using the circumference proximity based merging technique for diversified datasets [J].
Akhter, Mohammad Maksood ;
Mohanty, Sraban Kumar .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
[3]  
[Anonymous], 2017, P 14 INT C NATURAL L
[4]   Automatic ICD-10 Classification of Diseases from Dutch Discharge Letters [J].
Bagheri, Ayoub ;
Sammani, Arjan ;
Van der Heijden, Peter G. M. ;
Asselbergs, Folkert W. ;
Oberski, Daniel L. .
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3: BIOINFORMATICS, 2020, :281-289
[5]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[6]   Scaling Graph Neural Networks with Approximate PageRank [J].
Bojchevski, Aleksandar ;
Klicpera, Johannes ;
Perozzi, Bryan ;
Kapoor, Amol ;
Blais, Martin ;
Rozemberczki, Benedek ;
Lukasik, Michal ;
Guennemann, Stephan .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :2464-2473
[7]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[8]  
Chen M, 2020, ADV NEUR IN, V33
[9]   Integrating Label Semantic Similarity Scores into Multi-label Text Classification [J].
Chen, Zihao ;
Liu, Yang ;
Cheng, Baitai ;
Peng, Jing .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 :234-245
[10]  
Chuanakrud Piyawat, 2021, 2021 13th International Conference on Information Technology and Electrical Engineering (ICITEE), P24, DOI 10.1109/ICITEE53064.2021.9611935