A Word-Concept Heterogeneous Graph Convolutional Network for Short Text Classification

被引:4
作者
Yang, Shigang [1 ]
Liu, Yongguo [1 ]
Zhang, Yun [1 ]
Zhu, Jiajing [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Knowledge & Data Engn Lab Chinese Med, Chengdu 610054, Peoples R China
基金
国家重点研发计划;
关键词
Short text classification; Concepts; Words; Graph convolution network; PERFORMANCE;
D O I
10.1007/s11063-022-10906-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text classification is an important task in natural language processing. However, most of the existing models focus on long texts, and their performance in short texts is not satisfied due to the problem of data sparsity. To solve this problem, recent studies have introduced the concepts of words to enrich the representation of short texts. However, these methods ignore the interactive information between words and concepts and lead introduced concepts to be noises unsuitable for semantic understanding. In this paper, we propose a new model called word-concept heterogeneous graph convolution network (WC-HGCN) to introduce interactive information between words and concepts for short text classification. WC-HGCN develops words and relevant concepts and adopts graph convolution networks to learn the representation with interactive information. Furthermore, we design an innovative learning strategy, which can make full use of the introduced concept information. Experimental results on seven real short text datasets show that our model outperforms latest baseline methods.
引用
收藏
页码:735 / 750
页数:16
相关论文
共 37 条
[21]  
Liu XE, 2020, AAAI CONF ARTIF INTE, V34, P8409
[22]  
Niu GC, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P496
[23]  
Pennington Jeffrey, 2014, P 2014 C EMPIRICAL M, P1532, DOI [10.3115/v1/D14-1162, DOI 10.3115/V1/D14-1162]
[24]   Improving random forests by neighborhood projection for effective text classification [J].
Salles, Thiago ;
Goncalves, Marcos ;
Rodrigues, Victor ;
Rocha, Leonardo .
INFORMATION SYSTEMS, 2018, 77 :1-21
[25]   The Graph Neural Network Model [J].
Scarselli, Franco ;
Gori, Marco ;
Tsoi, Ah Chung ;
Hagenbuchner, Markus ;
Monfardini, Gabriele .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (01) :61-80
[26]  
Shanahan JG, 2003, LECT NOTES ARTIF INT, V2837, P361
[27]  
Sui YL, 2013, INT SYM CODE GENER, P1
[28]   PTE: Predictive Text Embedding through Large-scale Heterogeneous Text Networks [J].
Tang, Jian ;
Qu, Meng ;
Mei, Qiaozhu .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :1165-1174
[29]  
Vaswani A, 2017, ADV NEUR IN, V30
[30]  
Wang AQ, 2016, INT CONF GEOINFORM