A Word-Concept Heterogeneous Graph Convolutional Network for Short Text Classification

被引:4
作者
Yang, Shigang [1 ]
Liu, Yongguo [1 ]
Zhang, Yun [1 ]
Zhu, Jiajing [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Knowledge & Data Engn Lab Chinese Med, Chengdu 610054, Peoples R China
基金
国家重点研发计划;
关键词
Short text classification; Concepts; Words; Graph convolution network; PERFORMANCE;
D O I
10.1007/s11063-022-10906-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text classification is an important task in natural language processing. However, most of the existing models focus on long texts, and their performance in short texts is not satisfied due to the problem of data sparsity. To solve this problem, recent studies have introduced the concepts of words to enrich the representation of short texts. However, these methods ignore the interactive information between words and concepts and lead introduced concepts to be noises unsuitable for semantic understanding. In this paper, we propose a new model called word-concept heterogeneous graph convolution network (WC-HGCN) to introduce interactive information between words and concepts for short text classification. WC-HGCN develops words and relevant concepts and adopts graph convolution networks to learn the representation with interactive information. Furthermore, we design an innovative learning strategy, which can make full use of the introduced concept information. Experimental results on seven real short text datasets show that our model outperforms latest baseline methods.
引用
收藏
页码:735 / 750
页数:16
相关论文
共 37 条
  • [1] Review of short-text classification
    Alsmadi, Issa
    Gan, Keng Hoon
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2019, 15 (02) : 155 - 182
  • [2] Recurrent neural networks for robust real-world text classffication
    Arevian, Garen
    [J]. PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE: WI 2007, 2007, : 326 - 329
  • [3] Baoxun Xu, 2012, Advances in Knowledge Discovery and Data Mining. Proceedings 16th Pacific-Asia Conference (PAKDD 2012), P147, DOI 10.1007/978-3-642-30217-6_13
  • [4] Batal Iyad., 2009, P 18 ACM C INFORM KN, P2041
  • [5] Chen JD, 2019, AAAI CONF ARTIF INTE, P6252
  • [6] Dilrukshi Inoshika, 2014, International Journal of Machine Learning and Computing, V4, P70, DOI 10.7763/IJMLC.2014.V4.438
  • [7] A novel naive bayesian text classifier
    Ding, Wang
    Yu, Songnian
    Wang, Qianfeng
    Yu, Jiaqi
    Guo, Qiang
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 78 - 82
  • [8] Text Classification Research with Attention-based Recurrent Neural Networks
    Du, C.
    Huang, L.
    [J]. INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2018, 13 (01) : 50 - 61
  • [9] Lazy fine-tuning algorithms for naive Bayesian text classification
    El Hindi, Khalil M.
    Aljulaidan, Reem R.
    AlSalman, Hussien
    [J]. APPLIED SOFT COMPUTING, 2020, 96
  • [10] Ge Song, 2014, Journal of Multimedia, V9, P635, DOI 10.4304/jmm.9.5.635-643