Heterogeneous graph contrastive learning with adaptive data augmentation for semi-supervised short text classification

被引:0
|
作者
Wu, Mingqiang [1 ]
Xu, Zhuoming [1 ]
Zheng, Lei [1 ]
机构
[1] Hohai Univ, Coll Comp Sci & Software Engn, Nanjing 211100, Jiangsu, Peoples R China
关键词
data augmentation; heterogeneous graph contrastive learning; semi-supervised short text classification; short text clustering; soft prompt;
D O I
10.1111/exsy.13744
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Short text classification has been widely used in many fields. Due to the scarcity of labelled data, implementing short text classification under semi-supervised learning setting has become increasingly popular. Semi-supervised short text classification methods based on graph neural networks can achieve state-of-the-art classification performance by utilizing the expressive power of graph neural networks. However, these methods usually fail to mine the hidden patterns of a large amount of short text node data in the graph to optimize the short text node embeddings, which limits the semantic representation power of the short texts, thus leading to suboptimal classification performance. To overcome the limitation, this paper proposes a novel semi-supervised short text classification method called the Heterogeneous Graph Contrastive Learning with Adaptive Data Augmentation (HGCLADA). In the knowledge bases guided soft prompt-based data augmentation component, the related words of the tag words are used to optimize the soft prompts for generating diverse augmented samples. In the heterogeneous graph contrastive learning framework component, a heterogeneous graph that is constructed using short texts and keywords and an effective edge augmentation scheme based on a short text clustering algorithm are proposed. The optimized short text embeddings can be obtained to achieve the effective semi-supervised short text classification. Extensive experiments on six benchmark datasets show that our HGCLADA method outperforms four classes of state-of-the-art methods in terms of classification accuracy, especially with significant performance improvements of 8.74% on the TagMyNews dataset when each class only contains 20 labelled data.
引用
收藏
页数:28
相关论文
共 50 条
  • [21] Adaptive Graph Learning for Semi-supervised Self-paced Classification
    Chen, Long
    Lu, Jianbo
    NEURAL PROCESSING LETTERS, 2022, 54 (04) : 2695 - 2716
  • [22] Short text classification algorithm based on semi-supervised learning and SVM
    Yin, Chunyong
    Xiang, Jun
    Zhang, Hui
    Yin, Zhichao
    Wang, Jin
    International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (12): : 195 - 206
  • [23] Adaptive Graph Learning for Semi-supervised Self-paced Classification
    Long Chen
    Jianbo Lu
    Neural Processing Letters, 2022, 54 : 2695 - 2716
  • [24] Semi-Supervised Short Text Classification Based on Gated Double-Layer Heterogeneous Graph Attention Network
    Jiang Y.
    Wang Q.
    Zhang X.
    Huang X.
    Shen Q.
    Rao J.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (07): : 602 - 612
  • [25] A review of semi-supervised learning for text classification
    José Marcio Duarte
    Lilian Berton
    Artificial Intelligence Review, 2023, 56 : 9401 - 9469
  • [26] A review of semi-supervised learning for text classification
    Duarte, Jose Marcio
    Berton, Lilian
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (09) : 9401 - 9469
  • [27] Semi-Supervised Text Classification With Universum Learning
    Liu, Chien-Liang
    Hsaio, Wen-Hoar
    Lee, Chia-Hoang
    Chang, Tao-Hsing
    Kuo, Tsung-Hsun
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (02) : 462 - 473
  • [28] TEXT CLASSIFICATION BASED ON SEMI-SUPERVISED LEARNING
    Vo Duy Thanh
    Vo Trung Hung
    Pham Minh Tuan
    Doan Van Ban
    2013 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2013, : 232 - 236
  • [29] SGCL: Semi-supervised Graph Contrastive Learning with confidence propagation algorithm for node classification
    Jiang, Wenhao
    Bai, Yuebin
    KNOWLEDGE-BASED SYSTEMS, 2024, 301
  • [30] Contrastive Graph Convolutional Networks with adaptive augmentation for text classification
    Yang, Yintao
    Miao, Rui
    Wang, Yili
    Wang, Xin
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (04)