Heterogeneous graph contrastive learning with adaptive data augmentation for semi-supervised short text classification

被引:0
|
作者
Wu, Mingqiang [1 ]
Xu, Zhuoming [1 ]
Zheng, Lei [1 ]
机构
[1] Hohai Univ, Coll Comp Sci & Software Engn, Nanjing 211100, Jiangsu, Peoples R China
关键词
data augmentation; heterogeneous graph contrastive learning; semi-supervised short text classification; short text clustering; soft prompt;
D O I
10.1111/exsy.13744
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Short text classification has been widely used in many fields. Due to the scarcity of labelled data, implementing short text classification under semi-supervised learning setting has become increasingly popular. Semi-supervised short text classification methods based on graph neural networks can achieve state-of-the-art classification performance by utilizing the expressive power of graph neural networks. However, these methods usually fail to mine the hidden patterns of a large amount of short text node data in the graph to optimize the short text node embeddings, which limits the semantic representation power of the short texts, thus leading to suboptimal classification performance. To overcome the limitation, this paper proposes a novel semi-supervised short text classification method called the Heterogeneous Graph Contrastive Learning with Adaptive Data Augmentation (HGCLADA). In the knowledge bases guided soft prompt-based data augmentation component, the related words of the tag words are used to optimize the soft prompts for generating diverse augmented samples. In the heterogeneous graph contrastive learning framework component, a heterogeneous graph that is constructed using short texts and keywords and an effective edge augmentation scheme based on a short text clustering algorithm are proposed. The optimized short text embeddings can be obtained to achieve the effective semi-supervised short text classification. Extensive experiments on six benchmark datasets show that our HGCLADA method outperforms four classes of state-of-the-art methods in terms of classification accuracy, especially with significant performance improvements of 8.74% on the TagMyNews dataset when each class only contains 20 labelled data.
引用
收藏
页数:28
相关论文
共 50 条
  • [41] EXPLORING STRUCTURE-ADAPTIVE GRAPH LEARNING FOR ROBUST SEMI-SUPERVISED CLASSIFICATION
    Gao, Xiang
    Hu, Wei
    Guo, Zongming
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [42] Improved Graph Contrastive Learning for Short Text Classification
    Liu, Yonghao
    Huang, Lan
    Giunchiglia, Fausto
    Feng, Xiaoyue
    Guan, Renchu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18716 - 18724
  • [43] FedFAME: A Data Augmentation Free Framework based on Model Contrastive Learning for Federated Semi-Supervised Learning
    Malaviya, Shubham
    Shukla, Manish
    Korat, Pratik
    Lodha, Sachin
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1114 - 1121
  • [44] A Semi-supervised Classification Method of Parasites Using Contrastive Learning
    Ren, Yanni
    Jiang, Hao
    Zhu, Huilin
    Tian, Yanling
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2022, 17 (03) : 445 - 453
  • [45] Semi-supervised hybrid contrastive learning for PolSAR image classification
    Hua, Wenqiang
    Sun, Nan
    Liu, Lin
    Ding, Chen
    Dong, Yizhuo
    Sun, Wei
    KNOWLEDGE-BASED SYSTEMS, 2025, 311
  • [46] Semi-Supervised Contrastive Learning for Time Series Classification in Healthcare
    Liu, Xiaofeng
    Liu, Zhihong
    Li, Jie
    Zhang, Xiang
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 318 - 331
  • [47] Enhanced Graph Neural Network with Multi-Task Learning and Data Augmentation for Semi-Supervised Node Classification
    Fan, Cheng
    Wang, Buhong
    Wang, Zhen
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (12)
  • [48] SEMI-SUPERVISED LEARNING FOR TEXT CLASSIFICATION BY LAYER PARTITIONING
    Li, Alexander Hanbo
    Sethy, Abhinav
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6164 - 6168
  • [49] AAGCN: An adaptive data augmentation for graph contrastive learning
    Qin, Peng
    Lu, Yaochun
    Chen, Weifu
    Li, Defang
    Feng, Guocan
    PATTERN RECOGNITION, 2025, 163
  • [50] Adaptive Graph Constrained NMF for Semi-Supervised Learning
    Li, Qian
    Jing, Liping
    Yu, Jian
    PARTIALLY SUPERVISED LEARNING, PSL 2013, 2013, 8193 : 36 - 48