Contrastive learning based on linguistic knowledge and adaptive augmentation for text classification

被引:0
|
作者
Zhang, Shaokang [1 ]
Ran, Ning [2 ]
机构
[1] Hebei Univ, Sch Cyber Secur & Comp, Baoding, Peoples R China
[2] Hebei Univ, Coll Elect & Informat Engn, Baoding, Peoples R China
基金
中国国家自然科学基金;
关键词
Text classification; Contrastive learning; Linguistic knowledge; Adaptive data augmentation; REPRESENTATION;
D O I
10.1016/j.knosys.2024.112189
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pre-trained language models based on contrastive learning have shown to be effective in text classification. Despite its great success, contrastive learning still has some limitations. First, external linguistic knowledge has been shown to improve the performance of pre-trained language models, but how to use it in contrastive learning is still unclear. Second, general contrastive learning generates training samples with fixed data augmentation during the whole training period, while different augmentation methods are suitable for different downstream tasks. Fixed data augmentation can lead to suboptimal settings. In this paper, we propose contrastive learning based on linguistic knowledge and adaptive augmentation, which can obtain high-quality sentence representations to improve the performance of text classification. Specifically, we construct wordlevel positive and negative sample pairs by WordNet and propose a novel word-level contrastive learning function to inject linguistic knowledge. Then we dynamically select the augmentation policy by alignment and uniformity. This adaptive augmentation policy can acquire more generalized sentence representations with little computational overhead. Experiments on multiple public datasets demonstrate that our method outperforms state-of-the-art methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] COPHTC: CONTRASTIVE LEARNING WITH PROMPT TUNING FOR HIERARCHICAL TEXT CLASSIFICATION
    Cai, Fuhan
    Zhang, Zhongqiang
    Liu, Duo
    Fang, Xiangzhong
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 5400 - 5404
  • [32] Hierarchical contrastive learning for multi-label text classification
    Wei Zhang
    Yun Jiang
    Yun Fang
    Shuai Pan
    Scientific Reports, 15 (1)
  • [33] TFD-GCL: Telecommunications Fraud Detection Based on Graph Contrastive Learning with Adaptive Augmentation
    Cui, Xiaohui (xcui@whu.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc.
  • [34] Unlocking the Potential of Data Augmentation in Contrastive Learning for Hyperspectral Image Classification
    Li, Jinhui
    Li, Xiaorun
    Yan, Yunfeng
    REMOTE SENSING, 2023, 15 (12)
  • [35] Adaptive Augmentation and Neighbor Contrastive Learning for Multi-Behavior Recommendation
    Wu, Xia
    Wang, Shaoqing
    Zhang, Yao
    WEB AND BIG DATA, APWEB-WAIM 2024, PT II, 2024, 14962 : 18 - 32
  • [36] A Graph Contrastive Learning Framework with Adaptive Augmentation and Encoding for Unaligned Views
    Guo, Yifu
    Liu, Yong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT II, 2023, 13936 : 92 - 104
  • [37] Heterogeneous Graph Contrastive Learning with Dual Aggregation Scheme and Adaptive Augmentation
    Xie, Yingjie
    Yan, Qi
    Zhou, Cangqi
    Zhang, Jing
    Hu, Dianming
    WEB AND BIG DATA, PT IV, APWEB-WAIM 2023, 2024, 14334 : 124 - 138
  • [38] Text Classification Method Based on Machine Learning and Domain Knowledge Ontology
    Gao, Zhiyong
    Qiao, Shuhan
    Liang, Yongquan
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON MODELING, SIMULATION AND OPTIMIZATION TECHNOLOGIES AND APPLICATIONS (MSOTA2016), 2016, 58 : 344 - 347
  • [39] ADCL: An attention feature enhancement network based on adversarial contrastive learning for short text classification
    Su, Shun
    Shao, Dangguo
    Ma, Lei
    Yi, Sanli
    Yang, Ziwei
    ADVANCED ENGINEERING INFORMATICS, 2025, 65
  • [40] Contrastive learning from label distribution: A case study on text classification
    Qian, Tao
    Li, Fei
    Zhang, Meishan
    Jin, Guonian
    Fan, Ping
    Dai, Wenhua
    NEUROCOMPUTING, 2022, 507 : 208 - 220