Robust scientific text classification using prompt tuning based on data augmentation with L2 regularization

被引:8
|
作者
Shi, Shijun [1 ]
Hu, Kai [1 ]
Xie, Jie [2 ,3 ]
Guo, Ya [1 ]
Wu, Huayi [4 ]
机构
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China
[3] Nanjing Normal Univ, Sch Artificial Intelligence, Nanjing 210023, Peoples R China
[4] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
Scientific text classification; Pre-training model; Prompt tuning; Data augmentation; Pairwise training; L2; regularization;
D O I
10.1016/j.ipm.2023.103531
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, the prompt tuning technique, which incorporates prompts into the input of the pretraining language model (like BERT, GPT), has shown promise in improving the performance of language models when facing limited annotated data. However, the equivalence of template semantics in learning is not related to the effect of prompts and the prompt tuning often exhibits unstable performance, which is more severe in the domain of the scientific domain. To address this challenge, we propose to enhance prompt tuning using data augmentation with L2 regularization. Namely, pairing-wise training for the pair of the original and transformed data is performed. Our experiments on two scientific text datasets (ACL-ARC and SciCite) demonstrate that our proposed method significantly improves both accuracy and robustness. By using 1000 samples out of 1688 in the ACL-ARC training set, our method achieved an F1 score 3.33% higher than the same model trained on all 1688-sample data. In the SciCite dataset, our method surpassed the same model with labeled data reduced by over 93%. Our method is also proved to have high robustness, reaching F1 scores from 1% to 8% higher than those models without our method after the Probability Weighted Word Saliency attack.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Semi-supervised Short Text Classification Based On Dual-channel Data Augmentation
    Li, Jiajun
    Li, Peipei
    Hu, Xuegang
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [22] A blind deconvolution method based on L1/L2 regularization priors in the gradient space
    Cai, Ying
    Shi, Yu
    Hua, Xia
    MIPPR 2017: MULTISPECTRAL IMAGE ACQUISITION, PROCESSING, AND ANALYSIS, 2018, 10607
  • [23] L2 Norm-Based Control Regularization for Solving Optimal Control Problems
    Taheri, Ehsan
    Li, Nan
    IEEE ACCESS, 2023, 11 : 125959 - 125971
  • [24] ROBUST SPEAKER VERIFICATION USING POPULATION-BASED DATA AUGMENTATION
    Lin, Weiwei
    Mak, Man-Wai
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7642 - 7646
  • [25] A two-stage balancing strategy based on data augmentation for imbalanced text sentiment classification
    Pang, Zhicheng
    Li, Hong
    Wang, Chiyu
    Shi, Jiawen
    Zhou, Jiale
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (05) : 10073 - 10086
  • [26] Robust Underwater Fish Classification Based on Data Augmentation by Adding Noises in Random Local Regions
    Wei, Guanqun
    Wei, Zhiqiang
    Huang, Lei
    Nie, Jie
    Chang, Huanhuan
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 509 - 518
  • [27] Efficient Color Image Segmentation via Quaternion-based L1/L2 Regularization
    Wu, Tingting
    Mao, Zhihui
    Li, Zeyu
    Zeng, Yonghua
    Zeng, Tieyong
    JOURNAL OF SCIENTIFIC COMPUTING, 2022, 93 (01)
  • [28] Classification of Osteoporosis in the Lumbar Vertebrae using L2 Regularized Neural Network based on PHOG Features
    Patil, Kavita Avinash
    Prashanth, K. V. Mahendra
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (04) : 413 - 423
  • [29] Saliency-Based Token Swap - A Language-Agnostic Data Augmentation Method for Text Classification
    Ilangeshwaran, Hiroshan
    Abeywardhana, Lakmini
    Rathnayake, Samadhi
    2024 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY RESEARCH, ICITR, 2024,
  • [30] Enhancing statistical performance of data-driven controller tuning via L2-regularization
    Formentin, Simone
    Karimi, Alireza
    AUTOMATICA, 2014, 50 (05) : 1514 - 1520