Robust scientific text classification using prompt tuning based on data augmentation with L2 regularization

Cited by: 8
Authors
Shi, Shijun [1]
Hu, Kai [1]
Xie, Jie [2, 3]
Guo, Ya [1]
Wu, Huayi [4]
Affiliations
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China
[3] Nanjing Normal Univ, Sch Artificial Intelligence, Nanjing 210023, Peoples R China
[4] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Scientific text classification; Pre-training model; Prompt tuning; Data augmentation; Pairwise training; L2 regularization
DOI
10.1016/j.ipm.2023.103531
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Prompt tuning, which incorporates prompts into the input of a pre-trained language model (such as BERT or GPT), has recently shown promise for improving model performance when annotated data is limited. However, the semantic equivalence of templates is not reflected in the effect of the prompts, and prompt tuning often exhibits unstable performance, a problem that is more severe in the scientific domain. To address this challenge, we propose to enhance prompt tuning using data augmentation with L2 regularization. Specifically, we perform pairwise training on each pair of original and transformed data. Our experiments on two scientific text datasets (ACL-ARC and SciCite) demonstrate that the proposed method significantly improves both accuracy and robustness. Using 1000 of the 1688 samples in the ACL-ARC training set, our method achieved an F1 score 3.33% higher than the same model trained on all 1688 samples. On the SciCite dataset, our method surpassed the same model while using over 93% less labeled data. Our method also proves highly robust: after the Probability Weighted Word Saliency attack, it reaches F1 scores 1% to 8% higher than models without our method.
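The pairwise training idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact objective, so everything here is an assumption made for illustration: the function names, the weight `lam`, and the choice to apply the L2 term to the squared distance between the two predicted class distributions. It is not the authors' implementation.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of class logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, label):
    """Negative log-likelihood of the gold label."""
    return -math.log(softmax(logits)[label])

def pairwise_l2_loss(logits_orig, logits_aug, label, lam=1.0):
    """Hypothetical pairwise objective (assumed form, not from the paper).

    Classification loss on the original and the augmented input, plus an
    L2 penalty that pulls the two predicted distributions together.
    """
    ce = cross_entropy(logits_orig, label) + cross_entropy(logits_aug, label)
    p, q = softmax(logits_orig), softmax(logits_aug)
    consistency = sum((a - b) ** 2 for a, b in zip(p, q))
    return ce + lam * consistency

# Example: logits for one citation sentence and its augmented copy.
loss = pairwise_l2_loss([2.0, 0.5, -1.0], [1.5, 0.8, -0.7], label=0, lam=1.0)
```

Under this assumed form, the penalty vanishes when the model predicts the same distribution for both inputs, so only disagreement between the original and the transformed sample is penalized.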
Pages: 19
Related papers
50 in total
  • [41] Missing data imputation on biomedical data using deeply learned clustering and L2 regularized regression based on symmetric uncertainty
    Nagarajan, Gayathri
    Babu, L. D. Dhinesh
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 123
  • [42] FRDA: Fingerprint Region based Data Augmentation using explainable AI for FTIR based microplastics classification
    Yan, Xinyu
    Cao, Zhi
    Murphy, Alan
    Ye, Yuhang
    Wang, Xinwu
    Qiao, Yuansong
    SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 896
  • [43] Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis
    Cong-Thanh Do
    Imai, Shuhei
    Doddipatla, Rama
    Hain, Thomas
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 136 - 140
  • [44] The Recovery of Data Flow based on Weighted L1/2-regularization
    Wang Yuanyuan
    Wen Chenglin
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 7415 - 7420
  • [45] Driving Safety Area Classification for Automated Vehicles Based on Data Augmentation Using Generative Models
    Lee, Donghoun
    SUSTAINABILITY, 2024, 16 (11)
  • [46] Improved Kurdish Dialect Classification Using Data Augmentation and ANOVA-Based Feature Selection
    Ghafoor, Karzan J.
    Taher, Sarkhel H.
    Rawf, Karwan M. Hama
    Abdulrahman, Ayub O.
    ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY, 2025, 13 (01): : 94 - 103
  • [47] A Novel Data Augmentation Method Using Style-Based GAN for Robust Pulmonary Nodule Segmentation
    Shi, Haoqi
    Lu, Junguo
    Zhou, Qianjun
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 2486 - 2491
  • [48] Gene Selection in Cancer Classification Using Sparse Logistic Regression with L1/2 Regularization
    Wu, Shengbing
    Jiang, Hongkun
    Shen, Haiwei
    Yang, Ziyi
    APPLIED SCIENCES-BASEL, 2018, 8 (09):
  • [49] Inverse Scattering Using a Joint L1-L2 Norm-Based Regularization
    Shah, Pratik
    Khankhoje, Uday K.
    Moghaddam, Mahta
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2016, 64 (04) : 1373 - 1384
  • [50] Resampling and Data Augmentation For Equines' Behaviour Classification Based on Wearable Sensor Accelerometer Data Using a Convolutional Neural Network
    Eerdekens, Anniek
    Deruyck, Margot
    Fontaine, Jaron
    Martens, Luc
    De Poorter, Eli
    Plets, David
    Joseph, Wout
    2020 INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS (IEEE COINS 2020), 2020, : 168 - 173