Robust scientific text classification using prompt tuning based on data augmentation with L2 regularization

被引：8

作者：

Shi, Shijun ^{[1
]}

Hu, Kai ^{[1
]}

Xie, Jie ^{[2
,3
]}

Guo, Ya ^{[1
]}

Wu, Huayi ^{[4
]}

机构：

[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China

[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China

[3] Nanjing Normal Univ, Sch Artificial Intelligence, Nanjing 210023, Peoples R China

[4] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China

来源：

INFORMATION PROCESSING & MANAGEMENT | 2024年 / 61卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Scientific text classification; Pre-training model; Prompt tuning; Data augmentation; Pairwise training; L2; regularization;

D O I：

10.1016/j.ipm.2023.103531

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently, the prompt tuning technique, which incorporates prompts into the input of the pretraining language model (like BERT, GPT), has shown promise in improving the performance of language models when facing limited annotated data. However, the equivalence of template semantics in learning is not related to the effect of prompts and the prompt tuning often exhibits unstable performance, which is more severe in the domain of the scientific domain. To address this challenge, we propose to enhance prompt tuning using data augmentation with L2 regularization. Namely, pairing-wise training for the pair of the original and transformed data is performed. Our experiments on two scientific text datasets (ACL-ARC and SciCite) demonstrate that our proposed method significantly improves both accuracy and robustness. By using 1000 samples out of 1688 in the ACL-ARC training set, our method achieved an F1 score 3.33% higher than the same model trained on all 1688-sample data. In the SciCite dataset, our method surpassed the same model with labeled data reduced by over 93%. Our method is also proved to have high robustness, reaching F1 scores from 1% to 8% higher than those models without our method after the Probability Weighted Word Saliency attack.

引用

页数：19

共 50 条

[21] Semi-supervised Short Text Classification Based On Dual-channel Data Augmentation
Li, Jiajun
Li, Peipei
Hu, Xuegang
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[22] A blind deconvolution method based on L1/L2 regularization priors in the gradient space
Cai, Ying
Shi, Yu
Hua, Xia
MIPPR 2017: MULTISPECTRAL IMAGE ACQUISITION, PROCESSING, AND ANALYSIS, 2018, 10607
[23] L2 Norm-Based Control Regularization for Solving Optimal Control Problems
Taheri, Ehsan
Li, Nan
IEEE ACCESS, 2023, 11 : 125959 - 125971
[24] ROBUST SPEAKER VERIFICATION USING POPULATION-BASED DATA AUGMENTATION
Lin, Weiwei
Mak, Man-Wai
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7642 - 7646
[25] A two-stage balancing strategy based on data augmentation for imbalanced text sentiment classification
Pang, Zhicheng
Li, Hong
Wang, Chiyu
Shi, Jiawen
Zhou, Jiale
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (05) : 10073 - 10086
[26] Robust Underwater Fish Classification Based on Data Augmentation by Adding Noises in Random Local Regions
Wei, Guanqun
Wei, Zhiqiang
Huang, Lei
Nie, Jie
Chang, Huanhuan
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 509 - 518
[27] Efficient Color Image Segmentation via Quaternion-based L1/L2 Regularization
Wu, Tingting
Mao, Zhihui
Li, Zeyu
Zeng, Yonghua
Zeng, Tieyong
JOURNAL OF SCIENTIFIC COMPUTING, 2022, 93 (01)
[28] Classification of Osteoporosis in the Lumbar Vertebrae using L2 Regularized Neural Network based on PHOG Features
Patil, Kavita Avinash
Prashanth, K. V. Mahendra
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (04) : 413 - 423
[29] Saliency-Based Token Swap - A Language-Agnostic Data Augmentation Method for Text Classification
Ilangeshwaran, Hiroshan
Abeywardhana, Lakmini
Rathnayake, Samadhi
2024 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY RESEARCH, ICITR, 2024,
[30] Enhancing statistical performance of data-driven controller tuning via L2-regularization
Formentin, Simone
Karimi, Alireza
AUTOMATICA, 2014, 50 (05) : 1514 - 1520

← 1 2 3 4 5 →