Few-Shot Text Classification with an Efficient Prompt Tuning Method in Meta-Learning Framework

Cited by: 1
Authors
Lv, Xiaobao [1, 2]
Affiliations
[1] Southeast Univ, Sch Comp Sci & Engn, 2 Southeast Univ Rd, Nanjing, Jiangsu, Peoples R China
[2] Zhongke Shuguang Nanjing Res Inst Co Ltd, 519 Chengxin Rd, Nanjing, Jiangsu, Peoples R China
Keywords
Few-shot learning; meta-learning; prompt tuning; text classification; pre-trained language model
DOI
10.1142/S0218001424510066
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Meta-learning is a prevalent framework for few-shot learning, but its effectiveness depends on substantial data being available during meta-training. Recent work addressed this limitation by combining prompt tuning with the meta-learning paradigm, achieving state-of-the-art performance on four benchmarks (FewRel, HuffPost, Reuters and Amazon). However, the implementation efficiency of that method leaves room for improvement, which matters especially when tuning larger language models. To this end, we introduce a faster prompt tuning approach within the meta-learning framework. The approach normalizes the label information and the sample information and uses a regression method to obtain a closed-form solution for each few-shot task. This doubles inference speed while raising average accuracy by 1.7% to 3.0% on the same benchmarks. It is also more stable when meta-training data is limited, which makes it more applicable to the many real scenarios where parallel data is rare. The source code is available to reproduce the results (http://github.com/Dr-Lv/EMPT).
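The abstract's central idea, solving each few-shot task in closed form by regression over normalized representations, can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository linked above for that). It assumes ridge regression on one-hot labels over L2-normalized sample embeddings, solved in its dual form. The function names (`l2_normalize`, `solve`, `fit_ridge`, `predict`), the regularization strength `lam`, and the toy 2-D embeddings are all hypothetical choices made for illustration.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def solve(A, B):
    """Solve A X = B by Gauss-Jordan elimination with partial pivoting."""
    n = len(A)
    M = [list(A[i]) + list(B[i]) for i in range(n)]  # augmented matrix [A | B]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        p = M[col][col]
        M[col] = [x / p for x in M[col]]
        for r in range(n):
            if r != col:
                f = M[r][col]
                M[r] = [a - f * b for a, b in zip(M[r], M[col])]
    return [row[n:] for row in M]


def fit_ridge(support, labels, num_classes, lam=0.1):
    """Closed-form ridge regression for one few-shot task (dual form).

    Computes A = (X X^T + lam * I)^{-1} Y, where X holds the normalized
    support embeddings and Y the one-hot labels. With lam > 0 the system
    matrix is positive definite, so the solve never hits a zero pivot.
    """
    X = [l2_normalize(x) for x in support]
    n = len(X)
    K = [[sum(a * b for a, b in zip(X[i], X[j])) + (lam if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    Y = [[1.0 if labels[i] == c else 0.0 for c in range(num_classes)]
         for i in range(n)]
    return X, solve(K, Y)


def predict(model, query):
    """Score a query against every class and return the best class index."""
    X, A = model
    q = l2_normalize(query)
    k = [sum(a * b for a, b in zip(q, x)) for x in X]  # similarities to support set
    num_classes = len(A[0])
    scores = [sum(k[i] * A[i][c] for i in range(len(X)))
              for c in range(num_classes)]
    return max(range(num_classes), key=lambda c: scores[c])


# Toy 2-way 2-shot task with made-up 2-D embeddings.
support = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
labels = [0, 0, 1, 1]
model = fit_ridge(support, labels, num_classes=2)
print(predict(model, [0.8, 0.2]), predict(model, [0.2, 0.8]))
```

Because the per-task solution comes from a single linear solve rather than iterative gradient steps, adapting to a new task is cheap, which is consistent with the inference speedup the abstract reports. How the paper builds its label and sample representations from the prompt-tuned language model is not described in this record.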
Pages: 18