Few-shot multi-domain text intent classification with Dynamic Balance Domain Adaptation Meta-learning

被引：4

作者：

Yang, Shun ^{[1
]}

Du, Yajun ^{[1
]}

Liu, Jia ^{[1
]}

Li, Xianyong ^{[1
]}

Chen, Xiaoliang ^{[1
]}

Gao, Hongmei ^{[1
]}

Xie, Chunzhi ^{[1
]}

Li, Yanli ^{[1
]}

机构：

[1] Xihua Univ, Sch Comp & Software Engn, Chengdu 610065, Sichuan, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 255卷

关键词：

Few-shot text intent classification; Few-shot learning; Meta-learning; Domain adaptation; Dynamic balance factor;

D O I：

10.1016/j.eswa.2024.124429

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

User intents are ever-changing, which requires deep learning models to have the ability to classify unknown intents. Meta-learning aims to solve this problem by improving the model's generalization ability to unknown intent. However, learning on a small amount of text can easily lead to overfitting of the model. Domain adaptation can help us train a more robust model. However, most existing methods only focus on global feature alignment and ignore alignment in subdomains. Therefore, in this study, we first consider the case where the model can maintain robustness with a small amount of data and then explore and mine the higher quality transferable features. Based on these ideas, we propose Dynamic Balance Domain Adaptation Meta-learning (DBDAML), which adaptively learns higher quality transferable features in both the global domain and subdomains.(1) At the same time, we define a dynamic balance factor to enable DBDAML to dynamically focus on the global domain and subdomains. This allows the model to give different attention to different domain adaptations and prevents it from overfitting of a domain feature alignment. The dynamic balance factor is estimated by the contribution of different domain discriminators to the loss, which also makes it easy to calculate and accurate. Finally, we use the meta-learning framework to model the entire theoretical idea. Extensive experiments demonstrate that our approach achieves better performance than state-of-the-art baseline methods.

引用

页数：16

共 47 条

[1]

Aghabozorgi M, 2023, PR MACH LEARN RES, V202, P248

[2]

Bao Yujia, 2020, INT C LEARNING REPRE

[3] Transfer learning for raw network traffic detection [J].

Bierbrauer, David A. ;

De Lucia, Michael J. ;

Reddy, Krishna ;

Maxwell, Paul ;

Bastian, Nathaniel D. .

EXPERT SYSTEMS WITH APPLICATIONS, 2023, 211

[4]

Casanueva I, 2020, NLP FOR CONVERSATIONAL AI, P38

[5]

Chai HY, 2023, PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, P2565

[6]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

[7]

Du MN, 2023, 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, P1766

[8]

Fang SM, 2022, AAAI CONF ARTIF INTE, P571

[9]

Finn C, 2017, PR MACH LEARN RES, V70

[10] Balanced and robust unsupervised Open Set Domain Adaptation via joint adversarial alignment and unknown class isolation [J].

Gao, Feng ;

Pi, Dechang ;

Chen, Junfu .

EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238

← 1 2 3 4 5 →