Few-shot task-oriented dialogue generation with in-context learning

被引:0
作者
Bi, Zhongqin [1 ]
Hu, Xueni [1 ]
Zhang, Weina [1 ]
Wang, Xiaoyu [1 ]
机构
[1] Shanghai Univ Elect Power, Coll Comp Sci & Technol, Shanghai, Peoples R China
关键词
Few-shot; Dialogue generation; In-context learning; Task-oriented;
D O I
10.1108/IJWIS-01-2025-0007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
PurposeIn the construction of task-oriented dialogue systems, collecting data in specific domains is difficult. This has made the automatic generation of a large amount of dialogue data a highly promising research direction. Traditional methods lack sufficient information diversity, which consequently leads to poor quality of generated dialogues and insufficient annotation accuracy. The purpose of this paper is to propose methods to generate higher-quality dialogues with more accurate annotations under few-shot conditions.Design/methodology/approachThis paper puts forward a dialogue retriever specifically designed for task-oriented dialogue generation. This retriever conducts retrieval based on two retrieval methods, namely, those based on user utterances and system responses. It screens out the training prompts that have a higher degree of fit with the test prompts as examples, thereby strengthening the in-context learning ability of large language models and effectively improving the quality of generated user utterances and system responses. Moreover, this paper proposes an automatic annotation correction mechanism, which overcomes the uncertainty of large language models and enhances the accuracy of annotations. This method requires almost no human involvement and can quickly generate dialogues at minimal cost.FindingsThis paper conducts experiments on two data sets. The results show that compared with previous methods, under the condition of few samples, the method proposed in this paper can generate dialogues with better quality and more accurate annotations.Originality/valueIn this research, this paper has proposed a turn-based dialogue retriever based on user utterances and system responses and designed a new similarity calculation method, as well as an automatic annotation correction mechanism for task-oriented dialogue generation. The calculation results have demonstrated the effectiveness of this method.
引用
收藏
页码:254 / 274
页数:21
相关论文
共 39 条
[1]  
Asri L., 2016, ARXIV
[2]   Chain-of-event prompting for multi-document summarization by large language models [J].
Bao, Songlin ;
Li, Tiantian ;
Cao, Bin .
INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2024, 20 (03) :229-247
[3]  
Black S., 2021, GPT Neo: Large scale autoregressive language modeling with mesh-tensorflow (1.0)python. eleutherai
[4]  
Brown TB, 2020, ADV NEUR IN, V33
[5]  
Chen Xi, 2016, Proceedings of the 30th International Conference on Neural Information Processing Systems, V29
[6]  
Crook P., 2017, SEQUENCE SEQUENCE MO
[7]  
Dong Q., 2023, A survey on in-context learning
[8]  
Eric M., 2020, MULTIWOZ21 CONSOLIDA
[9]  
Gur I., 2018, ARXIV
[10]  
Han T., 2020, MULTIWOZ 23 MULTIDOM