Towards Fewer Hallucinations in Knowledge-Grounded Dialogue Generation via Augmentative and Contrastive Knowledge-Dialogue

被引:0
作者
Sun, Bin [1 ]
Li, Yitong [2 ]
Mi, Fei [2 ]
Bie, FanHu [3 ]
Li, Yiwei [1 ]
Li, Kan [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[2] Huawei Noahs Ark Lab, Montreal, PQ, Canada
[3] Huawei Technol Ltd, Beijing, Peoples R China
来源
61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2 | 2023年
基金
北京市自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing knowledge-grounded open-domain dialogue generation models often face the hallucination problem, i.e. the dialogue generative model will persist in an inappropriate knowledge and generate responses that inconsistent with the facts. We argue that this problem mainly stems from the polarized optimization objectives and weak knowledge generation ability. To mitigate the hallucination, we take inspiration from human communicating that people will replay euphemistic responses for the unclear or unrecognizable knowledge, and propose an Augmentative and Contrastive Knowledge Dialogue Expansion Framework (ACK-DEF). ACK-DEF constructs the augmentative and contrastive knowledge dialogue samples, which consist of the knowledge of different degrees of errors and the response of manual design, to expand the original training set and smooth the polarized optimization objective that enables models to generate ground-truth with or without gold knowledge. Not only the knowledge, ACK-DEF also provides the tactful responses of manual design corresponding to the incomplete correct knowledge. Experimental results on the Wikipedia of Wizard dataset show that employing the ACK-DEF is effective to alleviate the hallucination problem.
引用
收藏
页码:1741 / 1750
页数:10
相关论文
共 34 条
[1]  
Adolphs Leonard, 2021, PREPRINT
[2]  
Bao SQ, 2022, Arxiv, DOI arXiv:2211.00910
[3]  
Bao SQ, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P85
[4]  
Cai HY, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, P793
[5]  
Chen T, 2020, PR MACH LEARN RES, V119
[6]  
Chen XL, 2020, Arxiv, DOI arXiv:2003.04297
[7]  
Csaky R, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P5650
[8]  
Dinan Emily, 2019, 7 INT C LEARN REPR I
[9]  
Dziri N, 2022, NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, P5271
[10]   Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations [J].
Gopalakrishnan, Karthik ;
Hedayatnia, Behnam ;
Chen, Qinlang ;
Gottardi, Anna ;
Kwatra, Sanjeev ;
Venkatesh, Anu ;
Gabriel, Raefer ;
Hakkani-Tur, Dilek .
INTERSPEECH 2019, 2019, :1891-1895