Human Annotated Dialogues Dataset for Natural Conversational Agents

被引:14
作者
Merdivan, Erinc [1 ,2 ]
Singh, Deepika [1 ,3 ]
Hanke, Sten [4 ]
Kropf, Johannes [1 ]
Holzinger, Andreas [3 ]
Geist, Matthieu [5 ]
机构
[1] AIT Austrian Inst Technol, A-2700 Wiener Neustadt, Austria
[2] Univ Lorraine, CNRS, LORIA, Cent Supelec, F-57000 Metz, France
[3] Med Univ Graz, Inst Med Informat Stat, HCI KDD, Holzinger Grp, A-8036 Graz, Austria
[4] FH Joanneum Gesell mbH, A-8020 Graz, Austria
[5] Univ Lorraine, CNRS, LIEC, F-57000 Metz, France
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 03期
关键词
conversational agents; dialogue systems; chatbots;
D O I
10.3390/app10030762
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Conversational agents are gaining huge popularity in industrial applications such as digital assistants, chatbots, and particularly systems for natural language understanding (NLU). However, a major drawback is the unavailability of a common metric to evaluate the replies against human judgement for conversational agents. In this paper, we develop a benchmark dataset with human annotations and diverse replies that can be used to develop such metric for conversational agents. The paper introduces a high-quality human annotated movie dialogue dataset, HUMOD, that is developed from the Cornell movie dialogues dataset. This new dataset comprises 28,500 human responses from 9500 multi-turn dialogue history-reply pairs. Human responses include: (i) ratings of the dialogue reply in relevance to the dialogue history; and (ii) unique dialogue replies for each dialogue history from the users. Such unique dialogue replies enable researchers in evaluating their models against six unique human responses for each given history. Detailed analysis on how dialogues are structured and human perception on dialogue score in comparison with existing models are also presented.
引用
收藏
页数:16
相关论文
共 41 条
[1]  
[Anonymous], 2002, P 40 ANN M ASS COMP
[2]  
[Anonymous], 2002, Lecture Notes in Computer Science
[3]  
[Anonymous], 2016, ARXIV160308023
[4]  
[Anonymous], P INT C LANG RES EV
[5]  
[Anonymous], 2017, ACM SIGKDD EXPLOR NE
[6]  
[Anonymous], 2012, P ACL 2012 SYST DEM
[7]  
[Anonymous], P INT C LEARN REPR I
[8]  
[Anonymous], LANGUAGES FORMAL NAT
[9]  
[Anonymous], 2017, P INT C LEARN REPR I
[10]  
[Anonymous], P 34 ANN M ASS COMP