Interactive Evaluation of Dialog Track at DSTC9

被引:0
|
作者
Mehri, Shikib [1 ]
Feng, Yulan [1 ]
Gordon, Carla [2 ]
Alavi, Seyed Hossein [2 ]
Traum, David [2 ]
Eskenazi, Maxine [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
[2] Univ Southern Calif, Inst Creat Technol, Los Angeles, CA USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The ultimate goal of dialog research is to develop systems that can be effectively used in interactive settings by real users. To this end, we introduced the Interactive Evaluation of Dialog Track at the 9th Dialog System Technology Challenge. This track consisted of two sub-tasks. The first sub-task involved building knowledge-grounded response generation models. The second sub-task aimed to extend dialog models beyond static datasets by assessing them in an interactive setting with real users. Our track challenges participants to develop strong response generation models and explore strategies that extend them to back-and-forth interactions with real users. The progression from static corpora to interactive evaluation introduces unique challenges and facilitates a more thorough assessment of open-domain dialog systems. This paper provides an overview of the track, including the methodology and results. Furthermore, it provides insights into how to best evaluate open-domain dialog models.
引用
收藏
页码:5731 / 5738
页数:8
相关论文
共 50 条
  • [1] Overview of the Ninth Dialog System Technology Challenge: DSTC9
    Gunasekara, Chulaka
    Kim, Seokhwan
    D'Haro, Luis Fernando
    Rastogi, Abhinav
    Chen, Yun-Nung
    Eric, Mihail
    Hedayatnia, Behnam
    Gopalakrishnan, Karthik
    Liu, Yang
    Huang, Chao-Wei
    Hakkani-Tur, Dilek
    Li, Jinchao
    Zhu, Qi
    Luo, Lingxiao
    Liden, Lars
    Huang, Kaili
    Shayandeh, Shahin
    Liang, Runze
    Peng, Baolin
    Zhang, Zheng
    Shukla, Swadheen
    Huang, Minlie
    Gao, Jianfeng
    Mehri, Shikib
    Feng, Yulan
    Gordon, Carla
    Alavi, Seyed Hossein
    Traum, David
    Eskenazi, Maxine
    Beirami, Ahmad
    Cho, Eunjoon
    Crook, Paul A.
    De, Ankita
    Geramifard, Alborz
    Kottur, Satwik
    Moon, Seungwhan
    Poddar, Shivani
    Subba, Rajen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4066 - 4076
  • [2] Task-Oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC10
    Thulke, David
    Daheim, Nico
    Dugast, Christian
    Ney, Hermann
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 733 - 741
  • [3] Unsupervised Evaluation of Interactive Dialog with DialoGPT
    Mehri, Shikib
    Eskenazi, Maxine
    SIGDIAL 2020: 21ST ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2020), 2020, : 225 - 235
  • [4] Speech Aware Dialog System Technology Challenge (DSTC11)
    Soltau, Hagen
    Shafran, Izhak
    Wang, Mingqiu
    Rastogi, Abhinav
    Zhao, Jeffrey
    Jia, Ye
    Han, Wei
    Cao, Yuan
    Miranda, Aramys
    INTERSPEECH 2023, 2023, : 4668 - 4672
  • [5] Overview of the seventh Dialog System Technology Challenge: DSTC7
    Fernando D'Haro, Luis
    Yoshino, Koichiro
    Hori, Chiori
    Marks, Tim K.
    Polymenakos, Lazaros
    Kummerfeld, Jonathan K.
    Galley, Michel
    Gao, Xiang
    COMPUTER SPEECH AND LANGUAGE, 2020, 62
  • [6] Overview of the sixth dialog system technology challenge: DSTC6
    Horia, Chiori
    Perez, Julien
    Higashinakac, Ryuichiro
    Horia, Takaaki
    Boureau, Y-Lan
    Inabae, Michimasa
    Tsunomorif, Yuiko
    Takahashig, Tetsuro
    Yoshinoh, Koichiro
    Kim, Seokhwan
    COMPUTER SPEECH AND LANGUAGE, 2019, 55 : 1 - 25
  • [7] Overview of the Eighth Dialog System Technology Challenge: DSTC8
    Kim, Seokhwan
    Galley, Michel
    Gunasekara, Chulaka
    Lee, Sungjin
    Atkinson, Adam
    Peng, Baolin
    Schulz, Hannes
    Gao, Jianfeng
    Li, Jinchao
    Adada, Mahmoud
    Huang, Minlie
    Lastras, Luis
    Kummerfeld, Jonathan K.
    Lasecki, Walter S.
    Hori, Chiori
    Cherian, Anoop
    Marks, Tim K.
    Rastogi, Abhinav
    Zang, Xiaoxue
    Sunkara, Srinivas
    Gupta, Raghav
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 2529 - 2540
  • [8] Overview of the Tenth Dialog System Technology Challenge: DSTC10
    Yoshino, Koichiro
    Chen, Yun-Nung
    Crook, Paul
    Kottur, Satwik
    Li, Jinchao
    Hedayatnia, Behnam
    Moon, Seungwhan
    Fei, Zhengcong
    Li, Zekang
    Zhang, Jinchao
    Feng, Yang
    Zhou, Jie
    Kim, Seokhwan
    Liu, Yang
    Jin, Di
    Papangelis, Alexandros
    Gopalakrishnan, Karthik
    Hakkani-Tur, Dilek
    Damavandi, Babak
    Geramifard, Alborz
    Hori, Chiori
    Shah, Ankit
    Zhang, Chen
    Li, Haizhou
    Sedoc, Joao
    D'Haro, Luis F.
    Banchs, Rafael
    Rudnicky, Alexander
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 765 - 778
  • [9] Evaluation of distance interactive learning in obstetrics and gynaecology (DIALOG)
    Jha, V
    Duffy, S
    McAleer, S
    BJOG-AN INTERNATIONAL JOURNAL OF OBSTETRICS AND GYNAECOLOGY, 2002, 109 (04) : 456 - 461
  • [10] Interactive visual dialog
    Arbel, T
    Ferrie, FP
    IMAGE AND VISION COMPUTING, 2002, 20 (9-10) : 639 - 646