Interactive Evaluation of Dialog Track at DSTC9

被引：0

作者：

Mehri, Shikib ^{[1
]}

Feng, Yulan ^{[1
]}

Gordon, Carla ^{[2
]}

Alavi, Seyed Hossein ^{[2
]}

Traum, David ^{[2
]}

Eskenazi, Maxine ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA

[2] Univ Southern Calif, Inst Creat Technol, Los Angeles, CA USA

来源：

LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022年

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

The ultimate goal of dialog research is to develop systems that can be effectively used in interactive settings by real users. To this end, we introduced the Interactive Evaluation of Dialog Track at the 9th Dialog System Technology Challenge. This track consisted of two sub-tasks. The first sub-task involved building knowledge-grounded response generation models. The second sub-task aimed to extend dialog models beyond static datasets by assessing them in an interactive setting with real users. Our track challenges participants to develop strong response generation models and explore strategies that extend them to back-and-forth interactions with real users. The progression from static corpora to interactive evaluation introduces unique challenges and facilitates a more thorough assessment of open-domain dialog systems. This paper provides an overview of the track, including the methodology and results. Furthermore, it provides insights into how to best evaluate open-domain dialog models.

引用

页码：5731 / 5738

页数：8

共 50 条

[1] Overview of the Ninth Dialog System Technology Challenge: DSTC9
Gunasekara, Chulaka
Kim, Seokhwan
D'Haro, Luis Fernando
Rastogi, Abhinav
Chen, Yun-Nung
Eric, Mihail
Hedayatnia, Behnam
Gopalakrishnan, Karthik
Liu, Yang
Huang, Chao-Wei
Hakkani-Tur, Dilek
Li, Jinchao
Zhu, Qi
Luo, Lingxiao
Liden, Lars
Huang, Kaili
Shayandeh, Shahin
Liang, Runze
Peng, Baolin
Zhang, Zheng
Shukla, Swadheen
Huang, Minlie
Gao, Jianfeng
Mehri, Shikib
Feng, Yulan
Gordon, Carla
Alavi, Seyed Hossein
Traum, David
Eskenazi, Maxine
Beirami, Ahmad
Cho, Eunjoon
Crook, Paul A.
De, Ankita
Geramifard, Alborz
Kottur, Satwik
Moon, Seungwhan
Poddar, Shivani
Subba, Rajen
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4066 - 4076
[2] Task-Oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC10
Thulke, David
Daheim, Nico
Dugast, Christian
Ney, Hermann
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 733 - 741
[3] Unsupervised Evaluation of Interactive Dialog with DialoGPT
Mehri, Shikib
Eskenazi, Maxine
SIGDIAL 2020: 21ST ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2020), 2020, : 225 - 235
[4] Speech Aware Dialog System Technology Challenge (DSTC11)
Soltau, Hagen
Shafran, Izhak
Wang, Mingqiu
Rastogi, Abhinav
Zhao, Jeffrey
Jia, Ye
Han, Wei
Cao, Yuan
Miranda, Aramys
INTERSPEECH 2023, 2023, : 4668 - 4672
[5] Overview of the seventh Dialog System Technology Challenge: DSTC7
Fernando D'Haro, Luis
Yoshino, Koichiro
Hori, Chiori
Marks, Tim K.
Polymenakos, Lazaros
Kummerfeld, Jonathan K.
Galley, Michel
Gao, Xiang
COMPUTER SPEECH AND LANGUAGE, 2020, 62
[6] Overview of the sixth dialog system technology challenge: DSTC6
Horia, Chiori
Perez, Julien
Higashinakac, Ryuichiro
Horia, Takaaki
Boureau, Y-Lan
Inabae, Michimasa
Tsunomorif, Yuiko
Takahashig, Tetsuro
Yoshinoh, Koichiro
Kim, Seokhwan
COMPUTER SPEECH AND LANGUAGE, 2019, 55 : 1 - 25
[7] Overview of the Eighth Dialog System Technology Challenge: DSTC8
Kim, Seokhwan
Galley, Michel
Gunasekara, Chulaka
Lee, Sungjin
Atkinson, Adam
Peng, Baolin
Schulz, Hannes
Gao, Jianfeng
Li, Jinchao
Adada, Mahmoud
Huang, Minlie
Lastras, Luis
Kummerfeld, Jonathan K.
Lasecki, Walter S.
Hori, Chiori
Cherian, Anoop
Marks, Tim K.
Rastogi, Abhinav
Zang, Xiaoxue
Sunkara, Srinivas
Gupta, Raghav
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 2529 - 2540
[8] Overview of the Tenth Dialog System Technology Challenge: DSTC10
Yoshino, Koichiro
Chen, Yun-Nung
Crook, Paul
Kottur, Satwik
Li, Jinchao
Hedayatnia, Behnam
Moon, Seungwhan
Fei, Zhengcong
Li, Zekang
Zhang, Jinchao
Feng, Yang
Zhou, Jie
Kim, Seokhwan
Liu, Yang
Jin, Di
Papangelis, Alexandros
Gopalakrishnan, Karthik
Hakkani-Tur, Dilek
Damavandi, Babak
Geramifard, Alborz
Hori, Chiori
Shah, Ankit
Zhang, Chen
Li, Haizhou
Sedoc, Joao
D'Haro, Luis F.
Banchs, Rafael
Rudnicky, Alexander
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 765 - 778
[9] Evaluation of distance interactive learning in obstetrics and gynaecology (DIALOG)
Jha, V
Duffy, S
McAleer, S
BJOG-AN INTERNATIONAL JOURNAL OF OBSTETRICS AND GYNAECOLOGY, 2002, 109 (04) : 456 - 461
[10] Interactive visual dialog
Arbel, T
Ferrie, FP
IMAGE AND VISION COMPUTING, 2002, 20 (9-10) : 639 - 646

← 1 2 3 4 5 →