Predicting Ratings of Real Dialogue Participants from Artificial Data and Ratings of Human Dialogue Observers

被引：0

作者：

Georgila, Kallirroi ^{[1
]}

Gordon, Carla ^{[1
]}

Yanov, Volodymyr ^{[1
]}

Traum, David ^{[1
]}

机构：

[1] Univ Southern Calif, Inst Creat Technol, 12015 Waterfront Dr, Los Angeles, CA 90094 USA

来源：

PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020) | 2020年

关键词：

dialogue evaluation functions; real and simulated dialogues; Internet of Things; USER SIMULATION;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

We collected a corpus of dialogues in a Wizard of Oz (WOz) setting in the Internet of Things (IoT) domain. We asked users participating in these dialogues to rate the system on a number of aspects, namely, intelligence, naturalness, personality, friendliness, their enjoyment, overall quality, and whether they would recommend the system to others. Then we asked dialogue observers, i.e., Amazon Mechanical Turkers (MTurkers), to rate these dialogues on the same aspects. We also generated simulated dialogues between dialogue policies and simulated users and asked MTurkers to rate them again on the same aspects. Using linear regression, we developed dialogue evaluation functions based on features from the simulated dialogues and the MTurkers' ratings, the WOz dialogues and the MTurkers' ratings, and the WOz dialogues and the WOz participants' ratings. We applied all these dialogue evaluation functions to a held-out portion of our WOz dialogues, and we report results on the predictive power of these different types of dialogue evaluation functions. Our results suggest that for three conversational aspects (intelligence, naturalness, overall quality) just training evaluation functions on simulated data could be sufficient.

引用

页码：726 / 734

页数：9

共 50 条

[41] Learning from Real Users: Rating Dialogue Success with Neural Networks for Reinforcement Learning in Spoken Dialogue Systems
Su, Pei-Hao
Vandyke, David
Gasic, Milica
Kim, Dongho
Mrksic, Nikola
Wen, Tsung-Hsien
Young, Steve
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2007 - 2011
[42] Analyzing Dialogue Data for Real-World Emotional Speech Classification
Nisimura, Ryuichi
Omae, Souji
Kawahara, Hideki
Irino, Toshio
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1822 - 1825
[43] Present state bias in transition ratings was accurately estimated in simulated and real data
Terluin, Berend
Griffiths, Philip
Trigg, Andrew
Terwee, Caroline B.
Bjorner, Jakob B.
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2022, 143 : 128 - 136
[44] Pricing ESG Equity Ratings and Underlying Data in Listed Real Estate Securities
Brounen, Dirk
Marcato, Gianluca
Op't Veld, Hans
SUSTAINABILITY, 2021, 13 (04) : 1 - 20
[45] Combining accounting data and a structural model for predicting credit ratings: Empirical evidence from European listed firms
Doumpos, Michael
Niklis, Dimitrios
Zopounidis, Constantin
Andriosopoulos, Kostas
JOURNAL OF BANKING & FINANCE, 2015, 50 : 599 - 607
[46] SPOKEN DIALOGUE GRAMMAR INDUCTION FROM CROWDSOURCED DATA
Palogiannidi, Elisavet
Klasinas, Ioannis
Potamianos, Alexandros
Iosif, Elias
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[47] Leveraging Implicit Feedback from Deployment Data in Dialogue
Pane, Richard Yuanzhe
Roller, Stephen
Cho, Kyunghyun
Het, He
Weston, Jason
PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2: SHORT PAPERS, 2024, : 60 - 75
[48] The Real Effects of Ratings Actions: Evidence from Corporate Asset Sales
Bongaerts, Dion
Schlingemann, Frederik
MANAGEMENT SCIENCE, 2024, 70 (03) : 1505 - 1528
[49] Human rights and love of neighbor: theological texts in dialogue with real life
Goncalves Silva, Priscila Alves
HORIZONTE-REVISTA DE ESTUDOS DE TEOLOGIA E CIENCIAS DA RELIGIAO, 2020, 18 (55): : 425 - 431
[50] Approach for Predicting Cracking Deterioration in Sprayed Seals from Subjective Condition Ratings
Hwayyis, Khulood
Hassan, Rayya
Fahey, Michael T.
TRANSPORTATION RESEARCH RECORD, 2021, 2675 (06) : 151 - 164

← 1 2 3 4 5 →