Predicting Ratings of Real Dialogue Participants from Artificial Data and Ratings of Human Dialogue Observers

Authors
Georgila, Kallirroi [1]
Gordon, Carla [1]
Yanov, Volodymyr [1]
Traum, David [1]
Affiliations
[1] Univ Southern Calif, Inst Creat Technol, 12015 Waterfront Dr, Los Angeles, CA 90094 USA
Keywords
dialogue evaluation functions; real and simulated dialogues; Internet of Things; user simulation
DOI
Not available
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
We collected a corpus of dialogues in a Wizard of Oz (WOz) setting in the Internet of Things (IoT) domain. We asked users participating in these dialogues to rate the system on a number of aspects, namely, intelligence, naturalness, personality, friendliness, their enjoyment, overall quality, and whether they would recommend the system to others. Then we asked dialogue observers, i.e., Amazon Mechanical Turkers (MTurkers), to rate these dialogues on the same aspects. We also generated simulated dialogues between dialogue policies and simulated users and asked MTurkers to rate them again on the same aspects. Using linear regression, we developed dialogue evaluation functions based on features from the simulated dialogues and the MTurkers' ratings, the WOz dialogues and the MTurkers' ratings, and the WOz dialogues and the WOz participants' ratings. We applied all these dialogue evaluation functions to a held-out portion of our WOz dialogues, and we report results on the predictive power of these different types of dialogue evaluation functions. Our results suggest that for three conversational aspects (intelligence, naturalness, overall quality) just training evaluation functions on simulated data could be sufficient.
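The train-then-test setup in the abstract (fit a linear-regression evaluation function on one set of rated dialogues, then score a held-out set) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the two features (`turns`, `misunderstood`), the synthetic rating formula, and the data itself are invented and are not taken from the paper, whose actual features the abstract does not list.

```python
import random


def fit_linear(X, y):
    """Ordinary least squares via the normal equations.

    Returns [bias, w1, ..., wk] for feature vectors of length k.
    """
    rows = [[1.0] + list(x) for x in X]
    k = len(rows[0])
    # Augmented system (A^T A | A^T y).
    M = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * t for r, t in zip(rows, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        p = max(range(c, k), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        pivot = M[c][c]
        M[c] = [v / pivot for v in M[c]]
        for r in range(k):
            if r != c:
                f = M[r][c]
                M[r] = [a - f * b for a, b in zip(M[r], M[c])]
    return [M[i][k] for i in range(k)]


def predict(w, x):
    """Apply the evaluation function to one dialogue's feature vector."""
    return w[0] + sum(wi * xi for wi, xi in zip(w[1:], x))


def rmse(w, X, y):
    """Root-mean-square error of predicted vs. actual ratings."""
    return (sum((predict(w, x) - t) ** 2 for x, t in zip(X, y)) / len(y)) ** 0.5


rng = random.Random(0)


def make_dialogues(n, noise):
    """Synthetic stand-in for rated dialogues (hypothetical features)."""
    X, y = [], []
    for _ in range(n):
        turns = rng.randint(4, 20)    # hypothetical feature: dialogue length
        misunderstood = rng.random()  # hypothetical feature: error fraction
        rating = 6.0 - 0.05 * turns - 3.0 * misunderstood + rng.gauss(0, noise)
        X.append((turns, misunderstood))
        y.append(rating)
    return X, y


# Train on one set of rated dialogues, evaluate on a held-out set.
train_X, train_y = make_dialogues(200, 0.3)
heldout_X, heldout_y = make_dialogues(50, 0.3)
weights = fit_linear(train_X, train_y)
heldout_rmse = rmse(weights, heldout_X, heldout_y)
print("weights:", [round(w, 3) for w in weights])
print("held-out RMSE:", round(heldout_rmse, 3))
```

In the setting the abstract describes, the training pairs would come from one of the three sources (simulated dialogues with MTurker ratings, WOz dialogues with MTurker ratings, or WOz dialogues with participant ratings), and the held-out error would be compared across the resulting functions, one per rated aspect.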
Pages: 726-734
Number of pages: 9