Unsupervised Evaluation of Interactive Dialog with DialoGPT

被引:0
|
作者
Mehri, Shikib [1 ]
Eskenazi, Maxine [1 ]
机构
[1] Carnegie Mellon Univ, Dialog Res Ctr, Language Technol Inst, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is important to define meaningful and interpretable automatic evaluation metrics for open-domain dialog research. Standard language generation metrics have been shown to be ineffective for dialog. This paper introduces the FED metric (fine-grained evaluation of dialog), an automatic evaluation metric which uses DialoGPT, without any fine-tuning or supervision. It also introduces the FED dataset which is constructed by annotating a set of human-system and human-human conversations with eighteen fine-grained dialog qualities. The FED metric (1) does not rely on a ground-truth response, (2) does not require training data and (3) measures fine-grained dialog qualities at both the turn and whole dialog levels. FED attains moderate to strong correlation with human judgement at both levels.
引用
收藏
页码:225 / 235
页数:11
相关论文
共 50 条
  • [31] A FRAMEWORK FOR UNSUPERVISED TRANSFER LEARNING AND APPLICATION TO DIALOG DECISION CLASSIFICATION
    Marcheret, Etienne
    Deshmukh, Om D.
    Goel, Vaibhava
    Navratil, Jiri
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 1981 - 1984
  • [32] Unsupervised Slot Schema Induction for Task-oriented Dialog
    Yu, Dian
    Wang, Mingqiu
    Cao, Yuan
    Shafran, Izhak
    El Shafey, Laurent
    Soltau, Hagen
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 1174 - 1193
  • [33] Unsupervised Enrichment of Persona-grounded Dialog with Background Stories
    Majumder, Bodhisattwa Prasad
    Berg-Kirkpatrick, Taylor
    McAuley, Julian
    Jhamtani, Harsh
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 585 - 592
  • [34] ENVIRONMENTAL RELATIONS COMMITTEE - A MODEL FOR INTERACTIVE COMMUNITY DIALOG
    TOMBOULIAN, P
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1992, 204 : 7 - CEI
  • [35] VR interactive dialog system with verbal and nonverbal communication
    Uchino, Shunji
    Abe, Norihiro
    Tabuchi, Yoshihiro
    Taki, Hirokazu
    He, Shouji
    ARTIFICIAL LIFE AND ROBOTICS, 2009, 13 (02) : 512 - 516
  • [36] INTERACTIVE AMIABILITY - DIALOG, ORBIT, AND BRS UNDER SCRUTINY
    SHUMAN, BA
    PROCEEDINGS OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1983, 20 : 136 - 138
  • [37] Multimodal Dialog Systems for Interactive In-Car Applications
    Wahlster, Wolfgang
    Mueller, Christian
    AT-AUTOMATISIERUNGSTECHNIK, 2013, 61 (11) : 777 - 783
  • [38] HIERARCHICAL DIALOG STRUCTURES IN INTERACTIVE COMPUTER-SYSTEMS
    APPERLEY, MD
    SPENCE, R
    SOFTWARE-PRACTICE & EXPERIENCE, 1983, 13 (09): : 777 - 790
  • [39] VAL: Interactive Task Learning with GPT Dialog Parsing
    Lawley, Lane
    MacLellan, Christopher J.
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2024, 2024,
  • [40] WORD USAGE IN INTERACTIVE DIALOG WITH RESTRICTED AND UNRESTRICTED VOCABULARIES
    MICHAELIS, PR
    CHAPANIS, A
    WEEKS, GD
    KELLY, MJ
    IEEE TRANSACTIONS ON PROFESSIONAL COMMUNICATION, 1977, 20 (04) : 214 - 221