A Comparative Evaluation Methodology for NLG in Interactive Systems

被引:0
|
作者
Hastie, Helen [1 ]
Belz, Anja
机构
[1] Heriot Watt Univ, Edinburgh EH14 4AS, Midlothian, Scotland
关键词
Natural Language Generation; Evaluation Methodologies; Interactive Systems;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Interactive systems have become an increasingly important type of application for deployment of NLG technology over recent years. At present, we do not yet have commonly agreed terminology or methodology for evaluating NLG within interactive systems. In this paper, we take steps towards addressing this gap by presenting a set of principles for designing new evaluations in our comparative evaluation methodology. We start with presenting a categorisation framework, giving an overview of different categories of evaluation measures, in order to provide standard terminology for categorising existing and new evaluation techniques. Background on existing evaluation methodologies for NLG and interactive systems is presented. The comparative evaluation methodology is presented. Finally, a methodology for comparative evaluation of NLG components embedded within interactive systems is presented in terms of the comparative evaluation methodology, using a specific task for illustrative purposes.
引用
收藏
页码:4004 / 4011
页数:8
相关论文
共 50 条
  • [21] Offline Evaluation and Optimization for Interactive Systems
    Li, Lihong
    WSDM'15: PROCEEDINGS OF THE EIGHTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2015, : 413 - 413
  • [22] Interactive Customer Knowledge Management Systems: a Comparative Evaluation of Users' Perception of Trust and Level of Knowledge
    Alotaibi, Mutlaq B.
    Rigas, Dimitrios I.
    PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON E-ACTIVITIES: RECENT ADVANCES IN E-ACTIVITIES, 2008, : 54 - +
  • [23] Empirical evaluation of educational interactive systems
    Francisco José García-Peñalvo
    Lourdes Moreno López
    Mª Cruz Sánchez-Gómez
    Quality & Quantity, 2018, 52 (6) : 2427 - 2434
  • [24] A technique for evaluation of interactive evolutionary systems
    Shackelford, M
    Corne, DW
    ADAPTIVE COMPUTING IN DESIGN AND MANUFACTURE VI, 2004, : 197 - 208
  • [25] An Evaluation Framework for Interactive Recommender Systems
    Alkan, Oznur
    Daly, Elizabeth M.
    Botea, Adi
    ADJUNCT PUBLICATION OF THE 27TH CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION (ACM UMAP '19 ADJUNCT), 2019, : 217 - 218
  • [26] A model for performance evaluation of interactive systems
    Stohr, EA
    Kim, YB
    PROCEEDINGS OF THE THIRTY-FIRST HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOL IV: INTERNET AND THE DIGITAL ECONOMY TRACT, 1998, : 2 - 11
  • [27] Multivariate evaluation of interactive robot systems
    Chien-Ming Huang
    Bilge Mutlu
    Autonomous Robots, 2014, 37 : 335 - 349
  • [28] A METHODOLOGY FOR EVALUATION OF AN INTERACTIVE MULTISPECTRAL IMAGE-PROCESSING SYSTEM
    KOVALICK, WM
    NEWCOMER, JA
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 1987, 53 (08): : 1087 - 1092
  • [29] Cognitive Ergonomic Evaluation Metrics and Methodology for Interactive Information System
    Zhang, Yu
    Sun, Jianhua
    Jiang, Ting
    Yang, Zengyao
    ADVANCES IN ARTIFICIAL INTELLIGENCE, SOFTWARE AND SYSTEMS ENGINEERING, 2020, 965 : 559 - 570
  • [30] TEQ - A METHODOLOGY FOR COMPARATIVE-EVALUATION OF TECHNOLOGIES
    ZAIDMAN, B
    CEVIDALLI, G
    ENGINEERING COSTS AND PRODUCTION ECONOMICS, 1989, 18 (02): : 131 - 138