Towards Multilingual Automatic Open-Domain Dialogue Evaluation

被引:0
|
作者
Mendonca, John [1 ,2 ,3 ]
Lavie, Alon [3 ,4 ]
Trancoso, Isabel [1 ,2 ]
机构
[1] INESC ID, Lisbon, Portugal
[2] Univ Lisbon, Inst Super Tecn, Lisbon, Portugal
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Phrase, Pittsburgh, PA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main limiting factor in the development of robust multilingual open-domain dialogue evaluation metrics is the lack of multilingual data and the limited availability of open-sourced multilingual dialogue systems. In this work, we propose a workaround for this lack of data by leveraging a strong multilingual pretrained encoder-based Language Model and augmenting existing English dialogue data using Machine Translation. We empirically show that the naive approach of finetuning a pretrained multilingual encoder model with translated data is insufficient to outperform the strong baseline of finetuning a multilingual model with only source data. Instead, the best approach consists in the careful curation of translated data using MT Quality Estimation metrics, excluding low quality translations that hinder its performance.
引用
收藏
页码:130 / 141
页数:12
相关论文
共 50 条
  • [1] Towards Holistic and Automatic Evaluation of Open-Domain Dialogue Generation
    Pang, Bo
    Nijkamp, Erik
    Han, Wenjuan
    Zhou, Linqi
    Liu, Yixian
    Tu, Kewei
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3619 - 3629
  • [2] xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
    Zhang, Chen
    D'Haro, Luis Fernando
    Tang, Chengguang
    Shi, Ke
    Tang, Guohua
    Li, Haizhou
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 5579 - 5601
  • [3] An Automatic Evaluation Method for Open-domain Dialogue Based on BLEURT
    Wu, Shih-Hung
    Lee, Jia-Jun
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2022), 2022, : 83 - 89
  • [4] Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems
    Ghazarian, Sarik
    Weischedel, Ralph
    Galstyan, Aram
    Peng, Nanyun
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7789 - 7796
  • [5] ACCENT: An Automatic Event Commonsense Evaluation Metric for Open-Domain Dialogue Systems
    Ghazarian, Sarik
    Shao, Yijia
    Han, Rujun
    Galstyan, Aram
    Peng, Nanyun
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4398 - 4419
  • [6] PONE: A Novel Automatic Evaluation Metric for Open-domain Generative Dialogue Systems
    Lan, Tian
    Mao, Xian-Ling
    Wei, Wei
    Gao, Xiaoyan
    Huang, Heyan
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2020, 39 (01)
  • [7] Adversarial Evaluation for Open-Domain Dialogue Generation
    Bruni, Elia
    Fernandez, Raquel
    18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), 2017, : 284 - 288
  • [8] vBLEu: Uncertainty-Aware Automatic Evaluation Method for Open-Domain Dialogue Systems
    Tsuta, Yuma
    Yoshinaga, Naoki
    Toyoda, Masashi
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): STUDENT RESEARCH WORKSHOP, 2020, : 199 - 206
  • [9] Enhancing the Open-Domain Dialogue Evaluation in Latent Space
    Chan, Zhangming
    Liu, Lemao
    Li, Juntao
    Zhang, Haisong
    Zhao, Dongyan
    Shi, Shuming
    Yan, Rui
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4889 - 4900
  • [10] RADE: Reference-Assisted Dialogue Evaluation for Open-Domain Dialogue
    Shi, Zhengliang
    Sun, Weiwei
    Zhang, Shuo
    Zhang, Zhen
    Ren, Pengjie
    Ren, Zhaochun
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 12856 - 12875