Towards Multilingual Automatic Open-Domain Dialogue Evaluation

Cited by: 0
Authors
Mendonca, John [1, 2, 3]
Lavie, Alon [3, 4]
Trancoso, Isabel [1, 2]
Affiliations
[1] INESC ID, Lisbon, Portugal
[2] Univ Lisbon, Inst Super Tecn, Lisbon, Portugal
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Phrase, Pittsburgh, PA USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The main limiting factor in the development of robust multilingual open-domain dialogue evaluation metrics is the lack of multilingual data and the limited availability of open-sourced multilingual dialogue systems. In this work, we propose a workaround for this lack of data by leveraging a strong multilingual pretrained encoder-based Language Model and augmenting existing English dialogue data using Machine Translation. We empirically show that the naive approach of finetuning a pretrained multilingual encoder model with translated data is insufficient to outperform the strong baseline of finetuning a multilingual model with only source data. Instead, the best approach consists of carefully curating the translated data using MT Quality Estimation metrics, excluding low-quality translations that hinder the model's performance.
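The curation step described in the abstract can be illustrated with a minimal sketch. It assumes each translated dialogue example already carries a Quality Estimation score where higher means better; the `TranslatedExample` type, the `filter_by_qe` helper, the sample sentences, the scores, and the 0.5 threshold are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class TranslatedExample:
    source: str        # original English utterance
    translation: str   # machine-translated utterance
    qe_score: float    # hypothetical QE score, higher means better quality


def filter_by_qe(examples, min_score):
    """Keep only the translations whose QE score reaches min_score."""
    return [ex for ex in examples if ex.qe_score >= min_score]


# Illustrative data only; the scores are made up for this sketch.
examples = [
    TranslatedExample("How are you?", "Como estás?", 0.91),
    TranslatedExample("I love hiking.", "Eu amo caminhadas.", 0.74),
    TranslatedExample("That's a wrap!", "Isso é um embrulho!", 0.22),
]

kept = filter_by_qe(examples, min_score=0.5)
for ex in kept:
    print(ex.source, "->", ex.translation)
```

Only the examples that pass the threshold would then be added to the finetuning data; how the threshold is chosen, and which QE metric produces the scores, is left open here.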
Pages: 130-141
Page count: 12