ASSESSING EVALUATION METRICS FOR SPEECH-TO-SPEECH TRANSLATION

被引:4
|
作者
Salesky, Elizabeth [1 ]
Maeder, Julian [2 ]
Klinger, Severin [2 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Swiss Fed Inst Technol, Zurich, Switzerland
来源
2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU) | 2021年
关键词
evaluation; speech synthesis; speech translation; speech-to-speech; dialects;
D O I
10.1109/ASRU51503.2021.9688073
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech-to-speech translation combines machine translation with speech synthesis, introducing evaluation challenges not present in either task alone. How to automatically evaluate speech-to-speech translation is an open question which has not previously been explored. Translating to speech rather than to text is often motivated by unwritten languages or languages without standardized orthographies. However, we show that the previously used automatic metric for this task is best equipped for standardized high-resource languages only. In this work, we first evaluate current metrics for speech-to-speech translation, and second assess how translation to dialectal variants rather than to standardized languages impacts various evaluation methods.
引用
收藏
页码:733 / 740
页数:8
相关论文
共 50 条
  • [41] Speech translation by confusion network decoding
    Bertoldi, Nicola
    Zens, Richard
    Federico, Marcello
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1297 - +
  • [42] Punctuating Confusion Networks for Speech Translation
    Cattoni, Roldano
    Bertoldi, Nicola
    Federico, Marcello
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2001 - 2004
  • [43] Robust Speech Translation by Domain Adaptation
    He, Xiaodong
    Deng, Li
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2116 - 2119
  • [44] The KIT Lecture Corpus for Speech Translation
    Stueker, Sebastian
    Kraft, Florian
    Mohr, Christian
    Herrmann, Teresa
    Cho, Eunah
    Waibel, Alex
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3409 - 3414
  • [45] ROMANIAN-ENGLISH SPEECH TRANSLATION
    Boros, Tiberiu
    Tufis, Dan
    PROCEEDINGS OF THE ROMANIAN ACADEMY SERIES A-MATHEMATICS PHYSICS TECHNICAL SCIENCES INFORMATION SCIENCE, 2014, 15 (01): : 68 - 75
  • [46] Contextualized Translation of Automatically Segmented Speech
    Gaido, Marco
    Di Gangi, Mattia A.
    Negri, Matteo
    Cettolo, Mauro
    Turchi, Marco
    INTERSPEECH 2020, 2020, : 1471 - 1475
  • [47] Application of speech technology in the translation system
    Zhou, Run
    Xiang, Wei
    Proceedings of the 2016 6th International Conference on Advanced Design and Manufacturing Engineering (ICADME 2016), 2016, 96 : 429 - 433
  • [48] Unsupervised phonetic and word level discovery for speech to speech translation for unwritten languages
    Hillis, Steven
    Kumar, Anushree Prasanna
    Black, Alan W.
    INTERSPEECH 2019, 2019, : 1138 - 1142
  • [49] RAPID INTEGRATION OF PARTS OF SPEECH INFORMATION TO IMPROVE REORDERING MODEL FOR ENGLISH-FARSI SPEECH TO SPEECH TRANSLATION
    Maskey, Sameer
    Zhou, Bowen
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5222 - 5225
  • [50] Improving Automatic Speech Recognition and Speech Translation via Word Embedding Prediction
    Chuang, Shun-Po
    Liu, Alexander H.
    Sung, Tzu-Wei
    Lee, Hung-yi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 93 - 105