Flexible speech translation systems

被引:8
|
作者
Schultz, T [1 ]
Black, AW [1 ]
Vogel, S [1 ]
Woszczyna, M [1 ]
机构
[1] Carnegie Mellon Univ, Interact Syst Lab, Pittsburgh, PA 15213 USA
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006年 / 14卷 / 02期
关键词
multilinguality; portability; speech translation; system deployment;
D O I
10.1109/TSA.2005.860768
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech translation research has made significant progress over the years with many high-visibility efforts showing that translation of spontaneously spoken speech from and to diverse languages is possible and applicable in a variety of domains. As language and domains continue to expand, practical concerns such as portability and reconfigurability of speech come into play: system maintenance becomes a key issue and data is never sufficient to cover the changing domains over varying languages. In this paper, we discuss strategies to overcome the limits of today's speech translation systems. In the first part, we describe our layered system architecture that allows for easy component integration, resource sharing across components, comparison of alternative approaches, and the migration toward hybrid desktop/PDA or stand-alone PDA systems. In the second part, we show how flexibility and reconfigurability is implemented by more radically relying on learning approaches and use our English-Thai two-way speech translation system as a concrete example.
引用
收藏
页码:403 / 411
页数:9
相关论文
共 50 条
  • [1] Helping Domain Experts Build Phrasal Speech Translation Systems
    Rayner, Manny
    Armando, Alejandro
    Bouillon, Pierrette
    Ebling, Sarah
    Gerlach, Johanna
    Halimi, Sonia
    Strasly, Irene
    Tsourakis, Nikos
    FUTURE AND EMERGENT TRENDS IN LANGUAGE TECHNOLOGY, FETLT 2015, 2016, 9577 : 41 - 52
  • [2] Automatic Speech Segmentation for Automatic Speech Translation
    Klosowski, Piotr
    Dustor, Adam
    COMPUTER NETWORKS, CN 2013, 2013, 370 : 466 - 475
  • [3] Assessing the Tolerance of Neural Machine Translation Systems Against Speech Recognition Errors
    Ruiz, Nicholas
    Di Gangi, Mattia Antonino
    Bertoldi, Nicola
    Federico, Marcello
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2635 - 2639
  • [4] INTENT TRANSFER IN SPEECH-TO-SPEECH MACHINE TRANSLATION
    Anumanchipalli, Gopala Krishna
    Oliveira, Luis C.
    Black, Alan W.
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 153 - 158
  • [5] ASSESSING EVALUATION METRICS FOR SPEECH-TO-SPEECH TRANSLATION
    Salesky, Elizabeth
    Maeder, Julian
    Klinger, Severin
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 733 - 740
  • [6] A user interface-level integration method for multiple automatic speech translation systems
    Osada, Seiya
    Yamabana, Kiyoshi
    Hanazawa, Ken
    Okumura, Akitoshi
    PACLIC 20: PROCEEDINGS OF THE 20TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2006, : 72 - 79
  • [7] SPEECH-TO-SPEECH TRANSLATION BETWEEN UNTRANSCRIBED UNKNOWN LANGUAGES
    Tjandra, Andros
    Sakti, Sakriani
    Nakamura, Satoshi
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 593 - 600
  • [8] Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation
    Fujita, Tomoki
    Neubig, Graham
    Sakti, Sakriani
    Toda, Tomoki
    Nakamura, Satoshi
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3454 - 3458
  • [9] Improving Speech Translation by Understanding the Speech From Latent Code
    Zhang, Hao
    Si, Nianwen
    Zhang, Wenlin
    Yang, Xukui
    Qu, Dan
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 (1259-1263) : 1259 - 1263
  • [10] Speech translation by confusion network decoding
    Bertoldi, Nicola
    Zens, Richard
    Federico, Marcello
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1297 - +