Flexible speech translation systems

被引：8

作者：

Schultz, T ^{[1
]}

Black, AW ^{[1
]}

Vogel, S ^{[1
]}

Woszczyna, M ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Interact Syst Lab, Pittsburgh, PA 15213 USA

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006年 / 14卷 / 02期

关键词：

multilinguality; portability; speech translation; system deployment;

D O I：

10.1109/TSA.2005.860768

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Speech translation research has made significant progress over the years with many high-visibility efforts showing that translation of spontaneously spoken speech from and to diverse languages is possible and applicable in a variety of domains. As language and domains continue to expand, practical concerns such as portability and reconfigurability of speech come into play: system maintenance becomes a key issue and data is never sufficient to cover the changing domains over varying languages. In this paper, we discuss strategies to overcome the limits of today's speech translation systems. In the first part, we describe our layered system architecture that allows for easy component integration, resource sharing across components, comparison of alternative approaches, and the migration toward hybrid desktop/PDA or stand-alone PDA systems. In the second part, we show how flexibility and reconfigurability is implemented by more radically relying on learning approaches and use our English-Thai two-way speech translation system as a concrete example.

引用

页码：403 / 411

页数：9

共 50 条

[1] Helping Domain Experts Build Phrasal Speech Translation Systems
Rayner, Manny
Armando, Alejandro
Bouillon, Pierrette
Ebling, Sarah
Gerlach, Johanna
Halimi, Sonia
Strasly, Irene
Tsourakis, Nikos
FUTURE AND EMERGENT TRENDS IN LANGUAGE TECHNOLOGY, FETLT 2015, 2016, 9577 : 41 - 52
[2] Automatic Speech Segmentation for Automatic Speech Translation
Klosowski, Piotr
Dustor, Adam
COMPUTER NETWORKS, CN 2013, 2013, 370 : 466 - 475
[3] Assessing the Tolerance of Neural Machine Translation Systems Against Speech Recognition Errors
Ruiz, Nicholas
Di Gangi, Mattia Antonino
Bertoldi, Nicola
Federico, Marcello
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2635 - 2639
[4] INTENT TRANSFER IN SPEECH-TO-SPEECH MACHINE TRANSLATION
Anumanchipalli, Gopala Krishna
Oliveira, Luis C.
Black, Alan W.
2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 153 - 158
[5] ASSESSING EVALUATION METRICS FOR SPEECH-TO-SPEECH TRANSLATION
Salesky, Elizabeth
Maeder, Julian
Klinger, Severin
2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 733 - 740
[6] A user interface-level integration method for multiple automatic speech translation systems
Osada, Seiya
Yamabana, Kiyoshi
Hanazawa, Ken
Okumura, Akitoshi
PACLIC 20: PROCEEDINGS OF THE 20TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2006, : 72 - 79
[7] SPEECH-TO-SPEECH TRANSLATION BETWEEN UNTRANSCRIBED UNKNOWN LANGUAGES
Tjandra, Andros
Sakti, Sakriani
Nakamura, Satoshi
2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 593 - 600
[8] Simple, Lexicalized Choice of Translation Timing for Simultaneous Speech Translation
Fujita, Tomoki
Neubig, Graham
Sakti, Sakriani
Toda, Tomoki
Nakamura, Satoshi
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3454 - 3458
[9] Improving Speech Translation by Understanding the Speech From Latent Code
Zhang, Hao
Si, Nianwen
Zhang, Wenlin
Yang, Xukui
Qu, Dan
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 (1259-1263) : 1259 - 1263
[10] Speech translation by confusion network decoding
Bertoldi, Nicola
Zens, Richard
Federico, Marcello
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1297 - +

← 1 2 3 4 5 →