Language-independent and language-adaptive acoustic modeling for speech recognition

被引：214

作者：

Schultz, T ^{[1
]}

Waibel, A

机构：

[1] Univ Karlsruhe, Interact Syst Labs, D-76131 Karlsruhe, Germany

[2] Carnegie Mellon Univ, Interact Syst Labs, Pittsburgh, PA 15213 USA

来源：

SPEECH COMMUNICATION | 2001年 / 35卷 / 1-2期

关键词：

language portability; multilingual acoustic models; large vocabulary continuous speech recognition; polyphone decision tree specialization (PDTS); GlobalPhone;

D O I：

10.1016/S0167-6393(00)00094-7

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

With the distribution of speech technology products all over the world, the portability to new target languages becomes a practical concern. As a consequence our research focuses on the question of how to port large vocabulary continuous speech recognition (LVCSR) systems in a fast and efficient way. More specifically we want to estimate acoustic models for a new target language using speech data from varied source languages, but only limited data from the target language. For this purpose, we introduce different methods for multilingual acoustic model combination and a polyphone decision tree specialization procedure. Recognition results using language-dependent, independent and language-adaptive acoustic models are presented and discussed in the framework of our GlobalPhone project which investigates LVCSR systems in 15 languages. (C) 2001 Published by Elsevier Science B.V.

引用

页码：31 / 51

页数：21

共 50 条

[1] Domain Generalization for Language-Independent Automatic Speech Recognition
Gao, Heting
Ni, Junrui
Zhang, Yang
Qian, Kaizhi
Chang, Shiyu
Hasegawa-Johnson, Mark
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
[2] Speaker-and language-independent speech recognition in mobile communication systems
Viikki, I
Kiss, I
Tian, J
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 5 - 8
[3] Language-independent hyperparameter optimization based speech emotion recognition system
Thakur A.
Dhull S.K.
International Journal of Information Technology, 2022, 14 (7) : 3691 - 3699
[4] Language-independent computer emotion recognition
Mitsuyoshi, S
Ren, FJ
Proceedings of the Ninth IASTED International Conference on Artificial Intelligence and Soft Computing, 2005, : 417 - 422
[5] Investigation of speech-based language-independent possibilities of depression recognition
Kiss, Gabor
2022 45TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING, TSP, 2022, : 226 - 229
[6] Joint acoustic and language modeling for speech recognition
Chien, Jen-Tzung
Chueh, Chuang-Hua
SPEECH COMMUNICATION, 2010, 52 (03) : 223 - 235
[7] Language-Independent Acoustic Biomarkers for Quantifying Speech Impairment in Huntington's Disease
Fahed, Vitoria S.
Doheny, Emer P.
Collazo, Carla
Krzysztofik, Joanna
Mann, Elliot
Morgan-Jones, Philippa
Mills, Laura
Drew, Cheney
Rosser, Anne E.
Cousins, Rebecca
Witkowski, Grzegorz
Cubo, Esther
Busse, Monica
Lowery, Madeleine M.
AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2024, 33 (03) : 1390 - 1405
[8] Language-independent acoustic cloning of HTS voices
Magarinos, Carmen
Erro, Daniel
Banga, Eduardo R.
COMPUTER SPEECH AND LANGUAGE, 2019, 55 : 168 - 186
[9] CONFIDENCE INDEX DYNAMIC TIME WARPING FOR LANGUAGE-INDEPENDENT EMBEDDED SPEECH RECOGNITION
Zhang, Xianglilan
Sun, Jiping
Luo, Zhigang
Li, Ming
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8066 - 8070
[10] A Review on Language-Independent Search on Speech and its Applications
Kulkarni, Sushil Venkatesh
Pal, Sukomal
IEEE ACCESS, 2024, 12 : 194182 - 194202

← 1 2 3 4 5 →