Language-independent and language-adaptive acoustic modeling for speech recognition

被引:214
|
作者
Schultz, T [1 ]
Waibel, A
机构
[1] Univ Karlsruhe, Interact Syst Labs, D-76131 Karlsruhe, Germany
[2] Carnegie Mellon Univ, Interact Syst Labs, Pittsburgh, PA 15213 USA
关键词
language portability; multilingual acoustic models; large vocabulary continuous speech recognition; polyphone decision tree specialization (PDTS); GlobalPhone;
D O I
10.1016/S0167-6393(00)00094-7
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
With the distribution of speech technology products all over the world, the portability to new target languages becomes a practical concern. As a consequence our research focuses on the question of how to port large vocabulary continuous speech recognition (LVCSR) systems in a fast and efficient way. More specifically we want to estimate acoustic models for a new target language using speech data from varied source languages, but only limited data from the target language. For this purpose, we introduce different methods for multilingual acoustic model combination and a polyphone decision tree specialization procedure. Recognition results using language-dependent, independent and language-adaptive acoustic models are presented and discussed in the framework of our GlobalPhone project which investigates LVCSR systems in 15 languages. (C) 2001 Published by Elsevier Science B.V.
引用
收藏
页码:31 / 51
页数:21
相关论文
共 50 条
  • [1] Domain Generalization for Language-Independent Automatic Speech Recognition
    Gao, Heting
    Ni, Junrui
    Zhang, Yang
    Qian, Kaizhi
    Chang, Shiyu
    Hasegawa-Johnson, Mark
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [2] Speaker-and language-independent speech recognition in mobile communication systems
    Viikki, I
    Kiss, I
    Tian, J
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 5 - 8
  • [3] Language-independent hyperparameter optimization based speech emotion recognition system
    Thakur A.
    Dhull S.K.
    International Journal of Information Technology, 2022, 14 (7) : 3691 - 3699
  • [4] Language-independent computer emotion recognition
    Mitsuyoshi, S
    Ren, FJ
    Proceedings of the Ninth IASTED International Conference on Artificial Intelligence and Soft Computing, 2005, : 417 - 422
  • [5] Investigation of speech-based language-independent possibilities of depression recognition
    Kiss, Gabor
    2022 45TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING, TSP, 2022, : 226 - 229
  • [6] Joint acoustic and language modeling for speech recognition
    Chien, Jen-Tzung
    Chueh, Chuang-Hua
    SPEECH COMMUNICATION, 2010, 52 (03) : 223 - 235
  • [7] Language-Independent Acoustic Biomarkers for Quantifying Speech Impairment in Huntington's Disease
    Fahed, Vitoria S.
    Doheny, Emer P.
    Collazo, Carla
    Krzysztofik, Joanna
    Mann, Elliot
    Morgan-Jones, Philippa
    Mills, Laura
    Drew, Cheney
    Rosser, Anne E.
    Cousins, Rebecca
    Witkowski, Grzegorz
    Cubo, Esther
    Busse, Monica
    Lowery, Madeleine M.
    AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2024, 33 (03) : 1390 - 1405
  • [8] Language-independent acoustic cloning of HTS voices
    Magarinos, Carmen
    Erro, Daniel
    Banga, Eduardo R.
    COMPUTER SPEECH AND LANGUAGE, 2019, 55 : 168 - 186
  • [9] CONFIDENCE INDEX DYNAMIC TIME WARPING FOR LANGUAGE-INDEPENDENT EMBEDDED SPEECH RECOGNITION
    Zhang, Xianglilan
    Sun, Jiping
    Luo, Zhigang
    Li, Ming
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8066 - 8070
  • [10] A Review on Language-Independent Search on Speech and its Applications
    Kulkarni, Sushil Venkatesh
    Pal, Sukomal
    IEEE ACCESS, 2024, 12 : 194182 - 194202