Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language

被引:57
|
作者
Le, Viet-Bac [1 ]
Besacier, Laurent [1 ]
机构
[1] Univ Grenoble 1, LIG Lab, CNRS, UMR 5217, F-38041 Grenoble 9, France
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2009年 / 17卷 / 08期
关键词
Crosslingual acoustic modeling; grapheme-based acoustic modeling; lattice decomposition and combination; speech recognition; under-resourced languages;
D O I
10.1109/TASL.2009.2021723
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents our work in automatic speech recognition (ASR) in the context of under-resourced languages with application to Vietnamese. Different techniques for bootstrapping acoustic models are presented. First, we present the use of acoustic-phonetic unit distances and the potential of crosslingual acoustic modeling for under-resourced languages. Experimental results on Vietnamese showed that with only a few hours of target language speech data, crosslingual context independent modeling worked better than crosslingual context dependent modeling. However, it was outperformed by the latter one, when more speech data were available. We concluded, therefore, that in both cases, crosslingual systems are better than monolingual baseline systems. The proposal of grapheme-based acoustic modeling, which avoids building a phonetic dictionary, is also investigated in our work. Finally, since the use of sub-word units (morphemes, syllables, characters, etc.) can reduce the high out-of-vocabulary rate and improve the lack of text resources in statistical language modeling for under-resourced languages, we propose several methods to decompose, normalize and combine word and sub-word lattices generated from different ASR systems. The proposed lattice combination scheme results in a relative syllable error rate reduction of 6.6% over the sentence MAP baseline method for a Vietnamese ASR task.
引用
收藏
页码:1471 / 1482
页数:12
相关论文
共 50 条
  • [1] Automatic speech recognition for under-resourced languages: A survey
    Besacier, Laurent
    Barnard, Etienne
    Karpov, Alexey
    Schultz, Tanja
    SPEECH COMMUNICATION, 2014, 56 : 85 - 100
  • [2] Automatic Speech Recognition for an Under-Resourced Language - Amharic
    Abate, Solomon Teferra
    Menzel, Wolfgang
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 973 - 976
  • [3] A Review on Speech Recognition for Under-Resourced Languages: A Case Study of Vietnamese
    Phung, Trung-Nghia
    Nguyen, Duc-Binh
    Pham, Ngoc-Phuong
    INTERNATIONAL JOURNAL OF KNOWLEDGE AND SYSTEMS SCIENCE, 2024, 15 (01)
  • [4] Automatic Speech Recognition for an Under-Resourced Language - Amharic
    Abate, Solomon Teferra
    Menzel, Wolfgang
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1737 - 1740
  • [5] Modeling under-resourced languages for speech recognition
    Kurimo, Mikko
    Enarvi, Seppo
    Tilk, Ottokar
    Varjokallio, Matti
    Mansikkaniemi, Andre
    Alumae, Tanel
    LANGUAGE RESOURCES AND EVALUATION, 2017, 51 (04) : 961 - 987
  • [6] Modeling under-resourced languages for speech recognition
    Mikko Kurimo
    Seppo Enarvi
    Ottokar Tilk
    Matti Varjokallio
    André Mansikkaniemi
    Tanel Alumäe
    Language Resources and Evaluation, 2017, 51 : 961 - 987
  • [7] Language Modeling for Speech Analytics in Under-Resourced Languages
    Wills, Simone
    Uys, Pieter
    van Heerden, Charl
    Barnard, Etienne
    INTERSPEECH 2020, 2020, : 4941 - 4945
  • [8] Automating the Creation of Speech Recognition Systems for Under-Resourced Languages
    Khusainov, Aidar
    Suleymanov, Dzhavdet
    2015 FOURTEENTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (MICAI), 2015, : 28 - 32
  • [9] Speech recognition of under-resourced languages using mismatched transcriptions
    Do, Van Hai
    Chen, Nancy F.
    Lim, Boon Pang
    Hasegawa-Johnson, Mark
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 112 - 115
  • [10] Automatic processing of under-resourced languages
    Bernhard, Delphine
    Soria, Claudia
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2018, 59 (03): : 7 - 14