Automatic Speech Transcription for Low-Resource Languages - The Case of Yoloxfochitl Mixtec (Mexico)

被引:4
|
作者
Mitral, Vikramjit [1 ]
Katholl, Andreas [1 ]
Amith, Jonathan D. [2 ]
Castillo Garcia, Rey [3 ]
机构
[1] SRI Int, Speech Technol & Res Lab, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
[2] Gettysburg Coll, Gettysburg, PA 17325 USA
[3] Secretaria Educ Publ, Chilpancingo De Los Brav, State Of Guerre, Mexico
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
基金
美国国家科学基金会;
关键词
automatic speech recognition; endangered languages; large vocabulary continuous speech recognition; articulatory features; tonal features; acoustic-phonetic features; convolutional neural networks; RECOGNITION; FEATURES;
D O I
10.21437/Interspeech.2016-546
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The rate at which endangered languages can be documented has been highly constrained by human factors. Although digital recording of natural speech in endangered languages may proceed at a fairly robust pace, transcription of this material is not only time consuming but severely limited by the lack of native-speaker personnel proficient in the orthography of their mother tongue. Our NSF-funded project in the Documenting Endangered Languages (DEL) program proposes to tackle this problem from two sides: first via a tool that helps native speakers become proficient in the orthographic conventions of their language, and second by using automatic speech recognition (ASR) output that assists in the transcription effort for newly recorded audio data. In the present study, we focus exclusively on progress in developing speech recognition for the language of interest, Yoloxochitl Mixtec (YM), an Oto-Manguean language spoken by fewer than 5000 speakers on the Pacific coast of Guerrero, Mexico. In particular, we present results from an initial set of experiments and discuss future directions through which better and more robust acoustic models for endangered languages with limited resources can be created.
引用
收藏
页码:3076 / 3080
页数:5
相关论文
共 50 条
  • [1] AUTOMATIC RATING OF SPONTANEOUS SPEECH FOR LOW-RESOURCE LANGUAGES
    Al-Ghezi, Ragheb
    Getman, Yaroslav
    Voskoboinik, Ekaterina
    Singh, Mittul
    Kurimo, Mikko
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 339 - 345
  • [2] Speech recognition datasets for low-resource Congolese languages
    Kimanuka, Ussen
    Maina, Ciira wa
    Buyuk, Osman
    DATA IN BRIEF, 2024, 52
  • [3] OpenASR21: The Second Open Challenge for Automatic Speech Recognition of Low-Resource Languages
    Peterson, Kay
    Tong, Audrey
    Yu, Yan
    INTERSPEECH 2022, 2022, : 4895 - 4899
  • [4] OpenASR20: An Open Challenge for Automatic Speech Recognition of Conversational Telephone Speech in Low-Resource Languages
    Peterson, Kay
    Tong, Audrey
    Yu, Yan
    INTERSPEECH 2021, 2021, : 4324 - 4328
  • [5] Leveraging translations for speech transcription in low-resource settings
    Anastasopoulos, Antonios
    Chiang, David
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1279 - 1283
  • [6] Evaluating Phonemic Transcription of Low-Resource Tonal Languages for Language Documentation
    Adams, Oliver
    Cohn, Trevor
    Neubig, Graham
    Cruz, Hilaria
    Bird, Steven
    Michaud, Alexis
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 3356 - 3365
  • [7] The Usefulness of Imperfect Speech Data for ASR Development in Low-Resource Languages
    Badenhorst, Jaco
    de Wet, Febe
    INFORMATION, 2019, 10 (09)
  • [8] Low-resource automatic speech recognition and error analyses of oral cancer speech
    Halpern, Bence Mark
    Feng, Siyuan
    van Son, Rob
    van den Brekel, Michiel
    Scharenborg, Odette
    SPEECH COMMUNICATION, 2022, 141 : 14 - 27
  • [9] An overview of high-resource automatic speech recognition methods and their empirical evaluation in low-resource environments
    Fatehi, Kavan
    Torres, Mercedes
    Kucukyilmaz, Ayse
    SPEECH COMMUNICATION, 2025, 167
  • [10] Importance of Signal Processing Cues in Transcription Correction for Low-Resource Indian Languages
    Prakash, Jeena J.
    Rajan, Golda Brunet
    Murthy, Hema A.
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (01)