Automatic Speech Transcription for Low-Resource Languages - The Case of Yoloxfochitl Mixtec (Mexico)

被引：4

作者：

Mitral, Vikramjit ^{[1
]}

Katholl, Andreas ^{[1
]}

Amith, Jonathan D. ^{[2
]}

Castillo Garcia, Rey ^{[3
]}

机构：

[1] SRI Int, Speech Technol & Res Lab, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA

[2] Gettysburg Coll, Gettysburg, PA 17325 USA

[3] Secretaria Educ Publ, Chilpancingo De Los Brav, State Of Guerre, Mexico

来源：

17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年

基金：

美国国家科学基金会;

关键词：

automatic speech recognition; endangered languages; large vocabulary continuous speech recognition; articulatory features; tonal features; acoustic-phonetic features; convolutional neural networks; RECOGNITION; FEATURES;

D O I：

10.21437/Interspeech.2016-546

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The rate at which endangered languages can be documented has been highly constrained by human factors. Although digital recording of natural speech in endangered languages may proceed at a fairly robust pace, transcription of this material is not only time consuming but severely limited by the lack of native-speaker personnel proficient in the orthography of their mother tongue. Our NSF-funded project in the Documenting Endangered Languages (DEL) program proposes to tackle this problem from two sides: first via a tool that helps native speakers become proficient in the orthographic conventions of their language, and second by using automatic speech recognition (ASR) output that assists in the transcription effort for newly recorded audio data. In the present study, we focus exclusively on progress in developing speech recognition for the language of interest, Yoloxochitl Mixtec (YM), an Oto-Manguean language spoken by fewer than 5000 speakers on the Pacific coast of Guerrero, Mexico. In particular, we present results from an initial set of experiments and discuss future directions through which better and more robust acoustic models for endangered languages with limited resources can be created.

引用

页码：3076 / 3080

页数：5

共 50 条

[1] AUTOMATIC RATING OF SPONTANEOUS SPEECH FOR LOW-RESOURCE LANGUAGES
Al-Ghezi, Ragheb
Getman, Yaroslav
Voskoboinik, Ekaterina
Singh, Mittul
Kurimo, Mikko
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 339 - 345
[2] Speech recognition datasets for low-resource Congolese languages
Kimanuka, Ussen
Maina, Ciira wa
Buyuk, Osman
DATA IN BRIEF, 2024, 52
[3] OpenASR21: The Second Open Challenge for Automatic Speech Recognition of Low-Resource Languages
Peterson, Kay
Tong, Audrey
Yu, Yan
INTERSPEECH 2022, 2022, : 4895 - 4899
[4] OpenASR20: An Open Challenge for Automatic Speech Recognition of Conversational Telephone Speech in Low-Resource Languages
Peterson, Kay
Tong, Audrey
Yu, Yan
INTERSPEECH 2021, 2021, : 4324 - 4328
[5] Leveraging translations for speech transcription in low-resource settings
Anastasopoulos, Antonios
Chiang, David
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1279 - 1283
[6] Evaluating Phonemic Transcription of Low-Resource Tonal Languages for Language Documentation
Adams, Oliver
Cohn, Trevor
Neubig, Graham
Cruz, Hilaria
Bird, Steven
Michaud, Alexis
PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 3356 - 3365
[7] The Usefulness of Imperfect Speech Data for ASR Development in Low-Resource Languages
Badenhorst, Jaco
de Wet, Febe
INFORMATION, 2019, 10 (09)
[8] Low-resource automatic speech recognition and error analyses of oral cancer speech
Halpern, Bence Mark
Feng, Siyuan
van Son, Rob
van den Brekel, Michiel
Scharenborg, Odette
SPEECH COMMUNICATION, 2022, 141 : 14 - 27
[9] An overview of high-resource automatic speech recognition methods and their empirical evaluation in low-resource environments
Fatehi, Kavan
Torres, Mercedes
Kucukyilmaz, Ayse
SPEECH COMMUNICATION, 2025, 167
[10] Importance of Signal Processing Cues in Transcription Correction for Low-Resource Indian Languages
Prakash, Jeena J.
Rajan, Golda Brunet
Murthy, Hema A.
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (01)

← 1 2 3 4 5 →