Cross-language Phoneme Mapping for Low-resource Languages: An Exploration of Benefits and Trade-offs

被引:1
|
作者
Chibuye, Nick K. [1 ]
Rosenstock, Todd S. [2 ]
DeRenzi, Brian [1 ]
机构
[1] Univ Cape Town, Rondebosch, South Africa
[2] World Agroforestry Ctr ICRAF, Nairobi, Kenya
来源
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018年
关键词
speech recognition; low-resource languages; human-computer interaction; cross-language phoneme mapping; spoken dialog systems; SALAAM; spoken language processing; nutrition;
D O I
10.21437/Interspeech.2018-2454
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Voice-based systems are an essential approach for engaging directly with low-literate and underrepresented populations. Previous work has taken advantage of high-resource speech recognition technology for low-resource language speech recognition through cross-language phoneme mapping. Unfortunately, there is little guidance in how to deploy these systems across a range of languages. We present a systematic exploration of four source languages and five target languages to understand the trade-offs and performance of different source languages and training techniques. We find that one can improve recognition accuracy by selecting a source language that has similar linguistic properties to that of the target language. We also find that the number of alternative pronunciations per word and gender of participants also impact recognition accuracy. Our work will allow other researchers and practitioners to quickly develop high quality small-vocabulary speech-based applications for underresourced languages.
引用
收藏
页码:2623 / 2627
页数:5
相关论文
共 17 条
  • [1] Evaluating Phonemic Transcription of Low-Resource Tonal Languages for Language Documentation
    Adams, Oliver
    Cohn, Trevor
    Neubig, Graham
    Cruz, Hilaria
    Bird, Steven
    Michaud, Alexis
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 3356 - 3365
  • [2] Multilingual Offensive Language Identification for Low-resource Languages
    Ranasinghe, Tharindu
    Zampieri, Marcos
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (01)
  • [3] Plugging a neural phoneme recognizer into a simple language model: a workflow for low-resource settings
    Guillaume, Severine
    Wisniewski, Guillaume
    Galliot, Benjamin
    Minh-Chau Nguyen
    Fily, Maxime
    Jacques, Guillaume
    Michaud, Alexis
    INTERSPEECH 2022, 2022, : 4905 - 4909
  • [4] Towards Language Service Creation and Customization for Low-Resource Languages
    Lin, Donghui
    Murakami, Yohei
    Ishida, Toru
    INFORMATION, 2020, 11 (02)
  • [5] On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages
    Chen, Fuxiang
    Fard, Fatemeh H.
    Lo, David
    Bryksin, Timofey
    30TH IEEE/ACM INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2022), 2022, : 401 - 412
  • [6] Unsupervised Ranked Cross-Lingual Lexical Substitution for Low-Resource Languages
    Ecker, Stefan
    Horbach, Andrea
    Thater, Stefan
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1709 - 1717
  • [7] Cross-Lingual Transfer of Large Language Model by Visually-Derived Supervision Toward Low-Resource Languages
    Muraoka, Masayasu
    Bhattacharjee, Bishwaranjan
    Merler, Michele
    Blackwood, Graeme
    Li, Yulong
    Zhao, Yang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3637 - 3646
  • [8] Exploring ASR Models in Low-Resource Languages: Use-Case the Macedonian Language
    Bogdanoski, Konstantin
    Mishev, Kostadin
    Simjanoska, Monika
    Trajanov, Dimitar
    DEEP LEARNING THEORY AND APPLICATIONS, DELTA 2023, 2023, 1875 : 254 - 268
  • [9] Cross-lingual offensive speech identification with transfer learning for low-resource languages
    Shi, Xiayang
    Liu, Xinyi
    Xu, Chun
    Huang, Yuanyuan
    Chen, Fang
    Zhu, Shaolin
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 101
  • [10] Deep Persian sentiment analysis: Cross-lingual training for low-resource languages
    Ghasemi, Rouzbeh
    Ashrafi Asli, Seyed Arad
    Momtazi, Saeedeh
    JOURNAL OF INFORMATION SCIENCE, 2022, 48 (04) : 449 - 462