Articulatory Feature based Multilingual MLPs for Low-Resource Speech Recognition

Cited by: 0
Authors
Qian, Yanmin [1]
Liu, Jia [1]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
Keywords
low-resource language; multilayer perceptrons; articulatory features; hierarchical architectures
DOI
Not available
Chinese Library Classification
TP18 [Theory of artificial intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Large vocabulary continuous speech recognition is particularly difficult for low-resource languages. The scenario we focus on here is one in which there is a very limited amount of acoustic training data in the target language, but more plentiful data in other languages. We investigate approaches based on the Automatic Speech Attribute Transcription (ASAT) framework and train universal classifiers on multiple languages to learn articulatory features. A hierarchical architecture is applied at both the articulatory feature and phone levels to make the neural network more discriminative. Finally, we train the multilayer perceptrons (MLPs) using multiple cross-lingual streams and obtain MLPs for this low-resource application. In our experiments, we obtain a significant improvement of about 12% relative over a conventional baseline in this low-resource scenario.
Pages: 2601-2604
Page count: 4