Articulatory Feature based Multilingual MLPs for Low-Resource Speech Recognition

被引：0

作者：

Qian, Yanmin ^{[1
]}

Liu, Jia ^{[1
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

low-resource language; multilayer perceptrons; articulatory features; hierarchical architectures;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Large vocabulary continuous speech recognition is particularly difficult for low-resource languages. In the scenario we focus on here is that there is a very limited amount of acoustic training data in the target language, but more plentiful data in other languages. In our approach, we investigate approaches based on Automatic Speech Attribute Transcription (ASAT) framework, and train universal classifiers using multi-languages to learn articulatory features. A hierarchical architecture is applied on both the articulatory feature and phone level, to make the neural network more discriminative. Finally we train the multilayer perceptrons using multi-streams from cross-languages and obtain MLPs for this low-resource application. In our experiments, we get significant improvements of about 12% relative versus a conventional baseline in this low-resource scenario.

引用

页码：2601 / 2604

页数：4

共 50 条

[31] Low-resource Multilingual Neural Translation Using Linguistic Feature-based Relevance Mechanisms
Chakrabarty, Abhisek
Dabre, Raj
Ding, Chenchen
Utiyama, Masao
Sumita, Eiichiro
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (07)
[32] Web Data Selection Based on Word Embedding for Low-Resource Speech Recognition
Xie, Chuandong
Guo, Wu
Hu, Guoping
Liu, Junhua
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1340 - 1344
[33] Acoustic Modeling Based on Deep Learning for Low-Resource Speech Recognition: An Overview
Yu, Chongchong
Kang, Meng
Chen, Yunbing
Wu, Jiajia
Zhao, Xia
IEEE ACCESS, 2020, 8 : 163829 - 163843
[34] MULTILINGUAL REPRESENTATIONS FOR LOW RESOURCE SPEECH RECOGNITION AND KEYWORD SEARCH
Cui, Jia
Kingsbury, Brian
Ramabhadran, Bhuvana
Sethy, Abhinav
Audhkhasi, Kartik
Cui, Xiaodong
Kislal, Ellen
Mangu, Lidia
Nussbaum-Thom, Markus
Picheny, Michael
Tueske, Zoltan
Golik, Pavel
Schlueter, Ralf
Ney, Hermann
Gales, Mark J. F.
Knill, Kate M.
Ragni, Anton
Wang, Haipeng
Woodland, Phil
2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 259 - 266
[35] Adaptive Activation Network for Low Resource Multilingual Speech Recognition
Luo, Jian
Wang, Jianzong
Cheng, Ning
Zheng, Zhenpeng
Xiao, Jing
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[36] CML-TTS: A Multilingual Dataset for Speech Synthesis in Low-Resource Languages
Oliveira, Frederico S.
Casanova, Edresson
Candido, Arnaldo, Jr.
Soares, Anderson S.
Galva Filho, Arlindo R.
TEXT, SPEECH, AND DIALOGUE, TSD 2023, 2023, 14102 : 188 - 199
[37] Efficient neural speech synthesis for low-resource languages through multilingual modeling
de Korte, Marcel
Kim, Jaebok
Klabbers, Esther
INTERSPEECH 2020, 2020, : 2967 - 2971
[38] Systems for Low-Resource Speech Recognition Tasks in Open Automatic Speech Recognition and Formosa Speech Recognition Challenges
Lin, Hung-Pang
Zhang, Yu-Jia
Chen, Chia-Ping
INTERSPEECH 2021, 2021, : 4339 - 4343
[39] Convolutional Maxout Neural Networks for Low-Resource Speech Recognition
Cai, Meng
Shi, Yongzhe
Kang, Jian
Liu, Jia
Su, Tengrong
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 133 - +
[40] Low-resource Sinhala Speech Recognition using Deep Learning
Karunathilaka, Hirunika
Welgama, Viraj
Nadungodage, Thilini
Weerasinghe, Ruvan
2020 20TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER-2020), 2020, : 196 - 201

← 1 2 3 4 5 →