Frontier Research on Low-Resource Speech Recognition Technology

被引：3

作者：

Slam, Wushour ^{[1
]}

Li, Yanan ^{[1
]}

Urouvas, Nurmamet ^{[1
]}

机构：

[1] Xinjiang Univ, Coll Informat Sci & Engn, Xinjiang Lab Multilanguage Informat Technol, Xinjiang Multilingual Informat Technol Res Ctr, Urumqi 830046, Peoples R China

来源：

SENSORS | 2023年 / 23卷 / 22期

关键词：

low-resource speech recognition; deep feature extraction; acoustic models; resource expansion; COVARIANCE MATRICES; SPEAKER ADAPTATION; DATA AUGMENTATION; NEURAL-NETWORKS; FEATURES; SYSTEM; ASR; LANGUAGES; LEXICONS; IMPROVE;

D O I：

10.3390/s23229096

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

With the development of continuous speech recognition technology, users have put forward higher requirements in terms of speech recognition accuracy. Low-resource speech recognition, as a typical speech recognition technology under restricted conditions, has become a research hotspot nowadays because of its low recognition rate and great application value. Under the premise of low-resource speech recognition technology, this paper reviews the research status of feature extraction and acoustic models, and conducts research on resource expansion. Especially in terms of the technical challenges faced by this technology, solutions are proposed, and future research directions are prospected.

引用

页数：47

共 50 条

[31] Embedding Articulatory Constraints for Low-resource Speech Recognition Based on Large Pre-trained Model
Lee, Jaeyoung
Mimura, Masato
Kawahara, Tatsuya
INTERSPEECH 2023, 2023, : 1394 - 1398
[32] Advanced Convolutional Neural Network-Based Hybrid Acoustic Models for Low-Resource Speech Recognition
Fantaye, Tessfu Geteye
Yu, Junqing
Hailu, Tulu Tilahun
COMPUTERS, 2020, 9 (02)
[33] Fast and Efficient Multilingual Self-Supervised Pre-training for Low-Resource Speech Recognition
Zhang, Zhilong
Wang, Wei
Qian, Yanmin
INTERSPEECH 2023, 2023, : 2248 - 2252
[34] DISTRIBUTION AUGMENTATION FOR LOW-RESOURCE EXPRESSIVE TEXT-TO-SPEECH
Lajszczak, Mateusz
Prasad, Animesh
van Korlaar, Arent
Bollepalli, Bajibabu
Bonafonte, Antonio
Joly, Arnaud
Nicolis, Marco
Moinet, Alexis
Drugman, Thomas
Wood, Trevor
Sokolova, Elena
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8307 - 8311
[35] Cross-Lingual Self-training to Learn Multilingual Representation for Low-Resource Speech Recognition
Zhang, Zi-Qiang
Song, Yan
Wu, Ming-Hui
Fang, Xin
McLoughlin, Ian
Dai, Li-Rong
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (12) : 6827 - 6843
[36] End-to-End Speech Recognition with Deep Fusion: Leveraging External Language Models for Low-Resource Scenarios
Zhang, Lusheng
Wu, Shie
Wang, Zhongxun
ELECTRONICS, 2025, 14 (04):
[37] Introducing the Urdu-Sindhi Speech Emotion Corpus: A Novel Dataset of Speech Recordings for Emotion Recognition for Two Low-Resource Languages
Syed, Zafi Sherhan
Memon, Sajjad Ali
Shah, Muhammad Shehram
Syed, Abbas Shah
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 805 - 810
[38] LOW-RESOURCE EXPRESSIVE TEXT-TO-SPEECH USING DATA AUGMENTATION
Huybrechts, Goeric
Merritt, Thomas
Comini, Giulia
Perz, Bartek
Shah, Raahil
Lorenzo-Trueba, Jaime
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6593 - 6597
[39] LOW-RESOURCE LANGUAGE IDENTIFICATION FROM SPEECH USING TRANSFER LEARNING
Feng, Kexin
Chaspari, Theodora
2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
[40] Automatic detection and assessment of Alzheimer Disease using speech and language technologies in low-resource scenarios
Pappagari, Raghavendra
Cho, Jaejin
Joshi, Sonal
Moro-Velazquez, Laureano
Zelasko, Piotr
Villalba, Jesus
Dehak, Najim
INTERSPEECH 2021, 2021, : 3825 - 3829

← 1 2 3 4 5 →