Frontier Research on Low-Resource Speech Recognition Technology

被引:3
|
作者
Slam, Wushour [1 ]
Li, Yanan [1 ]
Urouvas, Nurmamet [1 ]
机构
[1] Xinjiang Univ, Coll Informat Sci & Engn, Xinjiang Lab Multilanguage Informat Technol, Xinjiang Multilingual Informat Technol Res Ctr, Urumqi 830046, Peoples R China
关键词
low-resource speech recognition; deep feature extraction; acoustic models; resource expansion; COVARIANCE MATRICES; SPEAKER ADAPTATION; DATA AUGMENTATION; NEURAL-NETWORKS; FEATURES; SYSTEM; ASR; LANGUAGES; LEXICONS; IMPROVE;
D O I
10.3390/s23229096
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
With the development of continuous speech recognition technology, users have put forward higher requirements in terms of speech recognition accuracy. Low-resource speech recognition, as a typical speech recognition technology under restricted conditions, has become a research hotspot nowadays because of its low recognition rate and great application value. Under the premise of low-resource speech recognition technology, this paper reviews the research status of feature extraction and acoustic models, and conducts research on resource expansion. Especially in terms of the technical challenges faced by this technology, solutions are proposed, and future research directions are prospected.
引用
收藏
页数:47
相关论文
共 50 条
  • [31] Embedding Articulatory Constraints for Low-resource Speech Recognition Based on Large Pre-trained Model
    Lee, Jaeyoung
    Mimura, Masato
    Kawahara, Tatsuya
    INTERSPEECH 2023, 2023, : 1394 - 1398
  • [32] Advanced Convolutional Neural Network-Based Hybrid Acoustic Models for Low-Resource Speech Recognition
    Fantaye, Tessfu Geteye
    Yu, Junqing
    Hailu, Tulu Tilahun
    COMPUTERS, 2020, 9 (02)
  • [33] Fast and Efficient Multilingual Self-Supervised Pre-training for Low-Resource Speech Recognition
    Zhang, Zhilong
    Wang, Wei
    Qian, Yanmin
    INTERSPEECH 2023, 2023, : 2248 - 2252
  • [34] DISTRIBUTION AUGMENTATION FOR LOW-RESOURCE EXPRESSIVE TEXT-TO-SPEECH
    Lajszczak, Mateusz
    Prasad, Animesh
    van Korlaar, Arent
    Bollepalli, Bajibabu
    Bonafonte, Antonio
    Joly, Arnaud
    Nicolis, Marco
    Moinet, Alexis
    Drugman, Thomas
    Wood, Trevor
    Sokolova, Elena
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8307 - 8311
  • [35] Cross-Lingual Self-training to Learn Multilingual Representation for Low-Resource Speech Recognition
    Zhang, Zi-Qiang
    Song, Yan
    Wu, Ming-Hui
    Fang, Xin
    McLoughlin, Ian
    Dai, Li-Rong
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (12) : 6827 - 6843
  • [36] End-to-End Speech Recognition with Deep Fusion: Leveraging External Language Models for Low-Resource Scenarios
    Zhang, Lusheng
    Wu, Shie
    Wang, Zhongxun
    ELECTRONICS, 2025, 14 (04):
  • [37] Introducing the Urdu-Sindhi Speech Emotion Corpus: A Novel Dataset of Speech Recordings for Emotion Recognition for Two Low-Resource Languages
    Syed, Zafi Sherhan
    Memon, Sajjad Ali
    Shah, Muhammad Shehram
    Syed, Abbas Shah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 805 - 810
  • [38] LOW-RESOURCE EXPRESSIVE TEXT-TO-SPEECH USING DATA AUGMENTATION
    Huybrechts, Goeric
    Merritt, Thomas
    Comini, Giulia
    Perz, Bartek
    Shah, Raahil
    Lorenzo-Trueba, Jaime
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6593 - 6597
  • [39] LOW-RESOURCE LANGUAGE IDENTIFICATION FROM SPEECH USING TRANSFER LEARNING
    Feng, Kexin
    Chaspari, Theodora
    2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
  • [40] Automatic detection and assessment of Alzheimer Disease using speech and language technologies in low-resource scenarios
    Pappagari, Raghavendra
    Cho, Jaejin
    Joshi, Sonal
    Moro-Velazquez, Laureano
    Zelasko, Piotr
    Villalba, Jesus
    Dehak, Najim
    INTERSPEECH 2021, 2021, : 3825 - 3829