Frontier Research on Low-Resource Speech Recognition Technology

被引:3
|
作者
Slam, Wushour [1 ]
Li, Yanan [1 ]
Urouvas, Nurmamet [1 ]
机构
[1] Xinjiang Univ, Coll Informat Sci & Engn, Xinjiang Lab Multilanguage Informat Technol, Xinjiang Multilingual Informat Technol Res Ctr, Urumqi 830046, Peoples R China
关键词
low-resource speech recognition; deep feature extraction; acoustic models; resource expansion; COVARIANCE MATRICES; SPEAKER ADAPTATION; DATA AUGMENTATION; NEURAL-NETWORKS; FEATURES; SYSTEM; ASR; LANGUAGES; LEXICONS; IMPROVE;
D O I
10.3390/s23229096
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
With the development of continuous speech recognition technology, users have put forward higher requirements in terms of speech recognition accuracy. Low-resource speech recognition, as a typical speech recognition technology under restricted conditions, has become a research hotspot nowadays because of its low recognition rate and great application value. Under the premise of low-resource speech recognition technology, this paper reviews the research status of feature extraction and acoustic models, and conducts research on resource expansion. Especially in terms of the technical challenges faced by this technology, solutions are proposed, and future research directions are prospected.
引用
收藏
页数:47
相关论文
共 50 条
  • [21] Using Large Self-Supervised Models for Low-Resource Speech Recognition
    Krishna, D. N.
    Wang, Pinyi
    Bozza, Bruno
    INTERSPEECH 2021, 2021, : 2436 - 2440
  • [22] Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition
    Pineiro-Martin, Andres
    Garcia-Mateo, Carmen
    Docio-Fernandez, Laura
    Del Carmen Lopez-Perez, Maria
    Rehm, Georg
    INTERSPEECH 2024, 2024, : 1235 - 1239
  • [23] MULTILINGUAL MRASTA FEATURES FOR LOW-RESOURCE KEYWORD SEARCH AND SPEECH RECOGNITION SYSTEMS
    Tueske, Zoltan
    Nolden, David
    Schlueter, Ralf
    Ney, Hermann
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [24] Transfer Learning from Multi-Lingual Speech Translation Benefits Low-Resource Speech Recognition
    Vanderreydt, Geoffroy
    Remy, Francois
    Demuynck, Kris
    INTERSPEECH 2022, 2022, : 3053 - 3057
  • [25] ANALYZING ASR PRETRAINING FOR LOW-RESOURCE SPEECH-TO-TEXT TRANSLATION
    Stoian, Mihaela C.
    Bansal, Sameer
    Goldwater, Sharon
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7909 - 7913
  • [26] Transfer Ability of Monolingual Wav2vec2.0 for Low-resource Speech Recognition
    Yi, Cheng
    Wang, Jianzong
    Cheng, Ning
    Zhou, Shiyu
    Xu, Bo
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [27] Cross-Language End-to-End Speech Recognition Research Based on Transfer Learning for the Low-Resource Tujia Language
    Yu, Chongchong
    Chen, Yunbing
    Li, Yueqiao
    Kang, Meng
    Xu, Shixuan
    Liu, Xueer
    SYMMETRY-BASEL, 2019, 11 (02):
  • [28] Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition
    Farooq, Muhammad Umar
    Hain, Thomas
    INTERSPEECH 2023, 2023, : 5072 - 5076
  • [29] Multilingual Data Selection For Low Resource Speech Recognition
    Thomas, Samuel
    Audhkhasi, Kartik
    Cui, Jia
    Kingsbury, Brian
    Ramabhadran, Bhuvana
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3853 - 3857
  • [30] Cross-Lingual Self-training to Learn Multilingual Representation for Low-Resource Speech Recognition
    Zi-Qiang Zhang
    Yan Song
    Ming-Hui Wu
    Xin Fang
    Ian McLoughlin
    Li-Rong Dai
    Circuits, Systems, and Signal Processing, 2022, 41 : 6827 - 6843