A comprehensive survey on automatic speech recognition using neural networks

被引:28
作者
Dhanjal, Amandeep Singh [1 ]
Singh, Williamjeet [2 ]
机构
[1] Punjabi Univ, Dept Comp Sci, Rajpura Rd, Patiala 147001, Punjab, India
[2] Punjabi Univ, Dept Comp Sci & Engn, Rajpura Rd, Patiala 147001, Punjab, India
关键词
Speech recognition; Dataset; Tools; Neural network; Deep learning; ARABIC SPEECH; SYSTEM; NOISE; HMM; ARCHITECTURES; SEGMENTATION; PRIMER;
D O I
10.1007/s11042-023-16438-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The continuous development in Automatic Speech Recognition has grown and demonstrated its enormous potential in Human Interaction Communication systems. It is quite a challenging task to achieve high accuracy due to several parameters such as different dialects, spontaneous speech, speaker's enrolment, computation power, dataset, and noisy environment that decrease the performance of the speech recognition system. It has motivated various researchers to make innovative contributions to the development of a robust speech recognition system. The study presents a systematic analysis of current state-of-the-art research work done in this field during 2015-2021. The prime focus of the study is to highlight the neural network-based speech recognition techniques, datasets, toolkits, and evaluation metrics utilized in the past seven years. It also synthesizes the evidence from past studies to provide empirical solutions for accuracy improvement. This study highlights the current status of speech recognition systems using neural networks and provides a brief knowledge to the new researchers.
引用
收藏
页码:23367 / 23412
页数:46
相关论文
共 50 条
[31]   A survey of technologies for automatic Dysarthric speech recognition [J].
Qian, Zhaopeng ;
Xiao, Kejing ;
Yu, Chongchong .
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
[32]   A Survey of Multilingual Models for Automatic Speech Recognition [J].
Yadav, Hemant ;
Sitaram, Sunayana .
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, :5071-5079
[33]   EXPERIMENTS IN DYSARTHRIC SPEECH RECOGNITION USING ARTIFICIAL NEURAL NETWORKS [J].
JAYARAM, G ;
ABDELHAMIED, K .
JOURNAL OF REHABILITATION RESEARCH AND DEVELOPMENT, 1995, 32 (02) :162-169
[34]   Speech recognition using cluster monitoring scheme and neural networks [J].
Yadav, Munshi ;
Singh, Amit Prakash ;
Singh, Tanya .
3RD INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS, AND APPLICAT/4TH INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 2, 2006, :21-+
[35]   Speech Recognition System Based On Phonemes Using Neural Networks [J].
Maheswari, N. Uma ;
Kabilan, A. P. ;
Venkatesh, R. .
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (07) :148-153
[36]   Yoruba Gender Recognition from Speech Using Neural Networks [J].
Sefara, Tshephisho Joseph ;
Modupe, Abiodun .
2019 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2019), 2019, :50-55
[37]   SPEECH RECOGNITION USING BIOLOGICALLY-INSPIRED NEURAL NETWORKS [J].
Bohnstingl, Thomas ;
Garg, Ayush ;
Wozniak, Stanislaw ;
Saon, George ;
Eleftheriou, Evangelos ;
Pantazi, Angeliki .
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, :6992-6996
[38]   Speech Recognition Based on Weight Function Neural Networks [J].
Zhang, Daiyuan ;
Zhao, Ran .
APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 :1565-1568
[39]   Binary neural networks for speech recognition [J].
Yan-min Qian ;
Xu Xiang .
Frontiers of Information Technology & Electronic Engineering, 2019, 20 :701-715
[40]   Speech recognition with artificial neural networks [J].
Dede, Guelin ;
Sazli, Murat Huesnue .
DIGITAL SIGNAL PROCESSING, 2010, 20 (03) :763-768