A comprehensive survey on automatic speech recognition using neural networks

Cited by: 28
Authors
Dhanjal, Amandeep Singh [1]
Singh, Williamjeet [2]
Affiliations
[1] Punjabi Univ, Dept Comp Sci, Rajpura Rd, Patiala 147001, Punjab, India
[2] Punjabi Univ, Dept Comp Sci & Engn, Rajpura Rd, Patiala 147001, Punjab, India
Keywords
Speech recognition; Dataset; Tools; Neural network; Deep learning; ARABIC SPEECH; SYSTEM; NOISE; HMM; ARCHITECTURES; SEGMENTATION; PRIMER;
DOI
10.1007/s11042-023-16438-y
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Automatic Speech Recognition has developed continuously and has demonstrated enormous potential in human interaction communication systems. Achieving high accuracy remains challenging because several factors degrade the performance of a speech recognition system, including different dialects, spontaneous speech, speaker enrolment, computation power, dataset, and noisy environments. This challenge has motivated many researchers to make innovative contributions toward a robust speech recognition system. This study presents a systematic analysis of state-of-the-art research in the field published during 2015-2021. Its prime focus is the neural network-based speech recognition techniques, datasets, toolkits, and evaluation metrics used over those seven years. It also synthesizes the evidence from past studies to provide empirical solutions for accuracy improvement. The study highlights the current status of speech recognition systems using neural networks and offers new researchers a brief introduction to the field.
Pages: 23367-23412
Number of pages: 46