A comprehensive survey on automatic speech recognition using neural networks

Cited by: 28
Authors
Dhanjal, Amandeep Singh [1]
Singh, Williamjeet [2]
Affiliations
[1] Punjabi Univ, Dept Comp Sci, Rajpura Rd, Patiala 147001, Punjab, India
[2] Punjabi Univ, Dept Comp Sci & Engn, Rajpura Rd, Patiala 147001, Punjab, India
Keywords
Speech recognition; Dataset; Tools; Neural network; Deep learning; ARABIC SPEECH; SYSTEM; NOISE; HMM; ARCHITECTURES; SEGMENTATION; PRIMER
DOI
10.1007/s11042-023-16438-y
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Automatic Speech Recognition has developed continuously and demonstrated enormous potential in human interaction communication systems. Achieving high accuracy remains challenging because several factors, such as different dialects, spontaneous speech, speaker enrolment, computation power, dataset, and noisy environments, decrease the performance of a speech recognition system. This has motivated researchers to make innovative contributions to the development of robust speech recognition systems. The study presents a systematic analysis of state-of-the-art research in this field from 2015 to 2021. Its prime focus is the neural network-based speech recognition techniques, datasets, toolkits, and evaluation metrics used over those seven years. It also synthesizes the evidence from past studies to provide empirical solutions for accuracy improvement. The study highlights the current status of speech recognition systems using neural networks and offers a brief introduction for new researchers.
Pages: 23367-23412
Page count: 46