Arabic Speech Recognition with Deep Learning: A Review

被引:15
作者
Algihab, Wajdan [1 ]
Alawwad, Noura [1 ]
Aldawish, Anfal [1 ]
AlHumoud, Sarah [1 ]
机构
[1] Al Imam Mohammad Ibn Saud Islamic Univ IMSIU, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
来源
SOCIAL COMPUTING AND SOCIAL MEDIA: DESIGN, HUMAN BEHAVIOR AND ANALYTICS, SCSM 2019, PT I | 2019年 / 11578卷
关键词
Automatic speech recognition (ASR); Arabic Automatic Speech Recognition (AASR); Deep learning; Artificial neural networks (ANN); Deep neural network (DNN); Recurrent neural network (RNN);
D O I
10.1007/978-3-030-21902-4_2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic speech recognition is the area of research concerning the enablement of machines to accept vocal input from humans and interpreting it with the highest probability of correctness. There are several techniques to implement speech recognition models. One of the emerging techniques is using neural networks with deep learning for speech recognition. Arabic is one of the most spoken languages and least highlighted in terms of speech recognition. This paper serves as a brief review on the available studies on Arabic speech recognition. In addition, it sheds some light on the services and toolkits available for Arabic speech recognition systems' development.
引用
收藏
页码:15 / 31
页数:17
相关论文
共 40 条
[1]  
AbdAlmisreb A., 2015, MAXOUT BASED DEEP NE, P6
[2]  
Ahmad AM, 2004, IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2, P98
[3]   Arabic Automatic Speech Recognition Enhancement [J].
Ahmed, Basem H. A. ;
Ghabayen, Ayman S. .
2017 PALESTINIAN INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (PICICT), 2017, :98-102
[4]  
Al-Anzi F., 2018, P 2018 INT C COMP SC, P1
[5]   Arabic broadcast news transcription system [J].
Alghamdi, Mansour ;
Elshafei, Moustafa ;
Al-Muhtaseb, Husni .
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2007, 10 (04) :183-195
[6]  
AlHanai T, 2016, IEEE W SP LANG TECH, P299, DOI 10.1109/SLT.2016.7846280
[7]  
Ali A, 2014, IEEE W SP LANG TECH, P525, DOI 10.1109/SLT.2014.7078629
[8]  
Alotaibi Y. A., 2009, JKAU, V20, P29
[9]   Spoken arabic digits recognizer using recurrent neural networks [J].
Alotaibi, YA .
PROCEEDINGS OF THE FOURTH IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2004, :195-199
[10]  
Amrouche A, 2003, Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, P689