Artificial Neural Network for Arabic Speech Recognition in Humanoid Robotic Systems

被引:10
作者
Al-Abdullah, A. [1 ]
Al-Ajmi, A. [1 ]
Al-Mutairi, A. [1 ]
Al-Mousa, N. [1 ]
Al-Daihani, S. [1 ]
Karar, A. S. [1 ]
Alkork, S. [1 ]
机构
[1] Amer Univ Middle East, Coll Engn & Technol, Kuwait, Kuwait
来源
2019 3RD INTERNATIONAL CONFERENCE ON BIO-ENGINEERING FOR SMART TECHNOLOGIES (BIOSMART) | 2019年
关键词
artificial neural network; speech recognition;
D O I
10.1109/biosmart.2019.8734261
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech recognition is projected to play an increasingly important role in the future of human/machine interfacing. The objective of this study is to engineer a speech recognition system capable of deployment onto a general humanoid robot. A MATLAB based program for speech extraction and identification is constructed. Although there are many different algorithms used in speech recognition, the utilization of artificial neural networks (ANNs) was found to be adequate for the Arabic language, with its multitude of complexities, accents and linguistic intentionality. Furthermore, ANN is powerful and can model complex functions while offering the opportunity for additional cognitive abilities. The software tool developed converts the incoming audio signal into a two dimensional spectrogram, which is subsequently supplied to the ANN through a mel frequency cepstral coefficients (mfcc) algorithm. A software product capable of converting Arabic speech to commands was developed for controlling the "NAO" humanoid robot with 90% hit rate.
引用
收藏
页数:4
相关论文
共 8 条
[1]   Improved Arabic speech recognition system through the automatic generation of fine-grained phonetic transcriptions [J].
Alsharhan, Eiman ;
Ramsay, Allan .
INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (02) :343-353
[2]  
[Anonymous], 1963, J SOC IND APPL MATH, DOI [DOI 10.1137/0111030, 10.1137/0111030]
[3]   COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].
DAVIS, SB ;
MERMELSTEIN, P .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366
[4]  
Levenberg K., 1944, Q. Appl. Math, V2, P164, DOI [10.1090/QAM/10666, 10.1090/qam/10666, DOI 10.1090/QAM/10666]
[5]  
Mital D. P., 1989, Robotics and Autonomous Systems, V4, P339, DOI 10.1016/0921-8890(89)90033-X
[6]  
Oppenheim Alan V, 1999, Discrete-time Signal Processing, DOI DOI 10.1049/EP.1977.0078
[7]   SPEECH RECOGNITION BY MACHINE - REVIEW [J].
REDDY, DR .
PROCEEDINGS OF THE IEEE, 1976, 64 (04) :501-531
[8]   Development of a Sign Language Dialogue System for a Healing Dialogue Robot [J].
Huang, Xuan ;
Wu, Bo ;
Kameda, Hiroyuki .
2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, :867-872