Automatic Isolated Speech Recognition System Using MFCC Analysis and Artificial Neural Network Classifier: Feasible For Diversity of Speech Applications

被引:0
作者
Shakil, Md Daud [1 ]
Rahman, Md Abdur [1 ]
Soliman, Md Mohiuddin [2 ]
Islam, Md Atiqul [1 ]
机构
[1] Int Islamic Univ Chittagong, Dept Elect & Elect Engn, Chittagong 4318, Bangladesh
[2] Int Islamic Univ Chittagong, Dept Elect & Telecommun Engn, Chittagong 4318, Bangladesh
来源
2020 18TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED) | 2020年
关键词
MFCC; AISR; Artificial Neural Network (ANN); Automatic Speech Recognition (ASR) system;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this research, an Automatic Isolated Speech Recognition System (AISR) has been proposed to categorize four distinct word class pronounced as "GO"; "STOP"; "LEFT" and "RIGHT". Speech data has investigated mainly employing Mel Frequency Cestrum analysis together with specific subsidiary discrete signal processing to generate compact and productive speech database. In this paper, an Artificial Neural Network (ANN) has been used as a classifier to categorize four distinct word classes. The ANN functions with the Feed Forward Multi-layer Perceptron (FFMP) with Back Propagation (BP) training algorithm. Trial and error procedure is adopted to determine an optimally performed ANN classifier. The presented method has achieved the best recognition rates 97.14% and 98.57% respectively for two training algorithms- Steepest Gradient Descent with Adaptive Learning and Scaled Conjugate Gradient by investigation on different configuration and parameters of the ANN. To validate the feasibility and efficiency of the proposed ASR system, a small-scale application has implemented where two filament bulbs have been efficiently operated with designated speech classes.
引用
收藏
页码:300 / 305
页数:6
相关论文
共 13 条
  • [1] [Anonymous], 1994, Simulation Neuronaler Netze
  • [2] Robust speech recognition using factorial HMMs for home environments
    Betkowska, Agnieszka
    Shinoda, Koichi
    Furui, Sadaoki
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)
  • [3] Chauhan N, 2019, 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), P130, DOI [10.1109/CCOMS.2019.8821751, 10.1109/ccoms.2019.8821751]
  • [4] Das B.P., 2012, INT J MODERN ENG RES, V2, P854
  • [5] Desai N., 2013, INT J EMERG TECHNOL, V3, P367
  • [6] Gevaert W., 2010, Journal of Automatic Control, V20, P1
  • [7] Islam M.A., 2019, ENG APPL SCI RES, V46, P56, DOI DOI 10.14456/EASR.2019.7
  • [8] Peeling S., 1987, EXPT ISOLARED DIGIT
  • [9] Introduction to Digital Speech Processing
    Rabiner, Lawrence R.
    Schafer, Ronald W.
    [J]. FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2007, 1 (1-2): : 1 - 194
  • [10] A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION
    RABINER, LR
    [J]. PROCEEDINGS OF THE IEEE, 1989, 77 (02) : 257 - 286