Optimal parameters selected for automatic recognition of spoken Amazigh digits and letters using Hidden Markov Model Toolkit

被引:0
作者
Safâa El Ouahabi
Mohamed Atounti
Mohamed Bellouki
机构
[1] Polydisciplinary Faculty of Nador,Laboratory of Applied Mathematics and Information System
来源
International Journal of Speech Technology | 2020年 / 23卷
关键词
Amazigh isolated words; Automatic speech recognition system (ASR); Hidden Markov Model (HMM); Hidden Markov Model Toolkit (HTK); Gaussian mixture modelling (GMM); Mel-frequency cepstral coefficients (MFCC);
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we present our Amazigh automatic speech recognition system. Its realization is constructed with context-independent phonetic Hidden Markov Models. Many choices are made on this system, such as the number of states of the models, the type of emission probability densities associated with the states, and the representation of the signal by cepstral coefficients. The results of recognition of our system place it at a level of height performance comparable to that achieved by Markovian automatic speech recognition systems. Our system is designed to recognize 43 distinct isolated Amazigh words (33 letters and 10 digits). The recognition rate is then calculated for each digit and letter. The overall accuracy and word recognition rate for the whole database achieved 91.31% after extensive testing and change of the recognition parameters. The results obtained in this work are improved in association with our previous work concerning Amazigh spoken digits and letters automatic speech recognition, using Hidden Markov Model Toolkit.
引用
收藏
页码:861 / 871
页数:10
相关论文
共 39 条
[1]  
Chapaneri SV(2012)Spoken digits recognition using Weighted MFCC and improved features for dynamic time warping International Journal of Computer Applications 40 6-12
[2]  
Dempster AP(1977)Maximum-likelihood from incomplete data via the EM algorithm Journal of the Royal Statistical Society. Series B 39 1-38
[3]  
Laird NM(1991)Large vocabulary word recognition using context-dependent allophonic hidden Markov models Computer Speech & Language 4 345-357
[4]  
Rubin DB(2014)Idrissi, Automatic Speech Recognition for Tamazight Enchained Digit World Journal Control Science and Engineering 2 1-5
[5]  
Deng L(2019)Amazigh Speech Recognition using triphone modeling and clustering tree decision Annals of the University of Craiova, Mathematics and Computer Science Series 46 56-432
[6]  
Lennig M(2019)Toward an automatic speech recognition system for amazigh-tarifit language International Journal of Speech Technology 22 421-148
[7]  
Seitz F(1993)The SPHINX-II speech recognition system : an overview Computer Speech and Language 7 137-309
[8]  
Mermelstein P(1986)Maximum likelihood estimation for mixture multivariate stochastic observations of Markov chains IEEE International Symposium on Information Theory 32 307-45
[9]  
El Ghazi C(1990)An overview of the SPHINX speech recognition system IEEE Transactions on Acoustics, Speech and Signal Processing 38 35-272
[10]  
Daoui N(2011)Automatic isolated digit recognition system: an approach using HMM Journal of Scientific and Industrial Research 70 270-40