Amazigh Isolated-Word Speech Recognition System Using Hidden Markov Model Toolkit (HTK)

被引：0

作者：

Elouahabi, Safaa ^{[1
]}

Atounti, Mohamed ^{[1
]}

Bellouki, Mohamed ^{[1
]}

机构：

[1] Fac Polydisciplinary Nador, Lab Appl Math & Informat Syst, Nador, Morocco

来源：

2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR ORGANIZATIONS DEVELOPMENT (IT4OD) | 2016年

关键词：

automatic speech recognition; hidden markov models; mel frequency spectral coefficients; hidden markov model toolkit (HTK); amazigh language;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

This paper aims to build a speaker-independent automatic Amazigh Isolated-Word speech recognition system. Hidden Markov Model toolkit (HTK) that uses hidden Markov Models has been used to develop the system. The recognition vocabulary consists on the Amazigh Letters and Digits. The system has been trained to recognize the Amazigh 10 first digits and 33 alphabets. Mel frequency spectral coefficients (MFCCs) have been used to extract the feature. The training data has been collected from 60 speakers including both males and females. The test-data used for evaluating the system-performance has been collected from 20 speakers. The experimental results show that the presented system provides the overall word-accuracy 80%. The initial results obtained are very satisfactory in comparison with the training database's size, this encourages us to increase system performance to achieve a higher recognition rate.

引用

页数：7

共 18 条

[11]

Gales M. J. F, 2007, AUT SPEECH REC UND, P24, DOI [10.1109/ASRU.2007.4430078, DOI 10.1109/ASRU.2007.4430078]

[12]

Kimutai S. K., 2013, INT J EMERGING SCI E, V2

[13]

Kumar Kuldeep, 2012, International Journal of Computational Systems Engineering, V1, P25, DOI 10.1504/IJCSYSE.2012.044740

[14]

Moustaoui A, 2003, AMAZIGH LANGUAGE MOR

[15]

Nimje K, 2011, J SCI IND RES INDIA, V70, P270

[16]

REDDY DR, 1975, SPEECH RECOGNITION

[17] A large vocabulary continuous speech recognition system for Persian language [J].

Sameti, Hossein ;

Veisi, Hadi ;

Bahrani, Mohammad ;

Babaali, Bagher ;

Hosseinzadeh, Khosro .

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, :1-12

[18] Investigation Amazigh speech recognition using CMU tools [J].

Satori, Hassan ;

ElHaoussi, Fatima .

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (03) :235-243

← 1 2 →