Optimization learning of hidden Markov model using the bacterial foraging optimization algorithm for speech recognition

被引:1
作者
Benmachiche, A. [1 ]
Makhlouf, A. [1 ]
Bouhadada, T. [2 ,3 ]
机构
[1] Chadli Bendjedid Univ, Dept Comp Sci, PB 73, El Tarf 36000, Algeria
[2] Badji Mokhtar Univ, Dept Comp Sci, PB 12, Annaba 23000, Algeria
[3] Badji Mokhtar Univ, Lab LRI, PB 12, Annaba 23000, Algeria
关键词
Automatic speech recognition; acoustic information; bacterial foraging optimization algorithm; BFOA/HMM; Gaussian mixture densities; Baum-Welch; DISTRIBUTED OPTIMIZATION; BIOMIMICRY;
D O I
10.3233/KES-200039
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, the speech recognition applications can be found in several activities, and their existence as a field of study and research lasts for a long time. Although, many studies deal with different problems, in security-related areas, biometric identification, access to the Smartphone ... Etc. In automatic speech recognition (ASR) systems, hidden Markov models (HMMs) have widely used for modeling the temporal speech signal. In order to optimize HMM parameters (i.e., observation and transition probabilities), iterative algorithms commonly used such as Forward-Backward or Baum-Welch. In this article, we propose to use the bacterial foraging optimization algorithm (BFOA) to enhance HMM with Gaussian mixture densities. As a global optimization algorithm of current interest, BFOA has proven itself for distributed optimization and control. Our experimental results show that the proposed approach yields a significant improvement of the transcription accuracy at signal/noise ratios greater than 15 dB.
引用
收藏
页码:171 / 181
页数:11
相关论文
共 29 条
[21]  
Panayotov V, 2015, INT CONF ACOUST SPEE, P5206, DOI 10.1109/ICASSP.2015.7178964
[22]  
Passino KM, 2002, IEEE CONTR SYST MAG, V22, P52, DOI 10.1109/MCS.2002.1004010
[23]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[24]  
Ratnadeep D., 2016, 2 INT C COGN KNOWL E
[25]  
Saborido R., 2017, EVOLUTIONARY COMPUTA, V25
[26]  
Sagheer A, 2005, 2005 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), VOLS 1 AND 2, P761
[27]   Multi swarm optimization based adaptive fuzzy multi agent system for microgrid multi-objective energy management [J].
Serraji, Maria ;
El Amine, Didi Omar ;
Boumhidi, Jaouad .
INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2016, 20 (04) :229-243
[28]   Human and machine consonant recognition [J].
Sroka, JJ ;
Braida, LD .
SPEECH COMMUNICATION, 2005, 45 (04) :401-423
[29]   AUTOMATIC RECOGNITION OF 200 WORDS [J].
VELICHKO, VM ;
ZAGORUYKO, NG .
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1970, 2 (03) :223-234