Optimization learning of hidden Markov model using the bacterial foraging optimization algorithm for speech recognition

被引:1
作者
Benmachiche, A. [1 ]
Makhlouf, A. [1 ]
Bouhadada, T. [2 ,3 ]
机构
[1] Chadli Bendjedid Univ, Dept Comp Sci, PB 73, El Tarf 36000, Algeria
[2] Badji Mokhtar Univ, Dept Comp Sci, PB 12, Annaba 23000, Algeria
[3] Badji Mokhtar Univ, Lab LRI, PB 12, Annaba 23000, Algeria
关键词
Automatic speech recognition; acoustic information; bacterial foraging optimization algorithm; BFOA/HMM; Gaussian mixture densities; Baum-Welch; DISTRIBUTED OPTIMIZATION; BIOMIMICRY;
D O I
10.3233/KES-200039
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, the speech recognition applications can be found in several activities, and their existence as a field of study and research lasts for a long time. Although, many studies deal with different problems, in security-related areas, biometric identification, access to the Smartphone ... Etc. In automatic speech recognition (ASR) systems, hidden Markov models (HMMs) have widely used for modeling the temporal speech signal. In order to optimize HMM parameters (i.e., observation and transition probabilities), iterative algorithms commonly used such as Forward-Backward or Baum-Welch. In this article, we propose to use the bacterial foraging optimization algorithm (BFOA) to enhance HMM with Gaussian mixture densities. As a global optimization algorithm of current interest, BFOA has proven itself for distributed optimization and control. Our experimental results show that the proposed approach yields a significant improvement of the transcription accuracy at signal/noise ratios greater than 15 dB.
引用
收藏
页码:171 / 181
页数:11
相关论文
共 29 条
[1]  
[Anonymous], 1991, TR91069 ICSI
[2]  
[Anonymous], 2005, HTK BOOK HTK VERSION
[3]  
Benmachiche A., 2016, INTELLIGENT DECISION, V10
[4]  
Bourouba H, 2006, PROC 2 INT C INF COM, V1, P1264
[5]   Towards Evaluating the Robustness of Neural Networks [J].
Carlini, Nicholas ;
Wagner, David .
2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, :39-57
[6]  
Cumani S., 2014, IEEE T AUDIO SPEECH
[7]   AUTOMATIC RECOGNITION OF SPOKEN DIGITS [J].
DAVIS, KH ;
BIDDULPH, R ;
BALASHEK, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1952, 24 (06) :637-642
[8]  
Dorigo M., 1997, IEEE Transactions on Evolutionary Computation, V1, P53, DOI 10.1109/4235.585892
[9]  
Elmahdy M., 2009, P 8 INT S NAT LANG
[10]  
Graves A, 2013, INT CONF ACOUST SPEE, P6645, DOI 10.1109/ICASSP.2013.6638947