Fuzzy Speech Recognition Algorithm Based on Continuous Density Hidden Markov Model and Self Organizing Feature Map

被引:0
|
作者
Zhang, Yanning [1 ]
Ma, Lei [1 ]
Li, Yunwei [2 ]
机构
[1] Beijing Polytech Univ, Telecommun Engn Inst, Beijing 100176, Peoples R China
[2] Beijing Youth Polit Coll, Deans Off, Beijing, Peoples R China
关键词
Speech recognition; wiener filter; Mel-frequency cepstrum coefficient; continuous hidden Markov model; self- organizing feature neural network; FREQUENCY CEPSTRAL COEFFICIENTS;
D O I
10.34028/iajit/22/2/11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech recognition refers to the process of receiving and understanding human speech input through a computer, converting it into readable text or instructions. In order to improve the denoising effect and speech recognition effect of fuzzy speech, a fuzzy speech recognition algorithm based on continuous density hidden Markov model and self-organizing feature map is proposed. Firstly, the conventional Wiener filtering algorithm is improved by using the dynamic estimation algorithm of noise power spectrum, and the endpoint detection of noisy speech signal is performed by using spectral entropy, and the noise power spectrum of the silent segment is dynamically updated according to the detection results to obtain a more ideal priori signal to noise ratio; Secondly, the fuzzy speech is input into the Wiener filter to eliminate the noise in the speech signal; then, Mel- Frequency Cepstrum Coefficient (MFCC) of speech signal is extracted as speech feature; Finally, combined with the continuous hidden Markov model and the self-organizing feature neural network in the artificial intelligence algorithm, through the process of adjusting parameters, Viterbi decoding, and the time adjustment of the voice signal in the same state, the speech classification and recognition are realized according to the speech characteristics. In the experiment, comparative experiments were conducted on the LibriSpeech dataset using speech recognition algorithms based on convolutional neural networks and recurrent neural networks, speech recognition algorithms based on residual networks and gated convolutional networks, speech recognition algorithms based on multi-scale Mel domain feature map extraction. The experimental results show that the algorithm has good denoising performance. With the increase of added environmental noise intensity, the algorithm can maintain the Signal-to-Noise Ratio (SNR) of speech signals between 88dB-98dB; This algorithm can accurately detect the sound areas in the signal, and the endpoint detection accuracy is high; The accuracy and recall of the Continuous Density Hidden Markov Model-Self-Organizing Feature Neural Network (CDHMM-SOFM) designed in the algorithm increase with the number of iterations, and the highest levels of accuracy and recall can reach 0.89, respectively; The minimum recognition time of this algorithm is only 8.2 seconds, and the highest recognition rate can reach 98.7%; after applying this algorithm, the user's error rate ranges from 0.0031 to 0.0084. The above results indicate that the algorithm has good application performance.
引用
收藏
页数:18
相关论文
共 50 条
  • [2] Fuzzy Hidden Markov Models for Speech Recognition on based FEM Algorithm
    Taheri, Asghar
    Tarihi, Mohammad Reza
    Baghgar, Hassan
    Abad, Bostan
    Bababeyk, Hassan
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 4, 2005, 4 : 59 - 61
  • [3] Continuous Density Hidden Markov Model for Context Dependent Hindi speech Recognition
    Sinha, Shweta
    Agrawal, S. S.
    Jain, Aruna
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1953 - 1958
  • [4] Self-Organizing Hidden Markov Model Map (SOHMMM)
    Ferles, Christos
    Stafylopatis, Andreas
    NEURAL NETWORKS, 2013, 48 : 133 - 147
  • [5] Speech recognition algorithm based on neural network and hidden Markov model
    Zhao Jianhui
    Gao Hongbo
    Liu Yuchao
    Cheng Bo
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2018, 25 (04) : 28 - 37
  • [6] Speech recognition algorithm based on neural network and hidden Markov model
    Jianhui Z.
    Hongbo G.
    Yuchao L.
    Bo C.
    Journal of China Universities of Posts and Telecommunications, 2018, 25 (04): : 28 - 37
  • [7] Hybrid model of hidden Markov models and a self-organizing neural network model in speech recognition
    Li, Jingjiao
    Sun, Jie
    Zhang, Li
    Yao, Tianshun
    Dongbei Daxue Xuebao/Journal of Northeastern University, 1999, 20 (02): : 144 - 147
  • [8] A hybrid model of hidden Markov models and a self-organizing neural network model in speech recognition
    Li, JJ
    Sun, J
    Li, YQ
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 742 - 746
  • [9] A parallel phoneme recognition algorithm based on continuous Hidden Markov Model
    Chung, SH
    Park, MU
    Kim, HS
    IPPS/SPDP 1999: 13TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & 10TH SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1999, : 453 - 457
  • [10] Parallel phoneme recognition algorithm based on continuous Hidden Markov Model
    Chung, Sang-Hwa
    Park, Min-Uk
    Kim, Hyung-Soon
    Proceedings of the International Parallel Processing Symposium, IPPS, 1999, : 453 - 457