Fuzzy Speech Recognition Algorithm Based on Continuous Density Hidden Markov Model and Self Organizing Feature Map

Cited by: 0

Authors:

Zhang, Yanning [1]

Ma, Lei [1]

Li, Yunwei [2]

Affiliations:

[1] Beijing Polytech Univ, Telecommun Engn Inst, Beijing 100176, Peoples R China

[2] Beijing Youth Polit Coll, Deans Off, Beijing, Peoples R China

Source:

INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY | 2025 / Vol. 22 / No. 02

Keywords:

Speech recognition; Wiener filter; Mel-frequency cepstrum coefficient; continuous hidden Markov model; self-organizing feature neural network; FREQUENCY CEPSTRAL COEFFICIENTS;

DOI:

10.34028/iajit/22/2/11

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Speech recognition is the process by which a computer receives and interprets human speech input and converts it into readable text or instructions. To improve both the denoising and the recognition of fuzzy speech, a fuzzy speech recognition algorithm based on a continuous density hidden Markov model and a self-organizing feature map is proposed. First, the conventional Wiener filtering algorithm is improved with a dynamic noise power spectrum estimation algorithm: spectral entropy is used for endpoint detection of the noisy speech signal, and the noise power spectrum of the silent segments is dynamically updated according to the detection results to obtain a better a priori signal-to-noise ratio. Second, the fuzzy speech is passed through the Wiener filter to remove the noise in the speech signal. Then, the Mel-Frequency Cepstrum Coefficients (MFCC) of the speech signal are extracted as speech features. Finally, two artificial intelligence techniques, the continuous hidden Markov model and the self-organizing feature neural network, are combined, and speech is classified and recognized from the speech features through parameter adjustment, Viterbi decoding, and time adjustment of the speech signal within the same state. The algorithm was compared on the LibriSpeech dataset against speech recognition algorithms based on convolutional and recurrent neural networks, on residual networks and gated convolutional networks, and on multi-scale Mel-domain feature map extraction. The experimental results show that the algorithm has good denoising performance.
As the intensity of the added environmental noise increases, the algorithm keeps the Signal-to-Noise Ratio (SNR) of the speech signals between 88 dB and 98 dB. It accurately detects the speech regions of the signal, with high endpoint detection accuracy. The accuracy and recall of the Continuous Density Hidden Markov Model-Self-Organizing Feature Neural Network (CDHMM-SOFM) designed in the algorithm increase with the number of iterations, each reaching a maximum of 0.89. The minimum recognition time of the algorithm is only 8.2 seconds, and the highest recognition rate reaches 98.7%. After the algorithm is applied, the user's error rate ranges from 0.0031 to 0.0084. These results indicate that the algorithm performs well in application.
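The denoising front end described in the abstract (spectral-entropy endpoint detection, a noise power spectrum updated only in silent segments, and a Wiener gain computed from the a priori SNR) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, frame sizes, entropy threshold rule (`margin`), and smoothing constants (`alpha`, `beta`) are all assumptions chosen for illustration, and the a priori SNR is estimated here with the standard decision-directed rule, which the abstract does not confirm. The sketch also assumes that the first few frames contain noise only.

```python
import numpy as np

def frame_signal(x, frame_len=256, hop=128):
    """Split a 1-D signal into overlapping Hann-windowed frames."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx] * np.hanning(frame_len)

def spectral_entropy(power):
    """Entropy of each frame's normalized power spectrum.

    Noise-like frames have a flat spectrum and high entropy;
    speech-like frames concentrate energy and have lower entropy.
    """
    p = power / (power.sum(axis=1, keepdims=True) + 1e-12)
    return -(p * np.log(p + 1e-12)).sum(axis=1)

def wiener_gains(x, n_init=5, alpha=0.98, beta=0.9, margin=0.9):
    """Return per-frame Wiener gains and a speech/silence mask.

    Assumptions (hypothetical, not from the paper):
      - the first n_init frames are noise only;
      - a frame is "speech" if its entropy falls below
        margin * (mean entropy of the initial noise frames);
      - alpha is the decision-directed smoothing factor;
      - beta is the noise spectrum smoothing factor.
    """
    frames = frame_signal(x)
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2

    # Endpoint detection by spectral entropy.
    entropy = spectral_entropy(power)
    is_speech = entropy < margin * entropy[:n_init].mean()

    noise_psd = power[:n_init].mean(axis=0)
    prev_clean = np.zeros_like(noise_psd)
    gains = np.empty_like(power)
    for t in range(len(power)):
        # Update the noise spectrum only in frames detected as silent.
        if not is_speech[t]:
            noise_psd = beta * noise_psd + (1 - beta) * power[t]
        post_snr = power[t] / (noise_psd + 1e-12)
        # Decision-directed a priori SNR estimate.
        prio_snr = (alpha * prev_clean / (noise_psd + 1e-12)
                    + (1 - alpha) * np.maximum(post_snr - 1.0, 0.0))
        gains[t] = prio_snr / (1.0 + prio_snr)  # Wiener gain
        prev_clean = gains[t] ** 2 * power[t]   # clean-speech power estimate
    return gains, is_speech
```

Multiplying each frame's spectrum by its gain and resynthesizing by overlap-add would give the denoised signal. MFCC extraction and CDHMM-SOFM classification would then operate on that output.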

Pages: 18
