Speech recognition algorithm based on neural network and hidden Markov model

被引:0
作者
Jianhui Z. [1 ]
Hongbo G. [2 ,3 ]
Yuchao L. [1 ]
Bo C. [2 ]
机构
[1] Department of Computer Science and Technology, Tsinghua University, Beijing
[2] State Key Laboratory of Automotive Safety and Energy, Tsinghua University, Beijing
[3] Center for Intelligent Connected Vehicles and Transportation, Tsinghua University, Beijing
来源
Journal of China Universities of Posts and Telecommunications | 2018年 / 25卷 / 04期
基金
中国博士后科学基金;
关键词
HMM; Neural network; Speech recognition;
D O I
10.19682/j.cnki.1005-8885.2018.1014
中图分类号
学科分类号
摘要
This study proposes a hybrid model of speech recognition parallel algorithm based on hidden Markov model (HMM) and artificial neural network (ANN). First, the algorithm uses HMM for time-series modeling of speech signals and calculates the voice to the HMM of the output probability score. Second, with the probability score as input to the neural network, the algorithm gets information for classification and recognition and makes a decision based on the hybrid model. Finally, Matlab software is used to train and test sample data. Simulation results show that using the strong time-series modeling ability of HMM and the classification features of the neural network, the proposed algorithm possesses stronger noise immunity than the traditional HMM. Moreover, the hybrid model corrects the individual flaws of the HMM and the neural network, and greatly improves the speed and performance of speech recognition. © 2018 Beijing University of Posts and Telecommunications. All rights reserved.
引用
收藏
页码:28 / 37
页数:9
相关论文
共 18 条
  • [1] Feng H.W., Xue L., Application of speech recognition system based on algebra algorithm and HMM, Computer Engineering and Design, 31, 24, pp. 5324-5330, (2010)
  • [2] Wang G.Y., Ma X.A., Yu H., Monotonic uncertainty measures for attribute reduction in probabilistic rough set model, International Journal of Approximate Reasoning, 59, 2015, pp. 41-67, (2015)
  • [3] Li H.F., Artieres T., Gallinari P., Data driven design of an ANN/ HMM system for on-line unconstrained handwritten character recognition, The Proceedings of 4th IEEE International Conference on Multimedia Interfaces, pp. l49-154, (2002)
  • [4] Gao H.B., Cheng B., Wang J.Q., Et al., Object classification using CNN-based fusion of vision and LIDAR in autonomous vehicle environment, IEEE Transactions on Industrial Informatics, 14, 9, pp. 4224-4231, (2018)
  • [5] Li X.G., Shao L.H., Li N., The realization of artificial neural network BP algorithm, Computer Knowledge and Technology, 5, pp. 921-924, (2008)
  • [6] Liu H.P., Sun F.C., Fang B., Et al., Robotic room-level localization using multiple sets of sonar measurements, IEEE Transactions on Instrumentation and Measurement, 66, 1, pp. 2-13, (2017)
  • [7] Lan S.K., The speaker recognition system based on GA-BP neural network, Journal of Chongqing University of Technology (Natural Science), 27, 10, pp. 91-95, (2013)
  • [8] Xie G.T., Gao H.B., Wang J.Q., Et al., Vehicle trajectory prediction by integrating physics-and maneuver-based approaches using interactive multiple models, IEEE Transactions on Industrial Electronics, 65, 7, pp. 5999-6008, (2018)
  • [9] Wang Z.Y., Xi X., Duration-distribution-based HMM for speech recognition, Frontiers of Electrical and Electronic Engineering in China, 1, 1, pp. 26-30, (2006)
  • [10] Wang S.L., Zhao Y.P., Shu Y., Et al., Fast search local extremum for maximal information coefficient ( MIC ), Journal of Computational and Applied Mathematics, 5, (2017)