Speech/speaker recognition using a HMM/GMM hybrid model

被引:0
|
作者
Rodriguez, E [1 ]
Ruiz, B [1 ]
Garcia-Crespo, A [1 ]
Garcia, F [1 ]
机构
[1] Univ Carlos III Madrid, Legeanes 28911, Madrid, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a speaker recognition voice based system is presented [5]. We have implemented it in a Sun platform. We train (and test) the system using a Database recorded in several sessions in order to repair the huge effects that the speech variability with time has in the recognition rate system. Several experiments have been made iri order to achieve the best configuration in the system set up. This is an important point to take into account in a real world system in which users train,the system once and the models generated in the training process are not updated for strategic reasons. The recognition rate obtained for the proposed system is around 93% if the speech came from a microphone is around 90% when the speech came from a phone line.
引用
收藏
页码:227 / 234
页数:8
相关论文
共 50 条
  • [31] Comparison of acoustical models of GMM-HMM based for speech recognition in Hindi using PocketSphinx
    Manasa, Chadalavada Sai
    Priya, K. Jeeva
    Gupta, Deepa
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 534 - 539
  • [32] Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System
    Zimmermann, Marina
    Ghazi, Mostafa Mehdipour
    Ekenel, Hazim Kemal
    Thiran, Jean-Philippe
    COMPUTER VISION - ACCV 2016 WORKSHOPS, PT II, 2017, 10117 : 264 - 276
  • [33] Hybrid modeling of PHMM and HMM for speech recognition
    Ogawa, T
    Kobayashi, T
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 140 - 143
  • [34] GMM and CNN Hybrid Method for Short Utterance Speaker Recognition
    Liu, Zheli
    Wu, Zhendong
    Li, Tong
    Li, Jin
    Shen, Chao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) : 3244 - 3252
  • [35] An improved HMM speech recognition model
    Yuan, Lichi
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1311 - 1315
  • [36] Hybrid HMM/ANN and GMM combination for user-customized password speaker verification
    BenZeghiba, MF
    Bourlard, H
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 225 - 228
  • [37] Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM
    Nakagawa, S
    Zhang, W
    Takahashi, M
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 81 - 84
  • [38] Constructing accurate and robust HMM/GMM models for an Arabic speech recognition system
    Khelifa M.O.M.
    Elhadj Y.M.
    Abdellah Y.
    Belkasmi M.
    International Journal of Speech Technology, 2017, 20 (04) : 937 - 949
  • [39] Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition
    Kim, Myungjong
    Kim, Younggwan
    Yoo, Joohong
    Wang, Jun
    Kim, Hoirin
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2017, 25 (09) : 1581 - 1591
  • [40] HMM-Based Speaker Emotional Recognition Technology for Speech Signal
    Qin, Yuqiang
    Zhang, Xueying
    FRONTIERS OF MANUFACTURING SCIENCE AND MEASURING TECHNOLOGY, PTS 1-3, 2011, 230-232 : 261 - 265