Speech/speaker recognition using a HMM/GMM hybrid model

被引:0
|
作者
Rodriguez, E [1 ]
Ruiz, B [1 ]
Garcia-Crespo, A [1 ]
Garcia, F [1 ]
机构
[1] Univ Carlos III Madrid, Legeanes 28911, Madrid, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a speaker recognition voice based system is presented [5]. We have implemented it in a Sun platform. We train (and test) the system using a Database recorded in several sessions in order to repair the huge effects that the speech variability with time has in the recognition rate system. Several experiments have been made iri order to achieve the best configuration in the system set up. This is an important point to take into account in a real world system in which users train,the system once and the models generated in the training process are not updated for strategic reasons. The recognition rate obtained for the proposed system is around 93% if the speech came from a microphone is around 90% when the speech came from a phone line.
引用
收藏
页码:227 / 234
页数:8
相关论文
共 50 条
  • [41] HMM-separation-based speech recognition for a distant moving speaker
    Takiguchi, T
    Nakamura, S
    Shikano, K
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (02): : 127 - 140
  • [42] An Innovative Method for Speech Signal Emotion Recognition Based on Spectral Features Using GMM and HMM Techniques
    Mohammed Jawad Al-Dujaili Al-Khazraji
    Abbas Ebrahimi-Moghadam
    Wireless Personal Communications, 2024, 134 : 735 - 753
  • [43] An Innovative Method for Speech Signal Emotion Recognition Based on Spectral Features Using GMM and HMM Techniques
    Al-Khazraji, Mohammed Jawad Al-Dujaili
    Ebrahimi-Moghadam, Abbas
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 134 (02) : 735 - 753
  • [44] Speech recognition for a distant moving speaker based on HMM composition and separation
    Takiguchi, T
    Nakamura, S
    Shikano, K
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1403 - 1406
  • [45] Discriminative speaker recognition using large margin GMM
    Jourani, Reda
    Daoudi, Khalid
    Andre-Obrecht, Regine
    Aboutajdine, Driss
    NEURAL COMPUTING & APPLICATIONS, 2013, 22 (7-8): : 1329 - 1336
  • [46] Discriminative speaker recognition using large margin GMM
    Reda Jourani
    Khalid Daoudi
    Régine André-Obrecht
    Driss Aboutajdine
    Neural Computing and Applications, 2013, 22 : 1329 - 1336
  • [47] TEXT INDEPENDENT SPEAKER RECOGNITION SYSTEM USING GMM
    Bagul, S. G.
    Shastri, R. K.
    2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,
  • [48] Applying Batch Normalization to Hybrid NN-HMM Model For Speech Recognition
    Zhan, Hongjian
    Chen, Guilin
    Lu, Yue
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 427 - 435
  • [49] A Kinect Based Gesture Recognition Algorithm Using GMM and HMM
    Song, Yang
    Gu, Yu
    Wang, Peisen
    Liu, Yuanning
    Li, Ao
    PROCEEDINGS OF THE 2013 6TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2013), VOLS 1 AND 2, 2013, : 750 - 754
  • [50] FULL-SUM DECODING FOR HYBRID HMM BASED SPEECH RECOGNITION USING LSTM LANGUAGE MODEL
    Zhou, Wei
    Schlueter, Ralf
    Ney, Hermann
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7834 - 7838