Speech/speaker recognition using a HMM/GMM hybrid model

被引：0

作者：

Rodriguez, E ^{[1
]}

Ruiz, B ^{[1
]}

Garcia-Crespo, A ^{[1
]}

Garcia, F ^{[1
]}

机构：

[1] Univ Carlos III Madrid, Legeanes 28911, Madrid, Spain

来源：

AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION | 1997年 / 1206卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a speaker recognition voice based system is presented [5]. We have implemented it in a Sun platform. We train (and test) the system using a Database recorded in several sessions in order to repair the huge effects that the speech variability with time has in the recognition rate system. Several experiments have been made iri order to achieve the best configuration in the system set up. This is an important point to take into account in a real world system in which users train,the system once and the models generated in the training process are not updated for strategic reasons. The recognition rate obtained for the proposed system is around 93% if the speech came from a microphone is around 90% when the speech came from a phone line.

引用

页码：227 / 234

页数：8

共 50 条

[41] HMM-separation-based speech recognition for a distant moving speaker
Takiguchi, T
Nakamura, S
Shikano, K
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (02): : 127 - 140
[42] An Innovative Method for Speech Signal Emotion Recognition Based on Spectral Features Using GMM and HMM Techniques
Mohammed Jawad Al-Dujaili Al-Khazraji
Abbas Ebrahimi-Moghadam
Wireless Personal Communications, 2024, 134 : 735 - 753
[43] An Innovative Method for Speech Signal Emotion Recognition Based on Spectral Features Using GMM and HMM Techniques
Al-Khazraji, Mohammed Jawad Al-Dujaili
Ebrahimi-Moghadam, Abbas
WIRELESS PERSONAL COMMUNICATIONS, 2024, 134 (02) : 735 - 753
[44] Speech recognition for a distant moving speaker based on HMM composition and separation
Takiguchi, T
Nakamura, S
Shikano, K
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1403 - 1406
[45] Discriminative speaker recognition using large margin GMM
Jourani, Reda
Daoudi, Khalid
Andre-Obrecht, Regine
Aboutajdine, Driss
NEURAL COMPUTING & APPLICATIONS, 2013, 22 (7-8): : 1329 - 1336
[46] Discriminative speaker recognition using large margin GMM
Reda Jourani
Khalid Daoudi
Régine André-Obrecht
Driss Aboutajdine
Neural Computing and Applications, 2013, 22 : 1329 - 1336
[47] TEXT INDEPENDENT SPEAKER RECOGNITION SYSTEM USING GMM
Bagul, S. G.
Shastri, R. K.
2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,
[48] Applying Batch Normalization to Hybrid NN-HMM Model For Speech Recognition
Zhan, Hongjian
Chen, Guilin
Lu, Yue
PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 427 - 435
[49] A Kinect Based Gesture Recognition Algorithm Using GMM and HMM
Song, Yang
Gu, Yu
Wang, Peisen
Liu, Yuanning
Li, Ao
PROCEEDINGS OF THE 2013 6TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2013), VOLS 1 AND 2, 2013, : 750 - 754
[50] FULL-SUM DECODING FOR HYBRID HMM BASED SPEECH RECOGNITION USING LSTM LANGUAGE MODEL
Zhou, Wei
Schlueter, Ralf
Ney, Hermann
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7834 - 7838

← 1 2 3 4 5 →