Speech/speaker recognition using a HMM/GMM hybrid model

被引：0

作者：

Rodriguez, E ^{[1
]}

Ruiz, B ^{[1
]}

Garcia-Crespo, A ^{[1
]}

Garcia, F ^{[1
]}

机构：

[1] Univ Carlos III Madrid, Legeanes 28911, Madrid, Spain

来源：

AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION | 1997年 / 1206卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a speaker recognition voice based system is presented [5]. We have implemented it in a Sun platform. We train (and test) the system using a Database recorded in several sessions in order to repair the huge effects that the speech variability with time has in the recognition rate system. Several experiments have been made iri order to achieve the best configuration in the system set up. This is an important point to take into account in a real world system in which users train,the system once and the models generated in the training process are not updated for strategic reasons. The recognition rate obtained for the proposed system is around 93% if the speech came from a microphone is around 90% when the speech came from a phone line.

引用

页码：227 / 234

页数：8

共 50 条

[31] Comparison of acoustical models of GMM-HMM based for speech recognition in Hindi using PocketSphinx
Manasa, Chadalavada Sai
Priya, K. Jeeva
Gupta, Deepa
PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 534 - 539
[32] Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System
Zimmermann, Marina
Ghazi, Mostafa Mehdipour
Ekenel, Hazim Kemal
Thiran, Jean-Philippe
COMPUTER VISION - ACCV 2016 WORKSHOPS, PT II, 2017, 10117 : 264 - 276
[33] Hybrid modeling of PHMM and HMM for speech recognition
Ogawa, T
Kobayashi, T
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 140 - 143
[34] GMM and CNN Hybrid Method for Short Utterance Speaker Recognition
Liu, Zheli
Wu, Zhendong
Li, Tong
Li, Jin
Shen, Chao
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) : 3244 - 3252
[35] An improved HMM speech recognition model
Yuan, Lichi
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1311 - 1315
[36] Hybrid HMM/ANN and GMM combination for user-customized password speaker verification
BenZeghiba, MF
Bourlard, H
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 225 - 228
[37] Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM
Nakagawa, S
Zhang, W
Takahashi, M
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 81 - 84
[38] Constructing accurate and robust HMM/GMM models for an Arabic speech recognition system
Khelifa M.O.M.
Elhadj Y.M.
Abdellah Y.
Belkasmi M.
International Journal of Speech Technology, 2017, 20 (04) : 937 - 949
[39] Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition
Kim, Myungjong
Kim, Younggwan
Yoo, Joohong
Wang, Jun
Kim, Hoirin
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2017, 25 (09) : 1581 - 1591
[40] HMM-Based Speaker Emotional Recognition Technology for Speech Signal
Qin, Yuqiang
Zhang, Xueying
FRONTIERS OF MANUFACTURING SCIENCE AND MEASURING TECHNOLOGY, PTS 1-3, 2011, 230-232 : 261 - 265

← 1 2 3 4 5 →