A Method to Integrate GMM, SVM and DTW for Speaker Recognition

被引:0
|
作者
Ding, Ing-Jr [1 ]
Yen, Chih-Ta [1 ]
Ou, Da-Cheng [1 ]
机构
[1] Natl Formosa Univ, Dept Elect Engn, Huwei Township, Yunlin, Taiwan
关键词
speaker recognition; Gaussian mixture model; support vector machine; dynamic time wrapping; SVMGMM-DTW;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper develops an effective and efficient scheme to integrate Gaussian mixture model (GMM), support vector machine (SVM), and dynamic time wrapping (DTW) for automatic speaker recognition. GMM and SVM are two popular classifiers for speaker recognition applications. DTW is a fast and simple template matching method, and it is frequently seen in applications of speech recognition. In this work, DTW does not play a role to perform speech recognition, and it will be employed to be a verifier for verification of valid speakers. The proposed combination scheme of GMM, SVM and DTW, called SVMGMM-DTW, for speaker recognition in this study is a two-phase verification process task including GMM-SVM verification of the first phase and DTW verification of the second phase. By providing a double check to verify the identity of a speaker, it will be difficult for imposters to try to pass the security protection; therefore, the safety degree of speaker recognition systems will be largely increased. A series of experiments designed on door access control applications demonstrated that the superiority of the developed SVMGMM-DTW on speaker recognition accuracy.
引用
收藏
页码:38 / 47
页数:10
相关论文
共 50 条
  • [1] SVM AGAINST GMM/SVM FOR DIALECT INFLUENCE ON AUTOMATIC SPEAKER RECOGNITION TASK
    Zergat, Kawthar
    Amrouche, Abderrahmane
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2014, 13 (02)
  • [2] Performances Evaluation of GMM-UBM and GMM-SVM for Speaker Recognition in Realistic World
    Asbai, Nassim
    Amrouche, Abderrahmane
    Debyeche, Mohamed
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 284 - 291
  • [3] A GMM SUPERVECTOR KERNEL WITH THE BHATTACHARYYA DISTANCE FOR SVM BASED SPEAKER RECOGNITION
    You, Chang Huai
    Lee, Kong Aik
    Li, Haizhou
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4221 - 4224
  • [4] An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition
    You, Chang Huai
    Lee, Kong Aik
    Li, Haizhou
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (1-3) : 49 - 52
  • [5] Text-independent speaker recognition using probabilistic SVM with GMM adjustment
    Hou, FL
    Wang, BX
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 305 - 308
  • [6] GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition
    You, Chang Huai
    Lee, Kong Aik
    Li, Haizhou
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1300 - 1312
  • [7] A new hybrid GMM/SVM for speaker verification
    Liu, Minghui
    Xie, Yanlu
    Yao, Zhiqiang
    Dai, Beiqian
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 314 - +
  • [8] A hybrid GMM/SVM approach to speaker identification
    Fine, S
    Navrátil, J
    Gopinath, RA
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 417 - 420
  • [9] A Robust SVM/GMM Classifier for Speaker Verification
    Cirovic, Zoran
    Cirovic, Natasa
    SPEECH AND COMPUTER, 2014, 8773 : 74 - 80
  • [10] GMM and CNN Hybrid Method for Short Utterance Speaker Recognition
    Liu, Zheli
    Wu, Zhendong
    Li, Tong
    Li, Jin
    Shen, Chao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) : 3244 - 3252