A GMM SUPERVECTOR KERNEL WITH THE BHATTACHARYYA DISTANCE FOR SVM BASED SPEAKER RECOGNITION

被引:0
|
作者
You, Chang Huai [1 ]
Lee, Kong Aik [1 ]
Li, Haizhou [1 ]
机构
[1] ASTAR, Inst Infocomm Res I2R, Singapore, Singapore
关键词
Gaussian Mixture Model; Support Vector Machine; Supervector; Speaker Verification; NIST Evaluation; VERIFICATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Gaussian mixture model (GMM) supervector is one of the effective techniques in text independent speaker recognition. In our previous work, we introduce the GMM-UBM mean interval (GUMI) concept based on the Bhattacharyya distance. Subsequently GUMI kernel was successfully used in conjunction with support vector machine (SVM) for speaker recognition. Besides the first order statistics, it is generally believed that speaker cues are also partly conveyed by second order statistics. In this paper, we extend the Bhattacharyya-based SVM kernel by constructing the supervector with the mean statistical vector and the covariance statistical vector. Comparing with the Kullback-Leibler (KL) kernel, we demonstrate the effectiveness of the new kernel on the 2006 National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) dataset.
引用
收藏
页码:4221 / 4224
页数:4
相关论文
共 50 条
  • [21] A Method to Integrate GMM, SVM and DTW for Speaker Recognition
    Ding, Ing-Jr
    Yen, Chih-Ta
    Ou, Da-Cheng
    INTERNATIONAL JOURNAL OF ENGINEERING AND TECHNOLOGY INNOVATION, 2014, 4 (01) : 38 - 47
  • [22] Human Action Recognition based on GMM-UBM supervector using SVM with non-linear GMM KL and GUMI
    Bui, Nam N.
    Kim, Young J.
    SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2015), 2015, 9631
  • [23] Enhanced Speaker Verification Using GMM-Supervector Based Modified Adaptive GMM Training
    Trinh, Tan Dat
    Park, Min Kyung
    Kim, Jin Young
    Lee, Kyong Rok
    Cho, Keeseong
    MOBILE AND WIRELESS TECHNOLOGY 2015, 2015, 310 : 147 - 153
  • [24] Singing speaker clustering based on subspace learning in the GMM mean supervector space
    Mehrabani, Mahnoosh
    Hansen, John H. L.
    SPEECH COMMUNICATION, 2013, 55 (05) : 653 - 666
  • [25] SVM AGAINST GMM/SVM FOR DIALECT INFLUENCE ON AUTOMATIC SPEAKER RECOGNITION TASK
    Zergat, Kawthar
    Amrouche, Abderrahmane
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2014, 13 (02)
  • [26] New scheme based on GMM-PCA-SVM modelling for automatic speaker recognition
    Zergat, Kawthar
    Amrouche, Abderrahmane
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (04) : 373 - 381
  • [27] Speaker Cluster based GMM Tokenization for Speaker Recognition
    Ma, Bin
    Zhu, Donglai
    Tong, Rong
    Li, Haizhou
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 505 - 508
  • [28] A Non-linear GMM KL and GUMI Kernel for SVM Using GMM-UBM Supervector in Home Acoustic Event Classification
    Ngoc Nam Bui
    Kim, Jin Young
    Trinh, Tan Dat
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2014, E97A (08): : 1791 - 1794
  • [29] A non-linear GMM KL and GUMI kernel for SVM using GMM-UBM supervector in home acoustic event classification
    Kim, J.Y. (beyond@jnu.ac.kr), 1791, Institute of Electronics, Information and Communication, Engineers, IEICE (E97-A):
  • [30] Adaboost with SVM using GMM Supervector for Imbalanced Phoneme Data
    Amami, Rimah
    Ben Ayed, Dorra
    Ellouze, Noureddine
    2013 6TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTIONS (HSI), 2013, : 328 - 333