Discriminative speaker recognition using large margin GMM

被引:2
|
作者
Reda Jourani
Khalid Daoudi
Régine André-Obrecht
Driss Aboutajdine
机构
[1] University Paul Sabatier,SAMoVA Group, IRIT—UMR 5505 du CNRS
[2] GeoStat Group,Laboratoire LRIT, Faculty of Sciences
[3] INRIA Bordeaux-Sud Ouest,undefined
[4] Mohammed 5 Agdal University,undefined
来源
关键词
Large margin training; Gaussian mixture models; Discriminative learning; Speaker recognition; Session variability modeling;
D O I
暂无
中图分类号
学科分类号
摘要
Most state-of-the-art speaker recognition systems are based on discriminative learning approaches. On the other hand, generative Gaussian mixture models (GMM) have been widely used in speaker recognition during the last decades. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we propose an improvement of this algorithm, which has the major advantage of being computationally highly efficient, thus well suited to handle large-scale databases. We also develop a new strategy to detect and handle the outliers that occur in the training data. To evaluate the performances of our new algorithm, we carry out full NIST speaker identification and verification tasks using NIST-SRE’2006 data, in a Symmetrical Factor Analysis compensation scheme. The results show that our system significantly outperforms the traditional discriminative support vector machines (SVM)-based system of SVM-GMM supervectors, in the two speaker recognition tasks.
引用
收藏
页码:1329 / 1336
页数:7
相关论文
共 50 条
  • [1] Discriminative speaker recognition using large margin GMM
    Jourani, Reda
    Daoudi, Khalid
    Andre-Obrecht, Regine
    Aboutajdine, Driss
    NEURAL COMPUTING & APPLICATIONS, 2013, 22 (7-8): : 1329 - 1336
  • [2] Speaker Identification Using Discriminative Learning of Large Margin GMM
    Daoudi, Khalid
    Jourani, Reda
    Andre-Obrecht, Regine
    Aboutajdine, Driss
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 300 - +
  • [3] COMBINATION OF SVM AND LARGE MARGIN GMM MODELING FOR SPEAKER IDENTIFICATION
    Jourani, Reda
    Daoudi, Khalid
    Andre-Obrecht, Regine
    Aboutajdine, Driss
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [4] Discriminative training of GMM for speaker identification
    delAlamo, CM
    Gil, FJC
    Munilla, CDL
    Gomez, LH
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 89 - 92
  • [5] TEXT INDEPENDENT SPEAKER RECOGNITION SYSTEM USING GMM
    Bagul, S. G.
    Shastri, R. K.
    2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,
  • [6] Improving GMM-UBM speaker verification using discriminative feedback adaptation
    Chao, Yi-Hsiang
    Tsai, Wei-Ho
    Wang, Hsin-Min
    COMPUTER SPEECH AND LANGUAGE, 2009, 23 (03): : 376 - 388
  • [7] Constructing the discriminative kernels using GMM for text-independent speaker identification
    Lei, ZC
    Yang, YC
    Wu, ZH
    ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3781 : 165 - 171
  • [8] Speaker Cluster based GMM Tokenization for Speaker Recognition
    Ma, Bin
    Zhu, Donglai
    Tong, Rong
    Li, Haizhou
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 505 - 508
  • [9] Speaker recognition using mfcc and hybrid model of VQ and GMM
    Desai, Dhruv
    Joshi, Maulin
    1600, Springer Verlag (235): : 53 - 63