Discriminative speaker recognition using large margin GMM

被引：2

作者：

Reda Jourani

Khalid Daoudi

Régine André-Obrecht

Driss Aboutajdine

机构：

[1] University Paul Sabatier,SAMoVA Group, IRIT—UMR 5505 du CNRS

[2] GeoStat Group,Laboratoire LRIT, Faculty of Sciences

[3] INRIA Bordeaux-Sud Ouest,undefined

[4] Mohammed 5 Agdal University,undefined

来源：

Neural Computing and Applications | 2013年 / 22卷

关键词：

Large margin training; Gaussian mixture models; Discriminative learning; Speaker recognition; Session variability modeling;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Most state-of-the-art speaker recognition systems are based on discriminative learning approaches. On the other hand, generative Gaussian mixture models (GMM) have been widely used in speaker recognition during the last decades. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we propose an improvement of this algorithm, which has the major advantage of being computationally highly efficient, thus well suited to handle large-scale databases. We also develop a new strategy to detect and handle the outliers that occur in the training data. To evaluate the performances of our new algorithm, we carry out full NIST speaker identification and verification tasks using NIST-SRE’2006 data, in a Symmetrical Factor Analysis compensation scheme. The results show that our system significantly outperforms the traditional discriminative support vector machines (SVM)-based system of SVM-GMM supervectors, in the two speaker recognition tasks.

引用

页码：1329 / 1336

页数：7

共 50 条

[1] Discriminative speaker recognition using large margin GMM
Jourani, Reda
Daoudi, Khalid
Andre-Obrecht, Regine
Aboutajdine, Driss
NEURAL COMPUTING & APPLICATIONS, 2013, 22 (7-8): : 1329 - 1336
[2] Speaker Identification Using Discriminative Learning of Large Margin GMM
Daoudi, Khalid
Jourani, Reda
Andre-Obrecht, Regine
Aboutajdine, Driss
NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 300 - +
[3] COMBINATION OF SVM AND LARGE MARGIN GMM MODELING FOR SPEAKER IDENTIFICATION
Jourani, Reda
Daoudi, Khalid
Andre-Obrecht, Regine
Aboutajdine, Driss
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
[4] Discriminative training of GMM for speaker identification
delAlamo, CM
Gil, FJC
Munilla, CDL
Gomez, LH
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 89 - 92
[5] TEXT INDEPENDENT SPEAKER RECOGNITION SYSTEM USING GMM
Bagul, S. G.
Shastri, R. K.
2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,
[6] Improving GMM-UBM speaker verification using discriminative feedback adaptation
Chao, Yi-Hsiang
Tsai, Wei-Ho
Wang, Hsin-Min
COMPUTER SPEECH AND LANGUAGE, 2009, 23 (03): : 376 - 388
[7] Constructing the discriminative kernels using GMM for text-independent speaker identification
Lei, ZC
Yang, YC
Wu, ZH
ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3781 : 165 - 171
[8] Speaker Cluster based GMM Tokenization for Speaker Recognition
Ma, Bin
Zhu, Donglai
Tong, Rong
Li, Haizhou
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 505 - 508
[9] Speaker recognition using mfcc and hybrid model of VQ and GMM
Desai, Dhruv
Joshi, Maulin
1600, Springer Verlag (235): : 53 - 63
[10] Speaker recognition using MFCC and hybrid model of VQ and GMM
1600, Springer Verlag (235):

← 1 2 3 4 5 →