PLDA Modeling in I-Vector and Supervector Space for Speaker Verification

Cited by: 0
Authors
Jiang, Ye [1]
Lee, Kong Aik
Tang, Zhenmin [1]
Ma, Bin
Larcher, Anthony
Li, Haizhou
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Technol, Nanjing, Jiangsu, Peoples R China
Source
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012
Keywords
speaker verification; i-vector; probabilistic LDA; variability
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we advocate the use of the uncompressed form of the i-vector. We employ probabilistic linear discriminant analysis (PLDA) to handle speaker and session variability in the speaker verification task. An i-vector is a low-dimensional vector, extracted from a speech segment, that contains both speaker and channel information. When PLDA is applied to i-vectors, dimension reduction is performed twice: first in the i-vector extraction process and again in the PLDA model. Keeping the full dimensionality of the i-vector in the supervector space for PLDA modeling and scoring avoids unnecessary loss of information. The drawback of using PLDA on the uncompressed i-vector is the need to invert large matrices, which we show can be done rather efficiently by partitioning a large matrix into smaller blocks. We also introduce the Gaussianized rank-norm, as an alternative to whitening, for feature normalization prior to PLDA modeling.
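The idea of inverting a large matrix through smaller blocks can be illustrated with the standard 2x2 block-inverse identity based on the Schur complement. This is only a generic sketch: the record does not describe the partitioning scheme the authors use, and the function name `block_inverse`, the split index `k`, and the small random test matrix are assumptions made for illustration.

```python
import numpy as np

def block_inverse(M, k):
    """Invert M by splitting it into 2x2 blocks at index k.

    Only the k x k block A and the Schur complement S are inverted
    directly; both are smaller than M itself.
    """
    A, B = M[:k, :k], M[:k, k:]
    C, D = M[k:, :k], M[k:, k:]
    A_inv = np.linalg.inv(A)
    S = D - C @ A_inv @ B            # Schur complement of A in M
    S_inv = np.linalg.inv(S)
    top_left = A_inv + A_inv @ B @ S_inv @ C @ A_inv
    top_right = -A_inv @ B @ S_inv
    bottom_left = -S_inv @ C @ A_inv
    return np.block([[top_left, top_right], [bottom_left, S_inv]])

# Hypothetical test matrix: symmetric positive definite, so A and S are invertible.
rng = np.random.default_rng(0)
X = rng.standard_normal((6, 6))
M = X @ X.T + 6 * np.eye(6)
M_inv = block_inverse(M, 2)
```

The same identity can be applied recursively to the sub-blocks, which is where the savings would come from when the blocks have exploitable structure (for example, diagonal blocks).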
Pages: 1678-1681
Page count: 4