SPEAKER VERIFICATION USING SIMPLIFIED AND SUPERVISED I-VECTOR MODELING

被引:0
作者
Li, Ming [1 ]
Tsiartas, Andreas [1 ]
Van Segbroeck, Maarten [1 ]
Narayanan, Shrikanth S. [1 ]
机构
[1] Univ So Calif, Signal Anal & Interpretat Lab, Los Angeles, CA 90089 USA
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
Speaker verification; Simplified i-vector; Supervised i-vector; VARIABILITY;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a simplified and supervised i-vector modeling framework that is applied in the task of robust and efficient speaker verification (SRE). First, by concatenating the mean supervector and the i-vector factor loading matrix with respectively the label vector and the linear classifier matrix, the traditional i-vectors are then extended to label-regularized supervised i-vectors. These supervised i-vectors are optimized to not only reconstruct the mean supervectors well but also minimize the mean squared error between the original and the reconstructed label vectors, such that they become more discriminative. Second, factor analysis (FA) can be performed on the pre-normalized centered GMM first order statistics supervector to ensure that the Gaussian statistics sub-vector of each Gaussian component is treated equally in the FA, which reduces the computational cost significantly. Experimental results are reported on the female part of the NIST SRE 2010 task with common condition 5. The proposed supervised i-vector approach outperforms the i-vector baseline by relatively 12% and 7% in terms of equal error rate (EER) and norm old minDCF values, respectively.
引用
收藏
页码:7199 / 7203
页数:5
相关论文
共 50 条
[21]   Speaker Verification Under Adverse Conditions Using I-vector Adaptation and Neural Networks [J].
Alam, Jahangir ;
Kenny, Patrick ;
Bhattacharya, Gautam ;
Kockmann, Marcel .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :3732-3736
[22]   Partial Least Squares Based Total Variability Space Modeling for I-Vector Speaker Verification [J].
CHEN Chen ;
HAN Jiqing .
ChineseJournalofElectronics, 2018, 27 (06) :1229-1233
[23]   Effect of long-term ageing on i-vector speaker verification [J].
Kelly, Finnian ;
Saeidi, Rahim ;
Harte, Naomi ;
van Leeuwen, David .
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, :86-90
[24]   Partial Least Squares Based Total Variability Space Modeling for I-Vector Speaker Verification [J].
Chen Chen ;
Han Jiqing .
CHINESE JOURNAL OF ELECTRONICS, 2018, 27 (06) :1229-1233
[25]   Improving short utterance i-vector speaker verification using utterance variance modelling and compensation techniques [J].
Kanagasundaram, A. ;
Dean, D. ;
Sridharan, S. ;
Gonzalez-Dominguez, J. ;
Gonzalez-Rodriguez, J. ;
Ramos, D. .
SPEECH COMMUNICATION, 2014, 59 :69-82
[26]   Discriminant Analysis Methods Comparison in I-Vector Space for Speaker Verification [J].
Mohammadi, Mohsen ;
Mohammadi, Hamid Reza Sadegh .
2018 9TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2018, :166-172
[27]   Deep Nonlinear Metric Learning for Speaker Verification in the I-Vector Space [J].
Feng, Yong ;
Xiong, Qingyu ;
Shi, Weiren .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (01) :215-219
[28]   Best Feature Selection for Emotional Speaker Verification in i-vector Representation [J].
Mackova, Lenka ;
Cizmar, Anton ;
Juhar, Jozef .
2015 25TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2015, :209-212
[29]   I-Vector DNN Scoring and Calibration for Noise Robust Speaker Verification [J].
Tan, Zhili ;
Mak, Man-Wai .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :1562-1566
[30]   Evaluation of the I-vector System for Text-dependent Speaker Verification [J].
Li, Lin ;
Guo, Huiyang ;
Shang, Fengyi ;
Hong, Qingyang ;
Liu, Kai .
PROCEEDINGS OF 2017 11TH IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2017, :60-63