Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification

被引：0

作者：

Cai, Weicheng ^{[2
,3
]}

Li, Ming ^{[1
,2
]}

Li, Lin ^{[4
]}

Hong, Qingyang ^{[4
]}

机构：

[1] Sun Yat Sen Univ, SYSU CMU Joint Inst Engn, Guangzhou, Guangdong, Peoples R China

[2] SYSU CMU Shunde Int Joint Res Inst, Shunde, Guangdong, Peoples R China

[3] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou, Guangdong, Peoples R China

[4] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China

来源：

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015年

关键词：

PLDA; covariance regularization; i-vector; speaker verification; duration; ROBUST;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we present a covariance regularized probabilistic linear discriminant analysis (CR-PLDA) model for text independent speaker verification. In the conventional simplified PLDA modeling, the covariance matrix used to capture the residual energies is globally shared for all i-vectors. However, we believe that the point estimated i-vectors from longer speech utterances may be more accurate and their corresponding co-variances in the PLDA modeling should be smaller. Similar to the inverse 0th order statistics weighted covariance in the i-vector model training, we propose a duration dependent normalized exponential term containing the duration normalizing factor mu and duration extent factor v to regularize the covariance in the PLDA modeling. Experimental results are reported on the NIST SRE 2010 common condition 5 female part task and the NIST 2014 i-vector machine learning challenge, respectively. For both tasks, the proposed covariance regularized PLDA system outperforms the baseline PLDA system by more than 13% relatively in terms of equal error rate (EER) and norm minDCF values.

引用

页码：1027 / 1031

页数：5

共 50 条

[31] Supervized Mixture of PLDA Models for Cross-Channel Speaker Verification
Simonchik, Konstantin
Pekhovsky, Timur
Shulipa, Andrey
Afanasyev, Anton
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1682 - 1685
[32] Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification
Kanagasundaram, Ahilan
Dean, David
Sridharan, Sridha
Fookes, Clinton
Himawan, Ivan
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1835 - 1838
[33] Non-linear PLDA for i-Vector Speaker Verification
Novoselov, Sergey
Pekhovsky, Timur
Kudashev, Oleg
Mendelev, Valentin
Prudnikov, Alexey
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 214 - 218
[34] Improving the PLDA based Speaker Verification in Limited Microphone Data Conditions
Kanagasundaram, A.
Dean, D.
Gonzalez-Dominguez, J.
Sridharan, S.
Ramos, D.
Gonzalez-Rodriguez, J.
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3641 - 3645
[35] DIFFUSION MAPS FOR PLDA-BASED SPEAKER VERIFICATION
Barkan, Oren
Aronowitz, Hagai
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7639 - 7643
[36] Twin Model G-PLDA for Duration Mismatch Compensation in Text-Independent Speaker Verification
Ma, Jianbo
Sethu, Vidhyasaharan
Arnbikairajah, Eliatharnby
Lee, Kong Aik
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1853 - 1857
[37] Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification
Tur, Gokhan
Shriberg, Elizabeth
Stolcke, Andreas
Kajarekar, Sachin
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2664 - 2667
[38] Domain mismatch modeling of out-domain i-vectors for PLDA speaker verification
Rahman, Md Hafizur
Himawan, Ivan
Dean, David
Sridharan, Sridha
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1581 - 1585
[39] Study of the Effect of I-vector Modeling on Short and Mismatch Utterance Duration for Speaker Verification
Sarkar, A. K.
Matrouf, D.
Bousquet, P. M.
Bonastre, J. F.
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2661 - 2664
[40] PLDA in the i-supervector space for text-independent speaker verification
Ye Jiang
Kong Aik Lee
Longbiao Wang
EURASIP Journal on Audio, Speech, and Music Processing, 2014 (1)

← 1 2 3 4 5 →