CONSTRAINED DISCRIMINATIVE PLDA TRAINING FOR SPEAKER VERIFICATION

被引：0

作者：

Rohdin, Johan ^{[1
]}

Biswas, Sangeeta ^{[1
]}

Shinoda, Koichi ^{[1
]}

机构：

[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo 152, Japan

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

PLDA; discriminative training; speaker verification; i-vector;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Many studies have proven the effectiveness of discriminative training for speaker verification based on probabilistic linear discriminative analysis (PLDA) with i-vectors as features. Most of them directly optimize the log-likelihood ratio score function of the PLDA model instead of explicitly train the PLDA model. But this optimization process removes some of the constraints that normally are imposed on the PLDA log likelihood ratio score function. This may deteriorate the verification performance when the amount of training data is limited. In this paper, we first show two constraints which the score function should follow, and then we propose a new constrained discriminative training algorithm which keeps these constraints. Our experiments show that our method obtained significant improvements in the verification performance in the male trials of the telephone speaker verification tasks of NIST SRE08 and SRE10.

引用

页数：5

共 50 条

[41] UNSUPERVISED DOMAIN ADAPTATION OF NEURAL PLDA USING SEGMENT PAIRS FOR SPEAKER VERIFICATION
Ulgen, I. Rasim
Arslan, Levent M.
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 571 - 576
[42] SPEAKER VERIFICATION PERFORMANCE WITH CONSTRAINED DURATIONS
Sordo Martinez, Pablo L.
Fauve, Benoit
Larcher, Anthony
Mason, John S. D.
2ND INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF2014), 2014,
[43] PLDA in the i-supervector space for text-independent speaker verification
Jiang, Ye
Lee, Kong Aik
Wang, Longbiao
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 13
[44] Discriminative subspace modeling of SNR and duration variabilities for robust speaker verification
Li, Na
Mak, Man-Wai
Lin, Wei-Wei
Chien, Jen-Tzung
COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 83 - 103
[45] CHANNEL ADAPTATION OF PLDA FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Chen, Liping
Lee, Kong Aik
Ma, Bin
Guo, Wu
Li, Haizhou
Dai, Li Rong
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5251 - 5255
[46] Improving PLDA speaker verification performance using domain mismatch compensation techniques
Rahman, Md Hafizur
Kanagasundaram, Ahilan
Himawan, Ivan
Dean, David
Sridharan, Sridha
COMPUTER SPEECH AND LANGUAGE, 2018, 47 : 240 - 258
[47] Neural PLDA Modeling for End-to-End Speaker Verification
Ramoji, Shreyas
Krishnan, Prashant
Ganapathy, Sriram
INTERSPEECH 2020, 2020, : 4333 - 4337
[48] DNN-Driven Mixture of PLDA for Robust Speaker Verification
Li, Na
Mak, Man-Wai
Chien, Jen-Tzung
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1371 - 1383
[49] I-VECTOR KULLBACK-LEIBLER DIVISIVE NORMALIZATION FOR PLDA SPEAKER VERIFICATION
Pan, Yilin
Zheng, Tieran
Chen, Chen
2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 56 - 60
[50] Dataset-Invariant Covariance Normalization for Out-domain PLDA Speaker Verification
Rahman, Md Hafizur
Kanagasundaram, Ahilan
Dean, David
Sridharan, Sridha
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1017 - 1021

← 1 2 3 4 5 →