CONSTRAINED DISCRIMINATIVE PLDA TRAINING FOR SPEAKER VERIFICATION

被引:0
|
作者
Rohdin, Johan [1 ]
Biswas, Sangeeta [1 ]
Shinoda, Koichi [1 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo 152, Japan
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年
关键词
PLDA; discriminative training; speaker verification; i-vector;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many studies have proven the effectiveness of discriminative training for speaker verification based on probabilistic linear discriminative analysis (PLDA) with i-vectors as features. Most of them directly optimize the log-likelihood ratio score function of the PLDA model instead of explicitly train the PLDA model. But this optimization process removes some of the constraints that normally are imposed on the PLDA log likelihood ratio score function. This may deteriorate the verification performance when the amount of training data is limited. In this paper, we first show two constraints which the score function should follow, and then we propose a new constrained discriminative training algorithm which keeps these constraints. Our experiments show that our method obtained significant improvements in the verification performance in the male trials of the telephone speaker verification tasks of NIST SRE08 and SRE10.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] UNSUPERVISED DOMAIN ADAPTATION OF NEURAL PLDA USING SEGMENT PAIRS FOR SPEAKER VERIFICATION
    Ulgen, I. Rasim
    Arslan, Levent M.
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 571 - 576
  • [42] SPEAKER VERIFICATION PERFORMANCE WITH CONSTRAINED DURATIONS
    Sordo Martinez, Pablo L.
    Fauve, Benoit
    Larcher, Anthony
    Mason, John S. D.
    2ND INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF2014), 2014,
  • [43] PLDA in the i-supervector space for text-independent speaker verification
    Jiang, Ye
    Lee, Kong Aik
    Wang, Longbiao
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 13
  • [44] Discriminative subspace modeling of SNR and duration variabilities for robust speaker verification
    Li, Na
    Mak, Man-Wai
    Lin, Wei-Wei
    Chien, Jen-Tzung
    COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 83 - 103
  • [45] CHANNEL ADAPTATION OF PLDA FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Chen, Liping
    Lee, Kong Aik
    Ma, Bin
    Guo, Wu
    Li, Haizhou
    Dai, Li Rong
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5251 - 5255
  • [46] Improving PLDA speaker verification performance using domain mismatch compensation techniques
    Rahman, Md Hafizur
    Kanagasundaram, Ahilan
    Himawan, Ivan
    Dean, David
    Sridharan, Sridha
    COMPUTER SPEECH AND LANGUAGE, 2018, 47 : 240 - 258
  • [47] Neural PLDA Modeling for End-to-End Speaker Verification
    Ramoji, Shreyas
    Krishnan, Prashant
    Ganapathy, Sriram
    INTERSPEECH 2020, 2020, : 4333 - 4337
  • [48] DNN-Driven Mixture of PLDA for Robust Speaker Verification
    Li, Na
    Mak, Man-Wai
    Chien, Jen-Tzung
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1371 - 1383
  • [49] I-VECTOR KULLBACK-LEIBLER DIVISIVE NORMALIZATION FOR PLDA SPEAKER VERIFICATION
    Pan, Yilin
    Zheng, Tieran
    Chen, Chen
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 56 - 60
  • [50] Dataset-Invariant Covariance Normalization for Out-domain PLDA Speaker Verification
    Rahman, Md Hafizur
    Kanagasundaram, Ahilan
    Dean, David
    Sridharan, Sridha
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1017 - 1021