Relevance Vector Machines with Empirical Likelihood-Ratio Kernels for PLDA Speaker Verification

被引:0
作者
Rao, Wei [1 ]
Mak, Man-Wai [1 ]
机构
[1] Hong Kong Polytech Univ, Elect & Informat Engn Dept, Hong Kong, Peoples R China
来源
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014年
关键词
Relevance Vector Machines; Empirical LR kernel; Probabilistic Linear Discriminant Analysis; I-vectors; NIST SRE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous works have shown the benefits of empirical likelihood ratio (LR) kernels for i-vector/PLDA speaker verification. The method not only utilizes the multiple enrollment utterances of target speakers effectively, but also opens up opportunity for adopting sparse kernel machines for PLDA-based speaker verification systems. This paper proposes taking the advantages of the empirical LR kernels by incorporating them into relevance vector machines (RVMs). Results on NIST 2012 SRE demonstrate that the performance of RVM regression equipped with empirical LR kernels is slightly better than that of the support vector machines after performing utterance partitioning.
引用
收藏
页码:64 / 68
页数:5
相关论文
共 23 条
  • [11] Kung S.Y., 2005, Biometric Authentication: A Machine Learning Approach
  • [12] Mak MW, 2013, INT CONF ACOUST SPEE, P7702, DOI 10.1109/ICASSP.2013.6639162
  • [13] A study of voice activity detection techniques for NIST speaker recognition evaluations
    Mak, Man-Wai
    Yu, Hon-Bill
    [J]. COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01) : 295 - 313
  • [14] Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification
    Mak, Man-Wai
    Rao, Wei
    [J]. SPEECH COMMUNICATION, 2011, 53 (01) : 119 - 130
  • [15] Pelecanos J., 2001, Proc. Speaker Odyssey, V13, P1
  • [16] Prince S.J.D., 2007, P 11 INT C COMP VIS
  • [17] Rajan P., 2013, P INT 2013 LYON FRAN
  • [18] Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning
    Rao, Wei
    Mak, Man-Wai
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (05): : 1012 - 1022
  • [19] Speaker verification using adapted Gaussian mixture models
    Reynolds, DA
    Quatieri, TF
    Dunn, RB
    [J]. DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 19 - 41
  • [20] Input space versus feature space in kernel-based methods
    Schölkopf, B
    Mika, S
    Burges, CJC
    Knirsch, P
    Müller, KR
    Rätsch, G
    Smola, AJ
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05): : 1000 - 1017