Relevance Vector Machines with Empirical Likelihood-Ratio Kernels for PLDA Speaker Verification

被引：0

作者：

Rao, Wei ^{[1
]}

Mak, Man-Wai ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Elect & Informat Engn Dept, Hong Kong, Peoples R China

来源：

2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014年

关键词：

Relevance Vector Machines; Empirical LR kernel; Probabilistic Linear Discriminant Analysis; I-vectors; NIST SRE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Previous works have shown the benefits of empirical likelihood ratio (LR) kernels for i-vector/PLDA speaker verification. The method not only utilizes the multiple enrollment utterances of target speakers effectively, but also opens up opportunity for adopting sparse kernel machines for PLDA-based speaker verification systems. This paper proposes taking the advantages of the empirical LR kernels by incorporating them into relevance vector machines (RVMs). Results on NIST 2012 SRE demonstrate that the performance of RVM regression equipped with empirical LR kernels is slightly better than that of the support vector machines after performing utterance partitioning.

引用

页码：64 / 68

页数：5

共 23 条

[11] Kung S.Y., 2005, Biometric Authentication: A Machine Learning Approach
[12] Mak MW, 2013, INT CONF ACOUST SPEE, P7702, DOI 10.1109/ICASSP.2013.6639162
[13] A study of voice activity detection techniques for NIST speaker recognition evaluations
Mak, Man-Wai
Yu, Hon-Bill
[J]. COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01) : 295 - 313
[14] Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification
Mak, Man-Wai
Rao, Wei
[J]. SPEECH COMMUNICATION, 2011, 53 (01) : 119 - 130
[15] Pelecanos J., 2001, Proc. Speaker Odyssey, V13, P1
[16] Prince S.J.D., 2007, P 11 INT C COMP VIS
[17] Rajan P., 2013, P INT 2013 LYON FRAN
[18] Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning
Rao, Wei
Mak, Man-Wai
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (05): : 1012 - 1022
[19] Speaker verification using adapted Gaussian mixture models
Reynolds, DA
Quatieri, TF
Dunn, RB
[J]. DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 19 - 41
[20] Input space versus feature space in kernel-based methods
Schölkopf, B
Mika, S
Burges, CJC
Knirsch, P
Müller, KR
Rätsch, G
Smola, AJ
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05): : 1000 - 1017

← 1 2 3 →