Sparse kernel machines with empirical kernel maps for PLDA speaker verification

被引:0
作者
Rao, Wei [1 ]
Mak, Man-Wai [1 ]
机构
[1] Hong Kong Polytech Univ, Elect & Informat Engn Dept, Hong Kong, Hong Kong, Peoples R China
关键词
Relevance vector machines; Empirical kernel maps; Probabilistic linear discriminant analysis; I-vectors; NIST SRE; RECOGNITION; SPACE;
D O I
10.1016/j.csl.2016.01.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous studies have demonstrated the benefits of PLDA SVM scoring with empirical kernel maps for i-vector/PLDA speaker verification. The method not only performs significantly better than the conventional PLDA scoring and utilizes the multiple enrollment utterances of target speakers effectively, but also opens up opportunity for adopting sparse kernel machines in PLDAbased speaker verification systems. This paper proposes taking the advantages of empirical kernel maps by incorporating them into a more advanced kernel machine called relevance vector machines (RVMs). The paper reports extensive analyses on the behaviors of RVMs and provides insight into the properties of RVMs and their applications in i-vector/PLDA speaker verification. Results on NIST 2012 SRE demonstrate that PLDA RVM outperforms the conventional PLDA and that it achieves a comparable performance as PLDA SVM. Results also show that PLDA RVM is much sparser than PLDA SVM. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:104 / 121
页数:18
相关论文
共 31 条
[1]  
[Anonymous], ODYSSEY 2006
[2]  
[Anonymous], P INT 2011 FLOR
[3]  
[Anonymous], ACOUST SPEECH SIG PR
[4]  
[Anonymous], DIGITAL SIGNAL PROCE
[5]  
[Anonymous], 2010, P OD 2010
[6]  
[Anonymous], 2011, INTERSPEECH
[7]  
[Anonymous], P INT S CHIN SPOK LA
[8]  
[Anonymous], 2006, Pattern Recognition and Machine Learning
[9]  
[Anonymous], P ICASSP 2014 FLOR I
[10]  
[Anonymous], IEEE INT C NETW SENS