Relevance Vector Machines with Empirical Likelihood-Ratio Kernels for PLDA Speaker Verification

Cited by: 0
Authors
Rao, Wei [1]
Mak, Man-Wai [1]
Affiliation
[1] Hong Kong Polytech Univ, Elect & Informat Engn Dept, Hong Kong, Peoples R China
Source
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014
Keywords
Relevance Vector Machines; Empirical LR kernel; Probabilistic Linear Discriminant Analysis; I-vectors; NIST SRE;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Previous work has shown the benefits of empirical likelihood ratio (LR) kernels for i-vector/PLDA speaker verification. The method not only makes effective use of the multiple enrollment utterances of target speakers, but also opens up the opportunity to adopt sparse kernel machines in PLDA-based speaker verification systems. This paper proposes exploiting the advantages of empirical LR kernels by incorporating them into relevance vector machines (RVMs). Results on NIST 2012 SRE demonstrate that the performance of RVM regression equipped with empirical LR kernels is slightly better than that of support vector machines after utterance partitioning is performed.
Pages: 64-68
Page count: 5
Related papers
23 in total
  • [1] [Anonymous], P INT 2011 FLOR
  • [2] [Anonymous], 2011, INTERSPEECH
  • [3] [Anonymous], P ICASSP 2014 FLOR I
  • [4] [Anonymous], IEEE INT C NETW SENS
  • [5] [Anonymous], 2011, INTERSPEECH
  • [6] EFFECTIVENESS OF LINEAR PREDICTION CHARACTERISTICS OF SPEECH WAVE FOR AUTOMATIC SPEAKER IDENTIFICATION AND VERIFICATION
    ATAL, BS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (06) : 1304 - 1312
  • [7] Bishop C., 2006, PATTERN RECOGN, DOI 10.1117/1.2819119
  • [8] Front-End Factor Analysis for Speaker Verification
    Dehak, Najim
    Kenny, Patrick J.
    Dehak, Reda
    Dumouchel, Pierre
    Ouellet, Pierre
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04) : 788 - 798
  • [9] Hatch AO, 2006, INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, P1471
  • [10] Kenny P., 2010, OD 2010 SPEAK LANG R, P14