ON THE USE OF SPEAKER SUPERFACTORS FOR SPEAKER RECOGNITION

被引:0
作者
Scheffer, Nicolas [1 ]
Vogt, Robbie [2 ]
机构
[1] SRI Int, Menlo Pk, CA 94025 USA
[2] Queensland Univ Technol, Brisbane, Qld, Australia
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
基金
澳大利亚研究理事会;
关键词
speaker recognition;
D O I
10.1109/ICASSP.2010.5495631
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a new method to characterize a speaker within the Joint Factor Analysis (JFA) framework. Scoring within the JFA framework can be costly and a new method was proposed to produce an accurate score in a fast manner. However, this method is nonsymmetric and performs badly without any score normalization. We propose a new JFA scoring method that is both symmetrical and efficient. In the same way as means of Gaussians can be concatenated to form a supervector, we use several estimates of speaker factors from the eigenvoice space to build a supervector of factors that we call superfactors. We motivate the use of such factors in the current JFA model through comparison with a Tied Factor Analysis model. We show that this method substantially improves the performance of a system that uses only the standard speaker factors to produce scores, and usually outperforms the baseline system. We also show that this method is relatively effective even when score normalization is not an option.
引用
收藏
页码:4410 / 4413
页数:4
相关论文
共 9 条
[1]  
[Anonymous], 2008, J HOPKINS U SUMM WOR
[2]  
Brummer N., 2008, SUN SDV SYSTEM DESCR
[3]  
Dehak N., 2009, P ICASSP 2009
[4]  
Glembek O., 2009, P ICASSP 2009
[5]   Eigenvoice modeling with sparse training data [J].
Kenny, P ;
Boulianne, G ;
Dumouchel, P .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (03) :345-354
[6]   A study of interspeaker variability in speaker verification [J].
Kenny, Patrick ;
Ouellet, Pierre ;
Dehak, Najim ;
Gupta, Vishwa ;
Dumouchel, Pierre .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (05) :980-988
[7]  
Prince S., 2006, P BRIT MACH VIS C, V3, P889
[8]  
Scheffer N., 2009, P ICASSP 2009
[9]  
Vogt R., 2006, P ICASSP 2006