Speaker verification robust to talking style variation using multiple kernel learning based on conditional entropy minimization

Cited by: 0
Authors
Ogawa, Tetsuji [1]
Hino, Hideitsu [1]
Murata, Noboru [1]
Kobayashi, Tetsunori [1]
Affiliations
[1] Waseda Inst Adv Study, Tokyo, Japan
Source
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011
Keywords
Multiple kernel learning; MCEM; intra-speaker variation; speaker verification
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We developed a new speaker verification system that is robust to intra-speaker variation. Intra-speaker variation is likely to arise from changes in talking style, the period in which an individual speaks, and similar factors, and it is well known that such variation generally degrades the performance of speaker verification systems. To address this problem, we applied multiple kernel learning (MKL) based on conditional entropy minimization, which forces the data of each speaker class to aggregate compactly and keeps different speaker classes far apart from each other. Experimental results showed that the proposed speaker verification system was more robust to intra-speaker variation caused by changes in talking style than the conventional maximum-margin-based system.
Pages: 2752 / +
Number of pages: 2
Related papers
14 in total
[1] [Anonymous], 2005, Proceedings of Interspeech
[2] Aronowitz H., 2005, P INTERSPEECH, P2177
[3] Campbell, WM; Sturim, DE; Reynolds, DA. Support vector machines using GMM supervectors for speaker verification [J]. IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (05): 308-311
[4] Do H, 2009, LECT NOTES ARTIF INT, V5781, P330
[5] Faivishevsky L., 2009, Advances in Neural Information Processing Systems, V21, P433
[6] Hino H., 2010, P ICMLA DEC
[7] Lanckriet GRG, 2004, J MACH LEARN RES, V5, P27
[8] Longworth, C.; Gales, M. J. F. Multiple Kernel Learning for speaker verification [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008: 1581-1584
[9] Mariethoz, Johnny; Bengio, Samy. A kernel trick for sequences applied to text-independent speaker verification systems [J]. PATTERN RECOGNITION, 2007, 40 (08): 2315-2324
[10] Mika S., 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468), P41, DOI 10.1109/NNSP.1999.788121