A robust speaker-adaptive and text-prompted speaker verification system

被引:2
作者
Hong, Qingyang [1 ]
Wang, Sheng [1 ]
Liu, Zhijian [1 ]
机构
[1] Department of Cognitive Science and Fujian Key Lab of Brain-Like Intelligent System Xiamen University, Xiamen
来源
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 2014年 / 8833卷
关键词
Recording playback; Speaker Verification;
D O I
10.1007/978-3-319-12484-1_43
中图分类号
学科分类号
摘要
Currently, the recording playback attack has become a major security risk for speaker verification. The text-independent or text-dependent system is being troubled by it. In this paper, we propose an effective text-prompted system to overcome this problem, in which speaker verification and speech recognition are combined together. We further adopt speaker-adaptive hidden Markov model (HMM) so as to improve the verification performance. After HMM-based speaker adaptation, this system needs not to be retrained at each verification step. Experimental results demonstrated that the proposed method had quite good performance with the equal error rate (EER) lower than 2% and was also robust for different cases. © Springer International Publishing Switzerland 2014.
引用
收藏
页码:385 / 393
页数:8
相关论文
共 9 条
[1]  
Weichen L., Qingyang H., Sheng W., Dawei L., Text-prompted speaker recognition system based on Viterbi-GMM, NCMMSC, (2013)
[2]  
Chen W., Hong Q., Li X., GMM-UBM for Text-Dependent Speaker Recognition, 2012 Third IEEE/IET International Conference on Audio, pp. 16-18, (2012)
[3]  
Li Q., Juang B.-H., Lee C.-H., Automatic verbal information verification for user authentication, IEEE Transactions on Speech and Audio Processing, 8, 5, pp. 585-595, (2000)
[4]  
Li X., Chen K., Mandarin verbal information verification, ICASSP 2002, pp. I833-I836, (2002)
[5]  
Reynolds D.A., Quatieri T.F., Dunn R.B., Speaker Verification Using Adapted Gaussian Mixture Models, Digital Signal Processing, 10, pp. 19-41, (2000)
[6]  
Young S., Evermann G., Gales M., Hain T., Kershaw D., Moore G., Odell J., Ollason D., Povey D., Valtchev V., Woodland P., The HTK Book (for HTK Version 3.4)
[7]  
Doddington G.R., Przybocki M.A., Martin A.F., Reynolds D.A., The NIST speaker recognition evaluation – overview, methodology. Systems, Results, Perspective, Speech Communication, 31, pp. 225-254, (2000)
[8]  
Matsui T., Furui S., Speaker adaptation of tied-mixture-based phoneme models for textprompted speaker recognition, 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1994, 1, (1994)
[9]  
Al-Hassani M.D., Kadhim A.A., Design A Text-Prompt Speaker Recognition System Using LPC-Derived Features, The 13th International Arab Conference on Information Technology ACIT 2012, (2012)