Further feature extraction for speaker recognition

被引:0
作者
Ma, ZY [1 ]
Yang, YC [1 ]
Wu, ZH [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
来源
2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS | 2003年
关键词
MFCC; LPCC; further feature extraction; speaker recognition;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This thesis presents a method of extracting a new speaker's voice features for the purpose of synthetically using the voice of the donor speaker. In the small speaker set, it seems good to recognize speaker by their voice by means of the traditional feature extraction. Nevertheless, the performance of recognizer usually depressed owning to the limited feature space, it is hard to deal with the increasing of speaker set to be recognized Accordingly it proposes a novel feature extraction method, further feature extract (FFE), which is based on some measures such as weight, differential, combination and selection, are taken to explore those voice characteristics that can be used to distinguish different speakers. Experiment based on 138-person YOHO database demonstrates that better performance can be achieved by the proposed method.
引用
收藏
页码:4153 / 4158
页数:6
相关论文
共 11 条
[1]  
[Anonymous], 1999, Biometrics: personal identification in networked society
[2]   AUTOMATIC RECOGNITION OF SPEAKERS FROM THEIR VOICES [J].
ATAL, BS .
PROCEEDINGS OF THE IEEE, 1976, 64 (04) :460-475
[3]   Speaker recognition: A tutorial [J].
Campbell, JP .
PROCEEDINGS OF THE IEEE, 1997, 85 (09) :1437-1462
[4]  
CAMPBELL JP, 1995, P INT C AC SPEECH SI, P341
[5]  
Che C., 1995, EUROSPEECH 1995 4 EU, P625
[6]  
COHEN A, FEATURE SELECTION SP
[7]   Speech Recognition Using Hidden Markov Models with Polynomial Regression Functions as Nonstationary States [J].
Deng, Li ;
Aksmanovic, Mike ;
Sun, Xiaodong ;
Wu, C. F. Jeff .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) :507-520
[8]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[9]  
RABINER LR, FUNDAMENTALS SPEECH, P14
[10]  
REYNOLDS D, 1995, IEEE T SPEECH AUDIO, P72