Speaker identification under background noise using features extracted from steady vowel regions

被引:3
作者
Vuppala, Anil Kumar [1 ]
Rao, K. Sreenivasa [2 ]
机构
[1] Int Inst Informat Technol, LTRC, Hyderabad, Andhra Pradesh, India
[2] Indian Inst Technol, Sch Informat Technol, Kharagpur 721302, W Bengal, India
关键词
speaker identification; background noise; steady vowel region; vowel onset points; epochs; REVERBERANT SPEECH; RECOGNITION; VERIFICATION; ENHANCEMENT;
D O I
10.1002/acs.2357
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we are exploring features extracted from steady vowel segments for improving the performance of speaker identification system under background noise. Steady vowel regions are produced by periodic impulse-like excitation and they contain relatively high signal energy. Hence, speaker specific information present in steady vowel regions may be less affected by the noise. In this work, steady vowel regions are determined by using the knowledge of accurate vowel onset points and epochs. Speaker identification studies are carried out using TIMIT database for white and vehicle noises. Universal background model-Gaussian mixture model-based modeling is explored for developing speaker models. Significant improvement in the performance of speaker identification is observed by using features extracted from steady vowel region in presence of noisy environments. Copyright (c) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:781 / 792
页数:12
相关论文
共 28 条
[1]   New LP-Derived Features for Speaker Identification [J].
Assaleh, Khaled T. ;
Mammone, Richard J. .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) :630-638
[2]   AUTOMATIC RECOGNITION OF SPEAKERS FROM THEIR VOICES [J].
ATAL, BS .
PROCEEDINGS OF THE IEEE, 1976, 64 (04) :460-475
[3]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[4]   Speaker recognition: A tutorial [J].
Campbell, JP .
PROCEEDINGS OF THE IEEE, 1997, 85 (09) :1437-1462
[5]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[6]  
Garofolo J. S., 1993, P IEEE ICISIP PHIL P
[7]   Text-independent speaker identification [J].
Gish, Herbert ;
Schmidt, Michael .
IEEE SIGNAL PROCESSING MAGAZINE, 1994, 11 (04) :18-32
[8]  
Kamath S, 2002, INT CONF ACOUST SPEE, P4164
[9]   Enhancement of noisy speech by temporal and spectral processing [J].
Krishnamoorthy, P. ;
Prasanna, S. R. M. .
SPEECH COMMUNICATION, 2011, 53 (02) :154-174
[10]   Application of combined temporal and spectral processing methods for speaker recognition under noisy, reverberant or multi-speaker environments [J].
Krishnamoorthy, P. ;
Prasanna, S. R. Mahadeva .
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2009, 34 (05) :729-754