Hard-Mask Missing Feature Theory for Robust Speaker Recognition

被引:4
作者
Lim, Shin-Cheol [1 ]
Jang, Sei-Jin [2 ]
Lee, Soek-Pil [2 ]
Kim, Moo Young [1 ]
机构
[1] Sejong Univ, Human Comp Interact Lab, Dept Informat & Commun Engn, Seoul, South Korea
[2] Korea Elect Technol Inst, Digital Media Res Ctr, Seoul, South Korea
关键词
Speaker recognition; missing feature theory; MFT; AMFT; NOISE;
D O I
10.1109/TCE.2011.6018880
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Compared with conventional full-band speaker recognition systems, Advanced Missing Feature Theory (AMFT) produces a much lower error rate, but requires increased computational complexity. We propose a weighting function for the score calculation algorithm in AMFT. The weighting function is estimated by calculating the number of reliable spectral components. A modified mask is also proposed to reduce the number of reliable components based on the estimated weighting function. In the proposed Hard-mask MFT-8 (HMFT-8), only 8 elements are selected out of 10 spectral components in a feature vector. Compared with the full-band system and the AMFT, the proposed HMFT-8 gives a lower identification error rate by 16.95% and 2.67%, respectively. In terms of computational complexity, AMFT and HMFT-8 require 307 and 41 arithmetic and conditional operations for each frame, respectively.
引用
收藏
页码:1245 / 1250
页数:6
相关论文
共 19 条
[1]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[2]   Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging [J].
Cohen, I .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05) :466-475
[3]  
HIRSCH HG, 1995, INT CONF ACOUST SPEE, P153, DOI 10.1109/ICASSP.1995.479387
[4]   Text-independent speaker identification using soft channel selection in home robot environments [J].
Ji, Mikyong ;
Kim, Sungtak ;
Kim, Hoirin ;
Yoon, Ho-Sub .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (01) :140-144
[5]   Advanced missing feature theory with fast score calculation for noise robust speaker identification [J].
Jung, J. ;
Kim, K. ;
Kim, M. Y. .
ELECTRONICS LETTERS, 2010, 46 (14) :1027-1028
[6]   A Voice Trigger System using Keyword and Speaker Recognition for Mobile Devices [J].
Lee, Hyeopwoo ;
Chang, Sukmoon ;
Yook, Dongsuk ;
Kim, Yongserk .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (04) :2377-2384
[7]  
LIM JS, 1978, IEEE T ACOUST SPEECH, V26, P197, DOI 10.1109/TASSP.1978.1163086
[8]   Noise power spectral density estimation based on optimal smoothing and minimum statistics [J].
Martin, R .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05) :504-512
[9]  
Ming J, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P961
[10]  
Ming J, 2003, 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P420