Hard-Mask Missing Feature Theory for Robust Speaker Recognition

被引：4

作者：

Lim, Shin-Cheol ^{[1
]}

Jang, Sei-Jin ^{[2
]}

Lee, Soek-Pil ^{[2
]}

Kim, Moo Young ^{[1
]}

机构：

[1] Sejong Univ, Human Comp Interact Lab, Dept Informat & Commun Engn, Seoul, South Korea

[2] Korea Elect Technol Inst, Digital Media Res Ctr, Seoul, South Korea

来源：

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | 2011年 / 57卷 / 03期

关键词：

Speaker recognition; missing feature theory; MFT; AMFT; NOISE;

D O I：

10.1109/TCE.2011.6018880

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Compared with conventional full-band speaker recognition systems, Advanced Missing Feature Theory (AMFT) produces a much lower error rate, but requires increased computational complexity. We propose a weighting function for the score calculation algorithm in AMFT. The weighting function is estimated by calculating the number of reliable spectral components. A modified mask is also proposed to reduce the number of reliable components based on the estimated weighting function. In the proposed Hard-mask MFT-8 (HMFT-8), only 8 elements are selected out of 10 spectral components in a feature vector. Compared with the full-band system and the AMFT, the proposed HMFT-8 gives a lower identification error rate by 16.95% and 2.67%, respectively. In terms of computational complexity, AMFT and HMFT-8 require 307 and 41 arithmetic and conditional operations for each frame, respectively.

引用

页码：1245 / 1250

页数：6

共 19 条

[1] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].

BOLL, SF .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120

[2] Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging [J].

Cohen, I .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05) :466-475

[3]

HIRSCH HG, 1995, INT CONF ACOUST SPEE, P153, DOI 10.1109/ICASSP.1995.479387

[4] Text-independent speaker identification using soft channel selection in home robot environments [J].

Ji, Mikyong ;

Kim, Sungtak ;

Kim, Hoirin ;

Yoon, Ho-Sub .

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (01) :140-144

[5] Advanced missing feature theory with fast score calculation for noise robust speaker identification [J].

Jung, J. ;

Kim, K. ;

Kim, M. Y. .

ELECTRONICS LETTERS, 2010, 46 (14) :1027-1028

[6] A Voice Trigger System using Keyword and Speaker Recognition for Mobile Devices [J].

Lee, Hyeopwoo ;

Chang, Sukmoon ;

Yook, Dongsuk ;

Kim, Yongserk .

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (04) :2377-2384

[7]

LIM JS, 1978, IEEE T ACOUST SPEECH, V26, P197, DOI 10.1109/TASSP.1978.1163086

[8] Noise power spectral density estimation based on optimal smoothing and minimum statistics [J].

Martin, R .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05) :504-512

[9]

Ming J, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P961

[10]

Ming J, 2003, 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P420

← 1 2 →