Hard-Mask Missing Feature Theory for Robust Speaker Recognition

被引：4

作者：

Lim, Shin-Cheol ^{[1
]}

Jang, Sei-Jin ^{[2
]}

Lee, Soek-Pil ^{[2
]}

Kim, Moo Young ^{[1
]}

机构：

[1] Sejong Univ, Human Comp Interact Lab, Dept Informat & Commun Engn, Seoul, South Korea

[2] Korea Elect Technol Inst, Digital Media Res Ctr, Seoul, South Korea

来源：

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | 2011年 / 57卷 / 03期

关键词：

Speaker recognition; missing feature theory; MFT; AMFT; NOISE;

D O I：

10.1109/TCE.2011.6018880

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Compared with conventional full-band speaker recognition systems, Advanced Missing Feature Theory (AMFT) produces a much lower error rate, but requires increased computational complexity. We propose a weighting function for the score calculation algorithm in AMFT. The weighting function is estimated by calculating the number of reliable spectral components. A modified mask is also proposed to reduce the number of reliable components based on the estimated weighting function. In the proposed Hard-mask MFT-8 (HMFT-8), only 8 elements are selected out of 10 spectral components in a feature vector. Compared with the full-band system and the AMFT, the proposed HMFT-8 gives a lower identification error rate by 16.95% and 2.67%, respectively. In terms of computational complexity, AMFT and HMFT-8 require 307 and 41 arithmetic and conditional operations for each frame, respectively.

引用

页码：1245 / 1250

页数：6

共 50 条

[41] Feature recovery for noise-robust speaker verification [J].

Huang, Houjun ;

Xu, Yunfei ;

Zhou, Ruohua ;

Yan, Yonghong .

ELECTRONICS LETTERS, 2015, 51 (18) :1459-1461

[42] Improved Deep Speaker Feature Learning for Text-Dependent Speaker Recognition [J].

Li, Lantian ;

Lin, Yiye ;

Zhang, Zhiyong ;

Wang, Dong .

2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, :426-429

[43] Soft missing-feature mask generation for Robot Audition [J].

Takahashi T. ;

Nakadai K. ;

Komatani K. ;

Ogata T. ;

Okuno H.G. .

Paladyn, 2010, 1 (01) :37-47

[44] Speaker Recognition via Statistics of Acoustic Feature Distribution [J].

Li Shaomei ;

Guo Yunfei ;

Wei Hongquan .

MINES 2009: FIRST INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY, VOL 2, PROCEEDINGS, 2009, :190-192

[45] GENDER-DEPENDENT FEATURE EXTRACTION FOR SPEAKER RECOGNITION [J].

Li, Lantian ;

Zheng, Thomas Fang .

2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, :509-513

[46] Speaker Recognition System Based on weighted feature parameter [J].

Zhu, Li ;

Yang, Qing .

INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 :1515-1522

[47] Feature Extraction and Classification Techniques for Speaker Recognition: A Review [J].

Dhameliya, Kinnal ;

Bhatt, Ninad .

2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, SIGNALS, COMMUNICATION AND OPTIMIZATION (EESCO), 2015,

[48] Using MAP Estimation of Feature Transformation for Speaker Recognition [J].

Zhu, Donglai ;

Ma, Bin ;

Li, Haizhou .

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, :849-852

[49] New Feature Vector Extraction Method for Speaker Recognition [J].

Sukhostat, Lyudmila ;

Imamverdiyev, Yadigar .

2012 IV INTERNATIONAL CONFERENCE PROBLEMS OF CYBERNETICS AND INFORMATICS (PCI), 2012,

[50] The Research of Feature Extraction Based on MFCC for Speaker Recognition [J].

Zhang Wanli ;

Li Guoxin .

2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, :1074-1077

← 1 2 3 4 5 →