Hard-Mask Missing Feature Theory for Robust Speaker Recognition

被引:4
作者
Lim, Shin-Cheol [1 ]
Jang, Sei-Jin [2 ]
Lee, Soek-Pil [2 ]
Kim, Moo Young [1 ]
机构
[1] Sejong Univ, Human Comp Interact Lab, Dept Informat & Commun Engn, Seoul, South Korea
[2] Korea Elect Technol Inst, Digital Media Res Ctr, Seoul, South Korea
关键词
Speaker recognition; missing feature theory; MFT; AMFT; NOISE;
D O I
10.1109/TCE.2011.6018880
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Compared with conventional full-band speaker recognition systems, Advanced Missing Feature Theory (AMFT) produces a much lower error rate, but requires increased computational complexity. We propose a weighting function for the score calculation algorithm in AMFT. The weighting function is estimated by calculating the number of reliable spectral components. A modified mask is also proposed to reduce the number of reliable components based on the estimated weighting function. In the proposed Hard-mask MFT-8 (HMFT-8), only 8 elements are selected out of 10 spectral components in a feature vector. Compared with the full-band system and the AMFT, the proposed HMFT-8 gives a lower identification error rate by 16.95% and 2.67%, respectively. In terms of computational complexity, AMFT and HMFT-8 require 307 and 41 arithmetic and conditional operations for each frame, respectively.
引用
收藏
页码:1245 / 1250
页数:6
相关论文
共 50 条
[41]   Feature recovery for noise-robust speaker verification [J].
Huang, Houjun ;
Xu, Yunfei ;
Zhou, Ruohua ;
Yan, Yonghong .
ELECTRONICS LETTERS, 2015, 51 (18) :1459-1461
[42]   Improved Deep Speaker Feature Learning for Text-Dependent Speaker Recognition [J].
Li, Lantian ;
Lin, Yiye ;
Zhang, Zhiyong ;
Wang, Dong .
2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, :426-429
[43]   Soft missing-feature mask generation for Robot Audition [J].
Takahashi T. ;
Nakadai K. ;
Komatani K. ;
Ogata T. ;
Okuno H.G. .
Paladyn, 2010, 1 (01) :37-47
[44]   Speaker Recognition via Statistics of Acoustic Feature Distribution [J].
Li Shaomei ;
Guo Yunfei ;
Wei Hongquan .
MINES 2009: FIRST INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY, VOL 2, PROCEEDINGS, 2009, :190-192
[45]   GENDER-DEPENDENT FEATURE EXTRACTION FOR SPEAKER RECOGNITION [J].
Li, Lantian ;
Zheng, Thomas Fang .
2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, :509-513
[46]   Speaker Recognition System Based on weighted feature parameter [J].
Zhu, Li ;
Yang, Qing .
INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 :1515-1522
[47]   Feature Extraction and Classification Techniques for Speaker Recognition: A Review [J].
Dhameliya, Kinnal ;
Bhatt, Ninad .
2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, SIGNALS, COMMUNICATION AND OPTIMIZATION (EESCO), 2015,
[48]   Using MAP Estimation of Feature Transformation for Speaker Recognition [J].
Zhu, Donglai ;
Ma, Bin ;
Li, Haizhou .
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, :849-852
[49]   New Feature Vector Extraction Method for Speaker Recognition [J].
Sukhostat, Lyudmila ;
Imamverdiyev, Yadigar .
2012 IV INTERNATIONAL CONFERENCE PROBLEMS OF CYBERNETICS AND INFORMATICS (PCI), 2012,
[50]   The Research of Feature Extraction Based on MFCC for Speaker Recognition [J].
Zhang Wanli ;
Li Guoxin .
2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, :1074-1077