Hard-Mask Missing Feature Theory for Robust Speaker Recognition

Cited by: 4
Authors
Lim, Shin-Cheol [1]
Jang, Sei-Jin [2]
Lee, Soek-Pil [2]
Kim, Moo Young [1]
Affiliations
[1] Sejong Univ, Human Comp Interact Lab, Dept Informat & Commun Engn, Seoul, South Korea
[2] Korea Elect Technol Inst, Digital Media Res Ctr, Seoul, South Korea
Keywords
Speaker recognition; missing feature theory; MFT; AMFT; noise
DOI
10.1109/TCE.2011.6018880
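Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract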
Compared with conventional full-band speaker recognition systems, Advanced Missing Feature Theory (AMFT) yields a much lower error rate, but at the cost of higher computational complexity. We propose a weighting function for the score calculation algorithm in AMFT. The weighting function is estimated from the number of reliable spectral components. A modified mask is also proposed that reduces the number of reliable components based on the estimated weighting function. In the proposed Hard-mask MFT-8 (HMFT-8), only 8 of the 10 spectral components in a feature vector are selected. Compared with the full-band system and AMFT, the proposed HMFT-8 lowers the identification error rate by 16.95% and 2.67%, respectively. In terms of computational complexity, AMFT and HMFT-8 require 307 and 41 arithmetic and conditional operations per frame, respectively.
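
The hard-mask idea in the abstract (keep 8 of 10 spectral components and score only those) can be illustrated with a minimal sketch. This is not the authors' implementation: the per-component reliability scores, the diagonal-Gaussian scoring, and the names `hard_mask` and `masked_log_likelihood` are all assumptions made for illustration, and the weighting function described in the abstract is not modeled here.

```python
import math


def hard_mask(reliability, keep=8):
    """Return a 0/1 mask that keeps the `keep` most reliable components.

    `reliability` is a hypothetical per-component score (for example a
    local SNR estimate); how the paper derives reliability is not given
    in the abstract.
    """
    order = sorted(range(len(reliability)),
                   key=lambda i: reliability[i], reverse=True)
    kept = set(order[:keep])
    return [1 if i in kept else 0 for i in range(len(reliability))]


def masked_log_likelihood(x, mean, var, mask):
    """Diagonal-Gaussian log-likelihood summed over unmasked components.

    Components with mask 0 are skipped entirely (an assumed scoring rule).
    """
    total = 0.0
    for xi, mi, vi, keep in zip(x, mean, var, mask):
        if keep:
            total += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
    return total


# Usage: a 10-component feature vector with made-up reliability scores.
reliability = [0.9, 0.1, 0.8, 0.7, 0.2, 0.95, 0.6, 0.85, 0.75, 0.65]
mask = hard_mask(reliability, keep=8)
score = masked_log_likelihood([0.0] * 10, [0.0] * 10, [1.0] * 10, mask)
```

Skipping the two least reliable components means each frame evaluates 8 Gaussian terms instead of 10; the operation counts quoted in the abstract come from the paper's own algorithm, not from this sketch.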
Pages: 1245-1250
Page count: 6