Hard-Mask Missing Feature Theory for Robust Speaker Recognition

Cited by: 4

Authors:

Lim, Shin-Cheol [1]

Jang, Sei-Jin [2]

Lee, Soek-Pil [2]

Kim, Moo Young [1]

Affiliations:

[1] Sejong Univ, Human Comp Interact Lab, Dept Informat & Commun Engn, Seoul, South Korea

[2] Korea Elect Technol Inst, Digital Media Res Ctr, Seoul, South Korea

Source:

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | 2011, Vol. 57, No. 3

Keywords:

Speaker recognition; missing feature theory; MFT; AMFT; noise

DOI:

10.1109/TCE.2011.6018880

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification codes:

0808; 0809

Abstract:

Compared with conventional full-band speaker recognition systems, Advanced Missing Feature Theory (AMFT) produces a much lower error rate, but requires increased computational complexity. We propose a weighting function for the score calculation algorithm in AMFT. The weighting function is estimated by calculating the number of reliable spectral components. A modified mask is also proposed to reduce the number of reliable components based on the estimated weighting function. In the proposed Hard-mask MFT-8 (HMFT-8), only 8 elements are selected out of 10 spectral components in a feature vector. Compared with the full-band system and the AMFT, the proposed HMFT-8 gives a lower identification error rate by 16.95% and 2.67%, respectively. In terms of computational complexity, AMFT and HMFT-8 require 307 and 41 arithmetic and conditional operations for each frame, respectively.
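The abstract describes keeping only 8 of the 10 spectral components in each feature vector. It does not say how reliability is measured or how the weighting function enters the score, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm. It assumes a per-component reliability score (hypothetical, for example a local SNR estimate) and a diagonal-Gaussian speaker model. The function names `hard_mask_top_k` and `masked_log_likelihood` are invented for illustration.

```python
import math

def hard_mask_top_k(reliability, k=8):
    """Return a 0/1 mask that keeps the k components with the highest
    reliability score. The score itself is an assumption of this sketch."""
    order = sorted(range(len(reliability)),
                   key=lambda i: reliability[i], reverse=True)
    keep = set(order[:k])
    return [1 if i in keep else 0 for i in range(len(reliability))]

def masked_log_likelihood(x, mean, var, mask):
    """Diagonal-Gaussian log-likelihood summed over masked-in components
    only; masked-out components are simply ignored (assumed scoring rule)."""
    total = 0.0
    for xi, mu, v, m in zip(x, mean, var, mask):
        if m:
            total += -0.5 * (math.log(2.0 * math.pi * v) + (xi - mu) ** 2 / v)
    return total

# Hypothetical reliability scores for a 10-component feature vector.
reliability = [5.0, 1.0, 7.0, 3.0, 9.0, 0.5, 6.0, 8.0, 4.0, 2.0]
mask = hard_mask_top_k(reliability, k=8)

# Score one frame against a toy zero-mean, unit-variance model.
frame = [0.0] * 10
score = masked_log_likelihood(frame, [0.0] * 10, [1.0] * 10, mask)
```

Because the mask is binary and has a fixed size, each frame is scored over exactly 8 components. That is one plausible reading of why a hard mask would need fewer per-frame operations than a soft or variable-size mask.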

Pages: 1245-1250

Page count: 6
