HILBERT ENVELOPE BASED FEATURES FOR ROBUST SPEAKER IDENTIFICATION UNDER REVERBERANT MISMATCHED CONDITIONS

Cited by: 0

Authors:

Sadjadi, Seyed Omid [1]

Hansen, John H. L. [1]

Affiliation:

[1] Univ Texas Dallas, CRSS, Richardson, TX 75080 USA

Source:

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011

Keywords:

Gammatone filterbank; Hilbert envelope; mismatched conditions; reverberation suppression; speaker identification

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Subject classification codes:

070206; 082403

Abstract:

It is well known that MFCC-based speaker identification (SID) systems easily break down under mismatched training and test conditions. One such mismatch occurs when an SID system is trained on anechoic speech data, while testing is carried out using reverberant data collected via a distant microphone. In this study, a new set of feature parameters based on the Hilbert envelope of Gammatone filterbank outputs is proposed to improve SID performance in the presence of room reverberation. Considering two distinct perceptual effects of reverberation on speech signals, i.e., coloration and long-term reverberation, two different compensation strategies are integrated within the feature extraction framework to effectively suppress the effects of reverberation. Experimental evaluation is performed using speech material from the TIMIT corpus, four different measured room impulse responses (RIRs) from the Aachen impulse response (AIR) database, and a GMM-based SID system. The results indicate significant improvement over the baseline system with MFCCs plus cepstral mean subtraction (CMS), confirming the effectiveness of the proposed feature parameters for SID under reverberant mismatched conditions.
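
The core idea named in the abstract, taking the Hilbert envelope of each Gammatone filterbank output, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the 4th-order FIR gammatone approximation, the Glasberg and Moore ERB constants, the 25 ms / 10 ms framing, and the log compression are all assumptions made for illustration, and the two reverberation compensation strategies mentioned in the abstract are not reproduced because the record does not describe them.

```python
import numpy as np

def erb(fc):
    """Equivalent rectangular bandwidth in Hz (Glasberg & Moore formula, assumed here)."""
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)

def gammatone_ir(fc, fs, order=4, duration=0.05):
    """Truncated gammatone impulse response centred at fc Hz (illustrative FIR approximation)."""
    t = np.arange(int(duration * fs)) / fs
    b = 1.019 * erb(fc)  # bandwidth scaling commonly used for 4th-order gammatones
    g = t ** (order - 1) * np.exp(-2 * np.pi * b * t) * np.cos(2 * np.pi * fc * t)
    return g / np.sqrt(np.sum(g ** 2))  # normalise to unit energy

def hilbert_envelope(x):
    """Magnitude of the analytic signal, computed via the FFT."""
    n = len(x)
    h = np.zeros(n)
    h[0] = 1.0
    if n % 2 == 0:
        h[n // 2] = 1.0
        h[1:n // 2] = 2.0
    else:
        h[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(np.fft.fft(x) * h))

def envelope_features(signal, fs, center_freqs, frame_len=0.025, hop=0.010):
    """Log of the frame-averaged Hilbert envelope per gammatone channel (hypothetical feature layout)."""
    flen, fhop = int(frame_len * fs), int(hop * fs)
    n_frames = 1 + (len(signal) - flen) // fhop
    feats = np.empty((n_frames, len(center_freqs)))
    for k, fc in enumerate(center_freqs):
        band = np.convolve(signal, gammatone_ir(fc, fs))[:len(signal)]
        env = hilbert_envelope(band)
        for m in range(n_frames):
            frame = env[m * fhop:m * fhop + flen]
            feats[m, k] = np.log(np.mean(frame) + 1e-12)
    return feats

if __name__ == "__main__":
    fs = 16000
    t = np.arange(fs) / fs
    tone = np.cos(2 * np.pi * 1000 * t)  # synthetic 1 kHz test tone, not speech
    feats = envelope_features(tone, fs, [500.0, 1000.0, 2000.0])
    print(feats.shape, feats.mean(axis=0))
```

In a full SID front end, per-channel values like these would typically be decorrelated and passed to the GMM back end; those later stages are outside what this sketch covers.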

Pages: 5448 - 5451

Page count: 4

50 items in total

[1] Robust Speaker Identification in Noisy and Reverberant Conditions
Zhao, Xiaojia
Wang, Yuxuan
Wang, DeLiang
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 836 - 845
[2] ROBUST SPEAKER IDENTIFICATION IN NOISY AND REVERBERANT CONDITIONS
Zhao, Xiaojia
Wang, Yuxuan
Wang, DeLiang
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
[3] Robust Far-Field Speaker Identification under Mismatched Conditions
Jin, Qin
Schultz, Tanja
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008 : 1893 - 1896
[4] An Auditory-Based Feature Extraction Algorithm for Robust Speaker Identification Under Mismatched Conditions
Li, Qi
Huang, Yan
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06) : 1791 - 1801
[5] Mean Hilbert envelope coefficients (MHEC) for robust speaker and language identification
Sadjadi, Seyed Omid
Hansen, John H. L.
SPEECH COMMUNICATION, 2015, 72 : 138 - 148
[6] Cochannel Speaker Identification in Anechoic and Reverberant Conditions
Zhao, Xiaojia
Wang, Yuxuan
Wang, DeLiang
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1727 - 1736
[7] Robust distant automatic speaker identification in reverberant environment
Jiang, Ye
Tang, Zhenmin
Ding, Hui
Journal of Computational Information Systems, 2010, 6 (13) : 4315 - 4324
[8] Rapid Unsupervised Speaker Adaptation Robust in Reverberant Environment Conditions
Gomez, Randy
Even, Jani
Shikano, Kiyohiro
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1309 - +
[9] Mean Hilbert Envelope Coefficients (MHEC) for Robust Speaker Recognition
Sadjadi, Seyed Omid
Hasan, Taufiq
Hansen, John H. L.
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012 : 1694 - 1697
[10] Speaker verification under mismatched data conditions
Pillay, S. G.
Ariyaeeinia, A.
Pawlewski, M.
Sivakumaran, P.
IET SIGNAL PROCESSING, 2009, 3 (04) : 236 - 246
