HILBERT ENVELOPE BASED FEATURES FOR ROBUST SPEAKER IDENTIFICATION UNDER REVERBERANT MISMATCHED CONDITIONS

Cited by: 0
Authors
Sadjadi, Seyed Omid [1]
Hansen, John H. L. [1]
Affiliations
[1] Univ Texas Dallas, CRSS, Richardson, TX 75080 USA
Source
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011
Keywords
Gammatone filterbank; Hilbert envelope; mismatched conditions; reverberation suppression; speaker identification;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
It is well known that MFCC-based speaker identification (SID) systems easily break down under mismatched training and test conditions. One such mismatch occurs when a SID system is trained on anechoic speech data, while testing is carried out using reverberant data collected via a distant microphone. In this study, a new set of feature parameters based on the Hilbert envelope of Gammatone filterbank outputs is proposed to improve SID performance in the presence of room reverberation. Considering two distinct perceptual effects of reverberation on speech signals, i.e., coloration and long-term reverberation, two different compensation strategies are integrated within the feature extraction framework to suppress the effects of reverberation. Experimental evaluation is performed using speech material from the TIMIT corpus, four different measured room impulse responses (RIRs) from the Aachen Impulse Response (AIR) database, and a GMM-based SID system. The results indicate significant improvement over the baseline system with MFCCs plus cepstral mean subtraction (CMS), confirming the effectiveness of the proposed feature parameters for SID under reverberant mismatched conditions.
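The front end named in the abstract, Hilbert envelopes of Gammatone filterbank outputs, can be illustrated with a minimal sketch. It is not the authors' implementation. The filter order, the Glasberg-Moore ERB bandwidth formula, the 25 ms impulse-response length, the center frequencies, and all function names are assumptions chosen for illustration, and the two reverberation compensation strategies mentioned in the abstract are not modeled.

```python
import numpy as np

def gammatone_ir(fc, fs, duration=0.025, order=4):
    """FIR approximation of a gammatone impulse response centered at fc (Hz).

    Assumed design: 4th-order gammatone, bandwidth 1.019 * ERB(fc).
    """
    t = np.arange(int(duration * fs)) / fs
    erb = 24.7 * (4.37 * fc / 1000.0 + 1.0)  # equivalent rectangular bandwidth, Hz
    b = 1.019 * erb
    ir = t ** (order - 1) * np.exp(-2 * np.pi * b * t) * np.cos(2 * np.pi * fc * t)
    return ir / np.sqrt(np.sum(ir ** 2))  # normalize to unit energy

def hilbert_envelope(x):
    """Magnitude of the analytic signal, computed via the FFT."""
    n = len(x)
    spectrum = np.fft.fft(x)
    # Keep DC (and Nyquist for even n), double positive frequencies,
    # zero out negative frequencies.
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(spectrum * weights))

def envelope_features(signal, fs, center_freqs):
    """Per-channel Hilbert envelopes, shape (num_channels, num_samples)."""
    return np.stack([
        hilbert_envelope(np.convolve(signal, gammatone_ir(fc, fs), mode="same"))
        for fc in center_freqs
    ])

# Toy input: a 1 kHz tone, 0.25 s at 16 kHz (hypothetical test signal).
fs = 16000
t = np.arange(fs // 4) / fs
tone = np.sin(2 * np.pi * 1000 * t)
envs = envelope_features(tone, fs, center_freqs=[500.0, 1000.0, 2000.0])
```

In this toy case the channel centered at 1 kHz carries most of the envelope energy. A full feature extractor would still need framing, the compensation stages, and a compact parameterization of the envelopes, none of which the abstract specifies.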
Pages: 5448-5451
Page count: 4
Related papers
46 in total
[21]   Speaker Identification Approach Based On Time Domain Extracted Features [J].
Lupu, Eugen ;
Emerich, Simina .
PROCEEDINGS ELMAR-2010, 2010, :355-358
[22]   Consolidating Product Spectrum and Gammatone Filterbank for Robust Speaker Verification under noisy conditions [J].
Fedila, Meriem ;
Bengherabi, Messaoud ;
Amrouche, Abderrahmane .
2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, :347-352
[23]   Speaker Identification Under Noisy Conditions Using Hybrid Deep Learning Model [J].
Lambamo, Wondimu ;
Srinivasagan, Ramasamy ;
Jifara, Worku .
PAN-AFRICAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PT I, PANAFRICON AI 2023, 2024, 2068 :154-175
[25]   Developing sequentially trained robust Punjabi speech recognition system under matched and mismatched conditions [J].
Bawa, Puneet ;
Kadyan, Virender ;
Tripathy, Abinash ;
Singh, Thipendra P. .
COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (01) :1-23
[27]   A robust DNN model for text-independent speaker identification using non-speaker embeddings in diverse data conditions [J].
Shome, Nirupam ;
Saritha, Banala ;
Kashyap, Richik ;
Laskar, Rabul Hussain .
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (26) :18933-18947
[28]   Robust Speaker Identification System Based on Two-Stage Vector Quantization [J].
Chen, Wan-Chen ;
Hsieh, Ching-Tang ;
Hsu, Chih-Hsu .
JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2008, 11 (04) :357-366
[29]   Robust Speaker Identification Based On Hybrid Model of VQ and GMM-UBM [J].
Nguyen, Vu X. ;
Nguyen, Vu P. H. ;
Pham, Tuan V. .
2015 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2015, :490-495
[30]   SPARSITY BASED ROBUST SPEAKER IDENTIFICATION USING A DISCRIMINATIVE DICTIONARY LEARNING APPROACH [J].
Tzagkarakis, Christos ;
Mouchtaris, Athanasios .
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,