Assessment of Single-Channel Speech Enhancement Techniques for Speaker Identification under Mismatched Conditions

被引：0

作者：

Sadjadi, Seyed Omid ^{[1
]}

Hansen, John H. L. ^{[1
]}

机构：

[1] Univ Texas Dallas, CRSS, Dallas, TX 75230 USA

来源：

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年

关键词：

feature extraction; gammatone filterbank; Hilbert envelope; speaker identification; speech enhancement; RECOGNITION; SIGNAL;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

It is well known that MFCC based speaker identification (SID) systems easily break down under mismatched training and test conditions. In this paper, we report on a study that considers four different single-channel speech enhancement front-ends for robust SID under such conditions. Speech files from the YOHO database are corrupted with four types of noise including babble, car, factory, and white Gaussian at five SNR levels (0-20 dB), and processed using four speech enhancement techniques representing distinct classes of algorithms: spectral subtraction, statistical model-based, subspace, and Wiener filtering. Both processed and unprocessed files are submitted to a SID system trained on clean data. In addition, a new set of acoustic feature parameters based on Hilbert envelope of gammatone filterbank outputs are proposed and evaluated for SID task. Experimental results indicate that: (i) depending on the noise type and SNR level, the enhancement front-ends may help or hurt SID performance, (ii) the proposed feature significantly achieves higher SID accuracy compared to MFCCs under mismatched conditions.

引用

页码：2138 / 2141

页数：4

共 50 条

[1] JOINT SINGLE-CHANNEL SPEECH SEPARATION AND SPEAKER IDENTIFICATION
Mowlaee, P.
Saeidi, R.
Tan, Z. -H.
Christensen, M. G.
Franti, P.
Jensen, S. H.
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4430 - 4433
[2] A Joint Approach for Single-Channel Speaker Identification and Speech Separation
Mowlaee, Pejman
Saeidi, Rahim
Christensen, Mads Grsboll
Tan, Zheng-Hua
Kinnunen, Tomi
Franti, Pasi
Jensen, Soren Holdt
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (09): : 2586 - 2601
[3] Comparative Studies of Single-Channel Speech Enhancement Techniques
Kumar, Bittu
Kumar, Neeraj
Kumar, Manoj
Prasad, S. V. S.
Varma, Ashwini Kumar
Ravi, Banoth
IETE JOURNAL OF RESEARCH, 2024, 70 (06) : 5704 - 5720
[4] Robust Speaker Recognition Based on Single-Channel and Multi-Channel Speech Enhancement
Taherian, Hassan
Wang, Zhong-Qiu
Chang, Jorge
Wang, DeLiang
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1293 - 1302
[5] Speech Enhancement for Speaker Identification
Mahesh, R.
2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
[6] HILBERT ENVELOPE BASED FEATURES FOR ROBUST SPEAKER IDENTIFICATION UNDER REVERBERANT MISMATCHED CONDITIONS
Sadjadi, Seyed Omid
Hansen, John H. L.
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5448 - 5451
[7] Single-channel speech enhancement by subspace affinity minimization
Tran, Dung N.
Koishida, Kazuhito
INTERSPEECH 2020, 2020, : 2447 - 2451
[8] UltraSE: Single-Channel Speech Enhancement Using Ultrasound
Sun, Ke
Zhang, Xinyu
PROCEEDINGS OF THE 27TH ACM ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING (ACM MOBICOM '21), 2021, : 160 - 173
[9] Single-Channel Speech Enhancement Using Double Spectrum
Blass, Martin
Mowlaee, Pejman
Kleijn, W. Bastiaan
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1740 - 1744
[10] A spectral conversion approach to single-channel speech enhancement
Mouchtaris, Athanasios
Van der Spiegel, Jan
Mueller, Paul
Tsakalides, Panagiotis
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1180 - 1193

← 1 2 3 4 5 →