Assessment of Single-Channel Speech Enhancement Techniques for Speaker Identification under Mismatched Conditions

被引：0

作者：

Sadjadi, Seyed Omid ^{[1
]}

Hansen, John H. L. ^{[1
]}

机构：

[1] Univ Texas Dallas, CRSS, Dallas, TX 75230 USA

来源：

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年

关键词：

feature extraction; gammatone filterbank; Hilbert envelope; speaker identification; speech enhancement; RECOGNITION; SIGNAL;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

It is well known that MFCC based speaker identification (SID) systems easily break down under mismatched training and test conditions. In this paper, we report on a study that considers four different single-channel speech enhancement front-ends for robust SID under such conditions. Speech files from the YOHO database are corrupted with four types of noise including babble, car, factory, and white Gaussian at five SNR levels (0-20 dB), and processed using four speech enhancement techniques representing distinct classes of algorithms: spectral subtraction, statistical model-based, subspace, and Wiener filtering. Both processed and unprocessed files are submitted to a SID system trained on clean data. In addition, a new set of acoustic feature parameters based on Hilbert envelope of gammatone filterbank outputs are proposed and evaluated for SID task. Experimental results indicate that: (i) depending on the noise type and SNR level, the enhancement front-ends may help or hurt SID performance, (ii) the proposed feature significantly achieves higher SID accuracy compared to MFCCs under mismatched conditions.

引用

页码：2138 / 2141

页数：4

共 50 条

[11] CompNet: Complementary network for single-channel speech enhancement
Fan, Cunhang
Zhang, Hongmei
Li, Andong
Xiang, Wang
Zheng, Chengshi
Lv, Zhao
Wu, Xiaopei
NEURAL NETWORKS, 2023, 168 : 508 - 517
[12] Speaker Verification-Based Evaluation of Single-Channel Speech Separation
Maciejewski, Matthew
Watanabe, Shinji
Khudanpur, Sanjeev
INTERSPEECH 2021, 2021, : 3520 - 3524
[13] Speaker Separation Using Visual Speech Features and Single-channel Audio
Khan, Faheem
Milner, Ben
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3263 - 3267
[14] FPGA Implementation of a Phase-Aware Single-Channel Speech Enhancement System
Samui, Suman
Sahu, Pragya
Chakrabarti, Indrajit
Ghosh, Soumya K.
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (11) : 4688 - 4715
[15] Filtering and Refining: A Collaborative-Style Framework for Single-Channel Speech Enhancement
Li, Andong
Zheng, Chengshi
Yu, Guochen
Cai, Juanjuan
Li, Xiaodong
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2156 - 2172
[16] Deep Neural Network for Supervised Single-Channel Speech Enhancement
Saleem, Nasir
Irfan Khattak, Muhammad
Ali, Muhammad Yousaf
Shafi, Muhammad
ARCHIVES OF ACOUSTICS, 2019, 44 (01) : 3 - 12
[17] INVESTIGATION OF A PARAMETRIC GAIN APPROACH TO SINGLE-CHANNEL SPEECH ENHANCEMENT
Huang, Gongping
Chen, Jingdong
Benesty, Jacob
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 206 - 210
[18] SINGLE-CHANNEL SPEECH ENHANCEMENT WITH SEQUENTIALLY TRAINED DNN SYSTEM
Sun, Yang
Xian, Yang
Wang, Wenwu
Naqvi, Syed Mohsen
2019 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2019,
[19] Deep Learning Models for Single-Channel Speech Enhancement on Drones
Mukhutdinov, Dmitrii
Alex, Ashish
Cavallaro, Andrea
Wang, Lin
IEEE ACCESS, 2023, 11 : 22993 - 23007
[20] Single-channel speech enhancement using learnable loss mixup
Chang, Oscar
Tran, Dung N.
Koishida, Kazuhito
INTERSPEECH 2021, 2021, : 2696 - 2700

← 1 2 3 4 5 →