Two-Stage Temporal Processing for Single-Channel Speech Enhancement

Cited by: 4

Authors:

Samui, Sunzan [1]

Chakrabarti, Indrajit [1]

Ghosh, Soumya Kanti [1]

Affiliations:

[1] Indian Inst Technol, Kharagpur, W Bengal, India

Source:

17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016

Keywords:

Speech enhancement; noise-reduction; noise estimation; temporal processing; ALGORITHMS

DOI:

10.21437/Interspeech.2016-307

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Most conventional speech enhancement methods that operate in the spectral domain suffer from a spurious artifact called musical noise. These methods also incur extra time overhead for noise power spectral density estimation. In this paper, a speech enhancement framework is proposed that cascades two temporal processing stages. The first stage performs excitation-source-based temporal processing, which involves identifying and boosting the excitation-source-based, speech-specific features present at the gross and fine temporal levels. The second stage provides noise reduction by estimating the standard deviation of the noise in the time domain using a robust estimator. The proposed noise reduction stage is simple to implement and computationally less complex because it does not require noise estimation in the spectral domain as a pre-processing phase. The experimental results show that the proposed scheme produces, on average, a 60-65% improvement in speech quality (PESQ scores) and intelligibility (STOI scores) at 0 and -5 dB input SNR compared with existing standard approaches.
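
The abstract does not say which robust estimator the second stage uses. As an illustration only, the sketch below assumes the median absolute deviation (MAD), a common robust estimate of the standard deviation for Gaussian noise. The function name `mad_sigma` and the synthetic test signal are hypothetical and are not taken from the paper.

```python
import random
import statistics


def mad_sigma(samples):
    """Estimate the noise standard deviation from the median absolute deviation.

    Assumption: the noise is roughly Gaussian, so dividing the MAD by 0.6745
    gives an estimate consistent with the standard deviation.
    """
    samples = list(samples)
    med = statistics.median(samples)
    mad = statistics.median(abs(s - med) for s in samples)
    return mad / 0.6745


# Synthetic time-domain signal: Gaussian noise (sigma = 0.5) plus sparse,
# high-amplitude bursts standing in for speech-like outliers.
rng = random.Random(0)
noise = [rng.gauss(0.0, 0.5) for _ in range(20000)]
contaminated = [s + 10.0 if i % 100 == 0 else s for i, s in enumerate(noise)]

sigma_robust = mad_sigma(contaminated)            # stays near the true noise level
sigma_plain = statistics.pstdev(contaminated)     # inflated by the bursts
```

Under these assumptions the MAD-based estimate stays close to the true noise level of 0.5, whereas the plain standard deviation is pulled upward by the sparse bursts. This is the kind of behaviour that makes a robust estimator attractive for time-domain noise estimation.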


Pages: 3723-3727

Number of pages: 5

50 records in total

[1] Two-Stage Single-Channel Speech Enhancement with Multi-Frame Filtering
Lin, Shaoxiong
Zhang, Wangyou
Qian, Yanmin
APPLIED SCIENCES-BASEL, 2023, 13 (08):
[2] TWO-STAGE DATA-DRIVEN SINGLE CHANNEL SPEECH ENHANCEMENT WITH CEPSTRAL ANALYSIS PRE-PROCESSING
Rao, Yu
Vahanesa, Chetan
Reddy, Chandan K. A.
Panahi, Issa M. S.
2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 702 - 706
[3] Two-stage UNet with channel and temporal-frequency attention for multi-channel speech enhancement
Xu, Shiyun
Cao, Yinghan
Zhang, Zehua
Wang, Mingjiang
SPEECH COMMUNICATION, 2025, 166
[4] Single-channel speech enhancement by subspace affinity minimization
Tran, Dung N.
Koishida, Kazuhito
INTERSPEECH 2020, 2020, : 2447 - 2451
[5] Comparative Studies of Single-Channel Speech Enhancement Techniques
Kumar, Bittu
Kumar, Neeraj
Kumar, Manoj
Prasad, S. V. S.
Varma, Ashwini Kumar
Ravi, Banoth
IETE JOURNAL OF RESEARCH, 2024, 70 (06) : 5704 - 5720
[6] FPGA Implementation of a Phase-Aware Single-Channel Speech Enhancement System
Samui, Suman
Sahu, Pragya
Chakrabarti, Indrajit
Ghosh, Soumya K.
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (11) : 4688 - 4715
[7] INVESTIGATION OF A PARAMETRIC GAIN APPROACH TO SINGLE-CHANNEL SPEECH ENHANCEMENT
Huang, Gongping
Chen, Jingdong
Benesty, Jacob
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 206 - 210
[8] Single-channel speech enhancement using learnable loss mixup
Chang, Oscar
Tran, Dung N.
Koishida, Kazuhito
INTERSPEECH 2021, 2021, : 2696 - 2700
[9] STFT Phase Reconstruction in Voiced Speech for an Improved Single-Channel Speech Enhancement
Krawczyk, Martin
Gerkmann, Timo
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 1931 - 1940
[10] UltraSE: Single-Channel Speech Enhancement Using Ultrasound
Sun, Ke
Zhang, Xinyu
PROCEEDINGS OF THE 27TH ACM ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING (ACM MOBICOM '21), 2021, : 160 - 173
