Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments

被引：41

作者：

Habets, Emanuel A. P. ^{[1
,2
]}

Gannot, Sharon ^{[1
]}

Cohen, Israel ^{[2
]}

Sommen, Piet C. W. ^{[3
]}

机构：

[1] Bar Ilan Univ, Sch Engn, IL-52900 Ramat Gan, Israel

[2] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel

[3] Tech Univ Eindhoven, Dept Elect Engn, Signal Proc Syst Grp, NL-5600 MB Eindhoven, Netherlands

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2008年 / 16卷 / 08期

基金：

以色列科学基金会;

关键词：

Acoustic echo cancellation (AEC); dereverberation; residual echo suppression;

D O I：

10.1109/TASL.2008.2002071

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Hands-free devices are often used in a noisy and reverberant environment. Therefore, the received microphone signal does not only contain the desired near-end speech signal but also interferences such as room reverberation that is caused by the near-end source, background noise and a far-end echo signal that results from the acoustic coupling between the loudspeaker and the microphone. These interferences degrade the fidelity and intelligibility of near-end speech. In the last two decades, postfilters have been developed that can be used in conjunction with a single microphone acoustic echo canceller to enhance the near-end speech. In previous works, spectral enhancement techniques have been used to suppress residual echo and background noise for single microphone acoustic echo cancellers. However, dereverberation of the near-end speech was not addressed in this context. Recently, practically feasible spectral enhancement techniques to suppress reverberation have emerged. In this paper, we derive a novel spectral variance estimator for the late reverberation of the near-end speech. Residual echo will be present at the output of the acoustic echo canceller when the acoustic echo path cannot be completely modeled by the adaptive filter. A spectral variance estimator for the so-called late residual echo that results from the deficient length of the adaptive filter is derived. Both estimators are based on a statistical reverberation model. The model parameters depend on the reverberation time of the room, which can be obtained using the estimated acoustic echo path. A novel postfilter is developed which suppresses late reverberation of the near-end speech, residual echo and background noise, and maintains a constant residual background noise level. Experimental results demonstrate the beneficial use of the developed system for reducing reverberation, residual echo, and background noise.

引用

页码：1433 / 1451

页数：19

共 50 条

[31] METRICGAN-U: UNSUPERVISED SPEECH ENHANCEMENT/ DEREVERBERATION BASED ONLY ON NOISY/ REVERBERATED SPEECH
Fu, Szu-Wei
Yu, Cheng
Hung, Kuo-Hsuan
Ravanelli, Mirco
Tsao, Yu
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7412 - 7416
[32] Single Channel Speech Dereverberation Using the LP Residual Cepstrum
Padaki, Harish
Nathwani, Karan
Hegde, Rajesh M.
2013 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2013,
[33] Pitch estimator for noisy speech signals
Shedied, SA
Gadalah, ME
VanLandingham, HF
SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 97 - 100
[34] SPEECH COMMUNICATION IN VERY NOISY ENVIRONMENTS
CHERRY, C
WILEY, R
NATURE, 1967, 214 (5093) : 1164 - &
[35] SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY
GONG, YF
SPEECH COMMUNICATION, 1995, 16 (03) : 261 - 291
[36] Speech Synthesis enhancement in noisy environments
Bonardo, Davide
Zovato, Enrico
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 789 - 792
[37] Robust Speech Detection for Noisy Environments
Varela, Oscar
Indra, S. A.
San-Segundo, Ruben
Hernandez, Luis A.
IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2011, 26 (11) : 16 - U12
[38] Integrated Speech Enhancement Method Using Noise Suppression and Dereverberation
Yoshioka, Takuya
Nakatani, Tomohiro
Miyoshi, Masato
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (02): : 231 - 246
[39] Blind dereverberation of speech signals using independence transform matrix
Lee, JH
Lee, SY
PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1453 - 1457
[40] Blind dereverberation of monaural speech signals based on harmonic structure
Nakatani, Tomohiro
Miyoshi, Masato
Kinoshita, Keisuke
Systems and Computers in Japan, 2006, 37 (06): : 1 - 12

← 1 2 3 4 5 →