Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments

被引:41
|
作者
Habets, Emanuel A. P. [1 ,2 ]
Gannot, Sharon [1 ]
Cohen, Israel [2 ]
Sommen, Piet C. W. [3 ]
机构
[1] Bar Ilan Univ, Sch Engn, IL-52900 Ramat Gan, Israel
[2] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
[3] Tech Univ Eindhoven, Dept Elect Engn, Signal Proc Syst Grp, NL-5600 MB Eindhoven, Netherlands
基金
以色列科学基金会;
关键词
Acoustic echo cancellation (AEC); dereverberation; residual echo suppression;
D O I
10.1109/TASL.2008.2002071
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Hands-free devices are often used in a noisy and reverberant environment. Therefore, the received microphone signal does not only contain the desired near-end speech signal but also interferences such as room reverberation that is caused by the near-end source, background noise and a far-end echo signal that results from the acoustic coupling between the loudspeaker and the microphone. These interferences degrade the fidelity and intelligibility of near-end speech. In the last two decades, postfilters have been developed that can be used in conjunction with a single microphone acoustic echo canceller to enhance the near-end speech. In previous works, spectral enhancement techniques have been used to suppress residual echo and background noise for single microphone acoustic echo cancellers. However, dereverberation of the near-end speech was not addressed in this context. Recently, practically feasible spectral enhancement techniques to suppress reverberation have emerged. In this paper, we derive a novel spectral variance estimator for the late reverberation of the near-end speech. Residual echo will be present at the output of the acoustic echo canceller when the acoustic echo path cannot be completely modeled by the adaptive filter. A spectral variance estimator for the so-called late residual echo that results from the deficient length of the adaptive filter is derived. Both estimators are based on a statistical reverberation model. The model parameters depend on the reverberation time of the room, which can be obtained using the estimated acoustic echo path. A novel postfilter is developed which suppresses late reverberation of the near-end speech, residual echo and background noise, and maintains a constant residual background noise level. Experimental results demonstrate the beneficial use of the developed system for reducing reverberation, residual echo, and background noise.
引用
收藏
页码:1433 / 1451
页数:19
相关论文
共 50 条
  • [31] METRICGAN-U: UNSUPERVISED SPEECH ENHANCEMENT/ DEREVERBERATION BASED ONLY ON NOISY/ REVERBERATED SPEECH
    Fu, Szu-Wei
    Yu, Cheng
    Hung, Kuo-Hsuan
    Ravanelli, Mirco
    Tsao, Yu
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7412 - 7416
  • [32] Single Channel Speech Dereverberation Using the LP Residual Cepstrum
    Padaki, Harish
    Nathwani, Karan
    Hegde, Rajesh M.
    2013 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2013,
  • [33] Pitch estimator for noisy speech signals
    Shedied, SA
    Gadalah, ME
    VanLandingham, HF
    SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 97 - 100
  • [34] SPEECH COMMUNICATION IN VERY NOISY ENVIRONMENTS
    CHERRY, C
    WILEY, R
    NATURE, 1967, 214 (5093) : 1164 - &
  • [35] SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY
    GONG, YF
    SPEECH COMMUNICATION, 1995, 16 (03) : 261 - 291
  • [36] Speech Synthesis enhancement in noisy environments
    Bonardo, Davide
    Zovato, Enrico
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 789 - 792
  • [37] Robust Speech Detection for Noisy Environments
    Varela, Oscar
    Indra, S. A.
    San-Segundo, Ruben
    Hernandez, Luis A.
    IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2011, 26 (11) : 16 - U12
  • [38] Integrated Speech Enhancement Method Using Noise Suppression and Dereverberation
    Yoshioka, Takuya
    Nakatani, Tomohiro
    Miyoshi, Masato
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (02): : 231 - 246
  • [39] Blind dereverberation of speech signals using independence transform matrix
    Lee, JH
    Lee, SY
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1453 - 1457
  • [40] Blind dereverberation of monaural speech signals based on harmonic structure
    Nakatani, Tomohiro
    Miyoshi, Masato
    Kinoshita, Keisuke
    Systems and Computers in Japan, 2006, 37 (06): : 1 - 12