NOISE ROBUST SPEECH DEREVERBERATION WITH KALMAN SMOOTHER

被引:0
作者
Togami, Masahito [1 ]
Kawaguchi, Yohei [1 ]
机构
[1] Hitachi Ltd, Cent Res Lab, Kokubunji, Tokyo 1858601, Japan
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
Noise reduction; dereverberation; kalman smoother; LINEAR PREDICTION; SEPARATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A speech dereverberation method is proposed that is robust against background noise. In contrast to conventional methods based on the linear prediction of the given microphone input signal, in which the linear prediction coefficients are not fully optimized when there is background noise, the proposed method optimizes the coefficients by linear prediction of the noiseless reverberant speech signal even when there is background noise. The noiseless reverberant speech signal and the parameters are iteratively updated on the basis of the expectation maximization algorithm. In the expectation step, sufficient statistics of latent variables which include noiseless reverberant speech signal are estimated using the Kalman smoother. Unlike the standard Kalman smoother, which uses a time-invariant covariance matrix as a state-transition covariance matrix, the proposed method utilizes a time-varying covariance matrix, enabling it to meet the time-varying speech characteristics. The parameters are updated so that the Q function is increased in the maximization step. Experimental results show that the proposed method is superior to conventional methods under noisy conditions.
引用
收藏
页码:7447 / 7451
页数:5
相关论文
共 50 条
  • [21] Simultaneous Optimization of Acoustic Echo Reduction, Speech Dereverberation, and Noise Reduction against Mutual Interference
    Togami, Masahito
    Kawaguchi, Yohei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (11) : 1612 - 1623
  • [22] A Bayesian Hierarchical Model for Speech Dereverberation
    Laufer, Yaron
    Gannot, Sharon
    2018 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING IN ISRAEL (ICSEE), 2018,
  • [23] ROBUST SPEECH DEREVERBERATION USING SUBBAND MULTICHANNEL LEAST SQUARES WITH VARIABLE RELAXATION
    Lim, Felicia
    Naylor, Patrick A.
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [24] Learning Spectral Mapping for Speech Dereverberation and Denoising
    Han, Kun
    Wang, Yuxuan
    Wang, DeLiang
    Woods, William S.
    Merks, Ivo
    Zhang, Tao
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (06) : 982 - 992
  • [25] End-to-End Dereverberation, Beamforming, and Speech Recognition in a Cocktail Party
    Zhang, Wangyou
    Chang, Xuankai
    Boeddeker, Christoph
    Nakatani, Tomohiro
    Watanabe, Shinji
    Qian, Yanmin
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 (3173-3188) : 3173 - 3188
  • [26] EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
    Richter, Julius
    Wu, Yi-Chiao
    Krenn, Steven
    Welker, Simon
    Lay, Bunlong
    Watanabe, Shinji
    Richard, Alexander
    Gerkmann, Timo
    INTERSPEECH 2024, 2024, : 4873 - 4877
  • [27] Linear Prediction-Based Online Dereverberation and Noise Reduction Using Alternating Kalman Filters
    Braun, Sebastian
    Habets, Emanuel A. P.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (06) : 1115 - 1125
  • [28] An online algorithm for echo cancellation, dereverberation and noise reduction based on a Kalman-EM Method
    Cohen, Nili
    Hazan, Gershon
    Schwartz, Boaz
    Gannot, Sharon
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [29] An online algorithm for echo cancellation, dereverberation and noise reduction based on a Kalman-EM Method
    Nili Cohen
    Gershon Hazan
    Boaz Schwartz
    Sharon Gannot
    EURASIP Journal on Audio, Speech, and Music Processing, 2021
  • [30] Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction
    Dietzen, Thomas
    Spriet, Ann
    Tirry, Wouter
    Doclo, Simon
    Moonen, Marc
    van Waterschoot, Toon
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (03) : 544 - 558