NOISE ROBUST SPEECH DEREVERBERATION WITH KALMAN SMOOTHER

被引:0
作者
Togami, Masahito [1 ]
Kawaguchi, Yohei [1 ]
机构
[1] Hitachi Ltd, Cent Res Lab, Kokubunji, Tokyo 1858601, Japan
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
Noise reduction; dereverberation; kalman smoother; LINEAR PREDICTION; SEPARATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A speech dereverberation method is proposed that is robust against background noise. In contrast to conventional methods based on the linear prediction of the given microphone input signal, in which the linear prediction coefficients are not fully optimized when there is background noise, the proposed method optimizes the coefficients by linear prediction of the noiseless reverberant speech signal even when there is background noise. The noiseless reverberant speech signal and the parameters are iteratively updated on the basis of the expectation maximization algorithm. In the expectation step, sufficient statistics of latent variables which include noiseless reverberant speech signal are estimated using the Kalman smoother. Unlike the standard Kalman smoother, which uses a time-invariant covariance matrix as a state-transition covariance matrix, the proposed method utilizes a time-varying covariance matrix, enabling it to meet the time-varying speech characteristics. The parameters are updated so that the Q function is increased in the maximization step. Experimental results show that the proposed method is superior to conventional methods under noisy conditions.
引用
收藏
页码:7447 / 7451
页数:5
相关论文
共 50 条
[21]   Calculating inverse filters for speech dereverberation [J].
Miyoshi, Masato ;
Delcroix, Marc ;
Kinoshita, Keisuke .
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2008, E91A (06) :1303-1309
[22]   A Bayesian Hierarchical Model for Speech Dereverberation [J].
Laufer, Yaron ;
Gannot, Sharon .
2018 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING IN ISRAEL (ICSEE), 2018,
[23]   ROBUST SPEECH DEREVERBERATION USING SUBBAND MULTICHANNEL LEAST SQUARES WITH VARIABLE RELAXATION [J].
Lim, Felicia ;
Naylor, Patrick A. .
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
[24]   Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising [J].
Fujita, Yoto ;
Nugraha, Aditya Arie ;
Di Carlo, Diego ;
Bando, Yoshiaki ;
Fontaine, Mathieu ;
Yoshii, Kazuyoshi .
2024 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2024,
[25]   End-to-End Dereverberation, Beamforming, and Speech Recognition in a Cocktail Party [J].
Zhang, Wangyou ;
Chang, Xuankai ;
Boeddeker, Christoph ;
Nakatani, Tomohiro ;
Watanabe, Shinji ;
Qian, Yanmin .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 (3173-3188) :3173-3188
[26]   Learning Spectral Mapping for Speech Dereverberation and Denoising [J].
Han, Kun ;
Wang, Yuxuan ;
Wang, DeLiang ;
Woods, William S. ;
Merks, Ivo ;
Zhang, Tao .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (06) :982-992
[27]   EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation [J].
Richter, Julius ;
Wu, Yi-Chiao ;
Krenn, Steven ;
Welker, Simon ;
Lay, Bunlong ;
Watanabe, Shinji ;
Richard, Alexander ;
Gerkmann, Timo .
INTERSPEECH 2024, 2024, :4873-4877
[28]   Linear Prediction-Based Online Dereverberation and Noise Reduction Using Alternating Kalman Filters [J].
Braun, Sebastian ;
Habets, Emanuel A. P. .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (06) :1115-1125
[29]   An online algorithm for echo cancellation, dereverberation and noise reduction based on a Kalman-EM Method [J].
Cohen, Nili ;
Hazan, Gershon ;
Schwartz, Boaz ;
Gannot, Sharon .
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
[30]   An online algorithm for echo cancellation, dereverberation and noise reduction based on a Kalman-EM Method [J].
Nili Cohen ;
Gershon Hazan ;
Boaz Schwartz ;
Sharon Gannot .
EURASIP Journal on Audio, Speech, and Music Processing, 2021