Online Dereverberation for Dynamic Scenarios Using a Kalman Filter With an Autoregressive Model

被引:34
作者
Braun, Sebastian [1 ]
Habets, Emanuel A. P. [1 ]
机构
[1] Int Audio Labs Erlangen, D-91058 Erlangen, Germany
关键词
Dereverberation; Kalman filter; Markov model; multichannel autoregressive (AR) model; SPEECH DEREVERBERATION; LINEAR PREDICTION; SUPPRESSION;
D O I
10.1109/LSP.2016.2616888
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Reverberant signals can be modeled in the short-time Fourier transform domain using a frequency-dependent autore-gressive (AR) model. In state-of-the-art, these AR coefficients have been considered stationary, which does not hold in time-varying environments. We propose to model these AR coefficients using a first-order Markov process, whereas the reverberant microphone signal observations are considered deterministic. This leads to a framework where the AR coefficients can be optimally estimated using a Kalman filter per subband. As a consequence, we can dereverberate the observed signals by applying the estimated AR coefficients as an adaptive linear filter per subband. Estimators for the required statistical parameters in the Kalman filter are derived. Due to the adaptive solution, the algorithm is suitable for real-time applications. It is shown that the proposed method outperforms an existing recursive least-squares solution in terms of reverberation reduction, convergence time, and tracking changes in the acoustic scene.
引用
收藏
页码:1741 / 1745
页数:5
相关论文
共 23 条
[1]  
[Anonymous], 2006, IEEE VEH TECHN C MEL
[2]   A multichannel diffuse power estimator for dereverberation in the presence of multiple sources [J].
Braun, Sebastian ;
Habets, Emanuel A. P. .
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015, :1-14
[3]   Dereverberation and denoising using multichannel linear prediction [J].
Delcroix, Marc ;
Hikichi, Takafumi ;
Miyoshi, Masato .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (06) :1791-1801
[4]   Frequency-domain adaptive Kalman filter for acoustic echo control in hands-free telephones [J].
Enzner, G ;
Vary, P .
SIGNAL PROCESSING, 2006, 86 (06) :1140-1156
[5]   Correlation-Based and Model-Based Blind Single-Channel Late-Reverberation Suppression in Noisy Time-Varying Acoustical Environments [J].
Erkelens, Jan S. ;
Heusdens, Richard .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07) :1746-1765
[6]   Speech enhancement using a modified Kalman Filter based on complex linear prediction and supergaussian priors [J].
Esch, Thomas ;
Vary, Peter .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :4877-4880
[7]   Speech Processing Utilizing the Kalman Filter [J].
Gannot, Sharon .
IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2012, 15 (03) :10-14
[8]   Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors [J].
Jukic, Ante ;
van Waterschoot, Toon ;
Gerkmann, Timo ;
Doclo, Simon .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (09) :1509-1520
[9]  
Kalman RE., 1960, J BASIC ENG, V82, P35, DOI [DOI 10.1115/1.3662552, 10.1115/1.3662552]
[10]   INVERSE FILTERING OF ROOM ACOUSTICS [J].
MIYOSHI, M ;
KANEDA, Y .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (02) :145-152