Combating Reverberation in NTF-based Speech Separation Using a Sub-Source Weighted Multichannel Wiener Filter and Linear Prediction

Cited by: 3

Authors:

Fras, Mieszko [1]

Witkowski, Marcin [1]

Kowalczyk, Konrad [1]

Affiliations:

[1] AGH University of Science and Technology, Institute of Electronics, Krakow, Poland

Source:

INTERSPEECH 2021 | 2021

Keywords:

source separation; nonnegative tensor factorization; multichannel Wiener filter; subsource modelling; dereverberation; weighted prediction error; NONNEGATIVE MATRIX FACTORIZATION; INDEPENDENT COMPONENT ANALYSIS; AUDIO SOURCE SEPARATION

DOI:

10.21437/Interspeech.2021-1230

Chinese Library Classification (CLC) number:

R36 [Pathology]; R76 [Otorhinolaryngology]

Discipline classification codes:

100104; 100213

Abstract:

Sound source separation (SS) from the microphone signals capturing speech in reverberant conditions is a formidable task. This paper addresses the problem of joint separation and dereverberation of speech using the multichannel Wiener filter (MWF) that is tailored to the sub-source modeling of each speech source with a full-rank mixing matrix. Specifically, the parameters of the proposed sub-source-weighted (SSW) spatial filter are estimated using the sub-source based expectation maximization (EM) algorithm with multiplicative updates (MU) and the localization prior distribution (LP) on the mixing matrix (SSEM-MU-LP). In addition, we strengthen dereverberation by incorporating a Generalized Weighted Prediction Error (GWPE) algorithm. The proposed method is evaluated using a large dataset of two-channel recordings of clean speech convolved with both real and synthesized impulse responses. The results of the experiments show the superior performance of the proposed method in reverberant conditions in comparison to using the standard NTF-based separation with the vanilla MWF in terms of signal-to-distortion ratio (an improvement of 3 to 5.6 dB) and other commonly used sound separation metrics.
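As background for the "vanilla MWF" baseline named in the abstract, the following is a minimal sketch of the textbook multichannel Wiener filter at a single time-frequency bin for a two-channel mixture, W = R_target (R_target + R_rest)^{-1}. It is not the proposed SSW filter and is not taken from the paper. The function names and the covariance inputs are hypothetical, chosen for illustration only; in an NTF-based system the covariances would presumably be built from the estimated model parameters.

```python
def mat_inv_2x2(m):
    """Invert a 2x2 complex matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("covariance matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]


def mat_mul_2x2(p, q):
    """Multiply two 2x2 complex matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def mwf_source_image(x, cov_target, cov_rest):
    """Estimate the target source image at one time-frequency bin.

    x          : two complex STFT coefficients of the mixture (one per channel)
    cov_target : 2x2 spatial covariance of the target source (assumed known)
    cov_rest   : 2x2 covariance of all remaining components (assumed known)
    """
    # Mixture covariance under the usual assumption of uncorrelated components.
    cov_mix = [[cov_target[i][j] + cov_rest[i][j] for j in range(2)]
               for i in range(2)]
    # Textbook MWF: W = R_target (R_target + R_rest)^{-1}
    w = mat_mul_2x2(cov_target, mat_inv_2x2(cov_mix))
    return [w[i][0] * x[0] + w[i][1] * x[1] for i in range(2)]


# Illustrative call with made-up values (not data from the paper).
x_bin = [1 + 1j, 2 + 0j]
cov_a = [[2 + 0j, 1j], [-1j, 2 + 0j]]
cov_b = [[1 + 0j, 0j], [0j, 1 + 0j]]
estimate_a = mwf_source_image(x_bin, cov_a, cov_b)
estimate_b = mwf_source_image(x_bin, cov_b, cov_a)
```

A property of this form is that the per-source estimates add up to the observed mixture, since the filters for all components sum to the identity matrix.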


Pages: 3895-3899

Page count: 5

