Combating Reverberation in NTF-based Speech Separation Using a Sub-Source Weighted Multichannel Wiener Filter and Linear Prediction

被引:3
|
作者
Fras, Mieszko [1 ]
Witkowski, Marcin [1 ]
Kowalczyk, Konrad [1 ]
机构
[1] AGH Univ Sci & Technol, Inst Elect, Krakow, Poland
来源
INTERSPEECH 2021 | 2021年
关键词
source separation; nonnegative tensor factorization; multichannel Wiener filter; subsource modelling; dereverberation; weighted prediction error; NONNEGATIVE MATRIX FACTORIZATION; INDEPENDENT COMPONENT ANALYSIS; AUDIO SOURCE SEPARATION; DEREVERBERATION;
D O I
10.21437/Interspeech.2021-1230
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Sound source separation (SS) from the microphone signals capturing speech in reverberant conditions is a formidable task. This paper addresses the problem of joint separation and dereverberation of speech using the multichannel Wiener filter (MWF) that is tailored to the sub-source modeling of each speech source with a full-rank mixing matrix. Specifically, the parameters of the proposed sub-source-weighted (SSW) spatial filter are estimated using the sub-source based expectation maximization (EM) algorithm with multiplicative updates (MU) and the localization prior distribution (LP) on the mixing matrix (SSEM-MU-LP). In addition, we strengthen dereverberation by incorporating a Generalized Weighted Prediction Error (GWPE) algorithm. The proposed method is evaluated using a large dataset of two-channel recordings of clean speech convolved with both real and synthesized impulse responses. The results of the experiments show the superior performance of the proposed method in reverberant conditions in comparison to using the standard NTF-based separation with the vanilla MWF in terms of signal-to-distortion ratio (improvement of 3 - 5.6 dB) and other commonly used sound separation metrics.
引用
收藏
页码:3895 / 3899
页数:5
相关论文
共 18 条
  • [1] Convolutional Weighted Parametric Multichannel Wiener Filter for Reverberant Source Separation
    Fras, Mieszko
    Kowalczyk, Konrad
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1928 - 1932
  • [2] MULTICHANNEL WIENER FILTER ESTIMATION USING SOURCE LOCATION KNOWLEDGE FOR SPEECH ENHANCEMENT
    Anderson, Craig A.
    Teal, Paul D.
    Poletti, Mark A.
    2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 57 - 60
  • [3] MAXIMUM A POSTERIORI ESTIMATOR FOR CONVOLUTIVE SOUND SOURCE SEPARATION WITH SUB-SOURCE BASED NTF MODEL AND THE LOCALIZATION PROBABILISTIC PRIOR ON THE MIXING MATRIX
    Fras, Mieszko
    Kowalczyk, Konrad
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 526 - 530
  • [4] JOINT BEAMFORMING AND REVERBERATION CANCELLATION USING A CONSTRAINED KALMAN FILTER WITH MULTICHANNEL LINEAR PREDICTION
    Hashemgeloogerdi, Sahar
    Braun, Sebastian
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 481 - 485
  • [5] A Weighted Multichannel Wiener Filter and its Decomposition to LCMV Beam Former and Post-Filter for Source Separation and Noise Reduction
    Adler, Aviel
    Schwartz, Ofer
    Gannot, Sharon
    2018 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING IN ISRAEL (ICSEE), 2018,
  • [6] A Joint Diagonalization Based Efficient Approach to Underdetermined Blind Audio Source Separation Using the Multichannel Wiener Filter
    Ito, Nobutaka
    Ikeshita, Rintaro
    Sawada, Hiroshi
    Nakatani, Tomohiro
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1950 - 1965
  • [7] Robust Multichannel Linear Prediction for Online Speech Dereverberation Using Weighted Householder Least Squares Lattice Adaptive Filter
    Wung, Jason
    Jukic, Ante
    Malik, Sarmad
    Souden, Mehrez
    Pichevar, Ramin
    Atkins, Joshua
    Naik, Devang
    Acero, Alex
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 : 3559 - 3574
  • [8] CURVATURE-BASED OPTIMIZATION OF THE TRADE-OFF PARAMETER IN THE SPEECH DISTORTION WEIGHTED MULTICHANNEL WIENER FILTER
    Kodrasi, Ina
    Marquardt, Daniel
    Doclo, Simon
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 315 - 319
  • [9] Speech Enhancement by Denoising and Dereverberation Using a Generalized Sidelobe Canceller-Based Multichannel Wiener Filter
    Bai, Mingsian R.
    Kung, Fan-Jie
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2022, 70 (03): : 140 - 155
  • [10] Kronecker Product Multichannel Linear Filtering for Adaptive Weighted Prediction Error-Based Speech Dereverberation
    Huang, Gongping
    Benesty, Jacob
    Cohen, Israel
    Chen, Jingdong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1277 - 1289