Joint dereverberation and blind source separation using a hybrid autoregressive and convolutive transfer function-based model

被引：0

作者：

Liu, Shengdong ^{[1
,2
]}

Yang, Feiran ^{[2
,3
]}

Chen, Rilin ^{[4
]}

Yang, Jun ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[3] Chinese Acad Sci, State Key Lab Acoust, Inst Acoust, Beijing 100190, Peoples R China

[4] Tencent AI Lab, Beijing 100080, Peoples R China

来源：

APPLIED ACOUSTICS | 2024年 / 224卷

基金：

中国国家自然科学基金; 北京市自然科学基金;

关键词：

Convolutive transfer function; Autoregressive; Dereverberation; Blind source separation; Multichannel non-negative matrix factorization; NONNEGATIVE MATRIX FACTORIZATION; MIXTURES; DOMAIN; IDENTIFICATION; NOISE;

D O I：

10.1016/j.apacoust.2024.110135

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Most frequency-domain blind source separation (BSS) methods are based on the multiplicative narrowband assumption, which is not valid in long reverberation environments. In contrast, convolutive transfer function (CTF)-based BSS methods do not rely on the narrowband assumption, and the separation performance is significantly improved compared to the traditional algorithms in long reverberation environments. However, the CTF-based BSS methods and their variants, e.g., autoregressive (AR) BSS methods, introduce modeling errors to some extent, due to the truncation or approximation during the optimization process. To address this problem, we propose a frequency-domain BSS method employing a hybrid AR and CTF model, which can provide more precise representations of the early reflections and late reverberations. Furthermore, we utilize the Gaussian noise model to deal with the BSS problem in noisy reverberant environments. We formulate the objective function using the maximum log-likelihood criterion, and derive an efficient iterative algorithm for parameter estimation with the block coordinate descent (BCD) method. Experimental results show that the proposed method has a better separation performance than the existing methods in long reverberation environments.

引用

页数：10

共 50 条

[31] SPEECH DEREVERBERATION WITH CONVOLUTIVE TRANSFER FUNCTION APPROXIMATION USING MAP AND VARIATIONAL DECONVOLUTION APPROACHES
Jukic, Ante
van Waterschoot, Toon
Gerkmann, Timo
Doclo, Simon
2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 50 - 54
[32] A hybrid algorithm for blind source separation of a convolutive mixture of three speech sources
Shahab Faiz Minhas
Patrick Gaydecki
EURASIP Journal on Advances in Signal Processing, 2014
[33] Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function
Li, Xiaofei
Gannot, Sharon
Girin, Laurent
Horaud, Radu
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) : 1755 - 1768
[34] USING THE SCALING AMBIGUITY FOR FILTER SHORTENING IN CONVOLUTIVE BLIND SOURCE SEPARATION
Mazur, Radoslaw
Mertins, Alfred
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1709 - 1712
[35] Convolutive blind source separation in the frequency domain based on sparse representation
He, Zhaoshui
Xie, Shengli
Ding, Shuxue
Cichocki, Andrzej
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05): : 1551 - 1563
[36] Joint diagonalization of power spectral density matrices for blind source separation of convolutive mixtures
Mei, TM
Xi, JT
Yin, FL
Chicharo, JF
ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 520 - 525
[37] Convolutive Blind Source Separation Algorithm based on Higher Order Statistics
Wang, Hongzhi
Bi, Aiqi
Xu, Peixin
Gao, Can
2013 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM DESIGN AND ENGINEERING APPLICATIONS (ISDEA), 2013, : 487 - 490
[38] Convolutive Blind Source Separation Based on Disjointness Maximization of Subband Signals
Mei, Tiemin
Mertins, Alfred
IEEE SIGNAL PROCESSING LETTERS, 2008, 15 (725-728) : 725 - 728
[39] Convolutive Blind Source Separation based on Wavelet De-noising
Zhang, Hong-Bin
Xu, Peng-Fei
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION APPLICATIONS (ICCIA 2012), 2012, : 807 - 810
[40] De-cumulant based approaches for convolutive blind source separation
Mei, TM
Xi, JT
Chicharo, J
Yin, FL
PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 471 - 474

← 1 2 3 4 5 →