Joint dereverberation and blind source separation using a hybrid autoregressive and convolutive transfer function-based model

被引：0

作者：

Liu, Shengdong ^{[1
,2
]}

Yang, Feiran ^{[2
,3
]}

Chen, Rilin ^{[4
]}

Yang, Jun ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[3] Chinese Acad Sci, State Key Lab Acoust, Inst Acoust, Beijing 100190, Peoples R China

[4] Tencent AI Lab, Beijing 100080, Peoples R China

来源：

APPLIED ACOUSTICS | 2024年 / 224卷

基金：

中国国家自然科学基金; 北京市自然科学基金;

关键词：

Convolutive transfer function; Autoregressive; Dereverberation; Blind source separation; Multichannel non-negative matrix factorization; NONNEGATIVE MATRIX FACTORIZATION; MIXTURES; DOMAIN; IDENTIFICATION; NOISE;

D O I：

10.1016/j.apacoust.2024.110135

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Most frequency-domain blind source separation (BSS) methods are based on the multiplicative narrowband assumption, which is not valid in long reverberation environments. In contrast, convolutive transfer function (CTF)-based BSS methods do not rely on the narrowband assumption, and the separation performance is significantly improved compared to the traditional algorithms in long reverberation environments. However, the CTF-based BSS methods and their variants, e.g., autoregressive (AR) BSS methods, introduce modeling errors to some extent, due to the truncation or approximation during the optimization process. To address this problem, we propose a frequency-domain BSS method employing a hybrid AR and CTF model, which can provide more precise representations of the early reflections and late reverberations. Furthermore, we utilize the Gaussian noise model to deal with the BSS problem in noisy reverberant environments. We formulate the objective function using the maximum log-likelihood criterion, and derive an efficient iterative algorithm for parameter estimation with the block coordinate descent (BCD) method. Experimental results show that the proposed method has a better separation performance than the existing methods in long reverberation environments.

引用

页数：10

共 50 条

[41] Multiresolution Convolutive Blind Source Separation Using Adaptive Lifting Scheme
Hattay, Jamel
Belaid, Samir
Naanaa, Wady
2013 IEEE 20TH INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (ICECS), 2013, : 273 - 276
[42] AN IMROVEMENT IN USING HERMITIAN ANGLEIN CONVOLUTIVE SPEECH BLIND SOURCE SEPARATION
Mahmoodian, Hamid
Soltani, Atefeh
Hashemi, Ali
2013 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTER AND COMPUTATION (ICECCO), 2013, : 368 - 371
[43] A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation
Liu, Tongzheng
Lu, Zhihua
da Costa, Joao Paulo J.
Fei, Tai
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3000 - 3014
[44] A Blind Source Separation Approach Based on IVA for Convolutive Speech Mixtures
Jan, Tariqullah
Zafar, Haseeb
Khalil, Ruhulamin
Ashraf, Majid
2016 8TH COMPUTER SCIENCE AND ELECTRONIC ENGINEERING CONFERENCE (CEEC), 2016, : 140 - 145
[45] Autoregressive Moving Average Jointly-Diagonalizable Spatial Covariance Analysis for Joint Source Separation and Dereverberation
Sekiguchi, Kouhei
Bando, Yoshiaki
Nugraha, Aditya Arie
Fontaine, Mathieu
Yoshii, Kazuyoshi
Kawahara, Tatsuya
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2368 - 2382
[46] Joint source separation and dereverberation using constrained spectral divergence optimization
Nathwani, Karan
Hegde, Rajesh M.
SIGNAL PROCESSING, 2015, 106 : 266 - 281
[47] AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION
Li, Xiaofei
Girin, Laurent
Horaud, Radu
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 541 - 545
[48] Non-orthogonal joint block diagonalization based on the LU or QR factorizations for convolutive blind source separation
Zhang, Lei
Cao, Yueyun
Yang, Zichun
Weng, Lei
JOURNAL OF VIBROENGINEERING, 2017, 19 (05) : 3380 - 3394
[49] Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function
Li, Xiaofei
Girin, Laurent
Gannot, Sharon
Horaud, Radu
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (03) : 645 - 659
[50] FAJD blind source separation algorithm based on time-varying autoregressive model
Ji C.
Jin C.
Zhang Y.
Kongzhi yu Juece/Control and Decision, 2020, 35 (03): : 651 - 656

← 1 2 3 4 5 →