Joint dereverberation and blind source separation using a hybrid autoregressive and convolutive transfer function-based model

被引:0
|
作者
Liu, Shengdong [1 ,2 ]
Yang, Feiran [2 ,3 ]
Chen, Rilin [4 ]
Yang, Jun [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, State Key Lab Acoust, Inst Acoust, Beijing 100190, Peoples R China
[4] Tencent AI Lab, Beijing 100080, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Convolutive transfer function; Autoregressive; Dereverberation; Blind source separation; Multichannel non-negative matrix factorization; NONNEGATIVE MATRIX FACTORIZATION; MIXTURES; DOMAIN; IDENTIFICATION; NOISE;
D O I
10.1016/j.apacoust.2024.110135
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most frequency-domain blind source separation (BSS) methods are based on the multiplicative narrowband assumption, which is not valid in long reverberation environments. In contrast, convolutive transfer function (CTF)-based BSS methods do not rely on the narrowband assumption, and the separation performance is significantly improved compared to the traditional algorithms in long reverberation environments. However, the CTF-based BSS methods and their variants, e.g., autoregressive (AR) BSS methods, introduce modeling errors to some extent, due to the truncation or approximation during the optimization process. To address this problem, we propose a frequency-domain BSS method employing a hybrid AR and CTF model, which can provide more precise representations of the early reflections and late reverberations. Furthermore, we utilize the Gaussian noise model to deal with the BSS problem in noisy reverberant environments. We formulate the objective function using the maximum log-likelihood criterion, and derive an efficient iterative algorithm for parameter estimation with the block coordinate descent (BCD) method. Experimental results show that the proposed method has a better separation performance than the existing methods in long reverberation environments.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] SPEECH DEREVERBERATION WITH CONVOLUTIVE TRANSFER FUNCTION APPROXIMATION USING MAP AND VARIATIONAL DECONVOLUTION APPROACHES
    Jukic, Ante
    van Waterschoot, Toon
    Gerkmann, Timo
    Doclo, Simon
    2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 50 - 54
  • [32] A hybrid algorithm for blind source separation of a convolutive mixture of three speech sources
    Shahab Faiz Minhas
    Patrick Gaydecki
    EURASIP Journal on Advances in Signal Processing, 2014
  • [33] Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function
    Li, Xiaofei
    Gannot, Sharon
    Girin, Laurent
    Horaud, Radu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) : 1755 - 1768
  • [34] USING THE SCALING AMBIGUITY FOR FILTER SHORTENING IN CONVOLUTIVE BLIND SOURCE SEPARATION
    Mazur, Radoslaw
    Mertins, Alfred
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1709 - 1712
  • [35] Convolutive blind source separation in the frequency domain based on sparse representation
    He, Zhaoshui
    Xie, Shengli
    Ding, Shuxue
    Cichocki, Andrzej
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05): : 1551 - 1563
  • [36] Joint diagonalization of power spectral density matrices for blind source separation of convolutive mixtures
    Mei, TM
    Xi, JT
    Yin, FL
    Chicharo, JF
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 520 - 525
  • [37] Convolutive Blind Source Separation Algorithm based on Higher Order Statistics
    Wang, Hongzhi
    Bi, Aiqi
    Xu, Peixin
    Gao, Can
    2013 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM DESIGN AND ENGINEERING APPLICATIONS (ISDEA), 2013, : 487 - 490
  • [38] Convolutive Blind Source Separation Based on Disjointness Maximization of Subband Signals
    Mei, Tiemin
    Mertins, Alfred
    IEEE SIGNAL PROCESSING LETTERS, 2008, 15 (725-728) : 725 - 728
  • [39] Convolutive Blind Source Separation based on Wavelet De-noising
    Zhang, Hong-Bin
    Xu, Peng-Fei
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION APPLICATIONS (ICCIA 2012), 2012, : 807 - 810
  • [40] De-cumulant based approaches for convolutive blind source separation
    Mei, TM
    Xi, JT
    Chicharo, J
    Yin, FL
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 471 - 474