Joint dereverberation and blind source separation using a hybrid autoregressive and convolutive transfer function-based model

被引:0
|
作者
Liu, Shengdong [1 ,2 ]
Yang, Feiran [2 ,3 ]
Chen, Rilin [4 ]
Yang, Jun [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, State Key Lab Acoust, Inst Acoust, Beijing 100190, Peoples R China
[4] Tencent AI Lab, Beijing 100080, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Convolutive transfer function; Autoregressive; Dereverberation; Blind source separation; Multichannel non-negative matrix factorization; NONNEGATIVE MATRIX FACTORIZATION; MIXTURES; DOMAIN; IDENTIFICATION; NOISE;
D O I
10.1016/j.apacoust.2024.110135
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most frequency-domain blind source separation (BSS) methods are based on the multiplicative narrowband assumption, which is not valid in long reverberation environments. In contrast, convolutive transfer function (CTF)-based BSS methods do not rely on the narrowband assumption, and the separation performance is significantly improved compared to the traditional algorithms in long reverberation environments. However, the CTF-based BSS methods and their variants, e.g., autoregressive (AR) BSS methods, introduce modeling errors to some extent, due to the truncation or approximation during the optimization process. To address this problem, we propose a frequency-domain BSS method employing a hybrid AR and CTF model, which can provide more precise representations of the early reflections and late reverberations. Furthermore, we utilize the Gaussian noise model to deal with the BSS problem in noisy reverberant environments. We formulate the objective function using the maximum log-likelihood criterion, and derive an efficient iterative algorithm for parameter estimation with the block coordinate descent (BCD) method. Experimental results show that the proposed method has a better separation performance than the existing methods in long reverberation environments.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Multiresolution Convolutive Blind Source Separation Using Adaptive Lifting Scheme
    Hattay, Jamel
    Belaid, Samir
    Naanaa, Wady
    2013 IEEE 20TH INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (ICECS), 2013, : 273 - 276
  • [42] AN IMROVEMENT IN USING HERMITIAN ANGLEIN CONVOLUTIVE SPEECH BLIND SOURCE SEPARATION
    Mahmoodian, Hamid
    Soltani, Atefeh
    Hashemi, Ali
    2013 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTER AND COMPUTATION (ICECCO), 2013, : 368 - 371
  • [43] A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation
    Liu, Tongzheng
    Lu, Zhihua
    da Costa, Joao Paulo J.
    Fei, Tai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3000 - 3014
  • [44] A Blind Source Separation Approach Based on IVA for Convolutive Speech Mixtures
    Jan, Tariqullah
    Zafar, Haseeb
    Khalil, Ruhulamin
    Ashraf, Majid
    2016 8TH COMPUTER SCIENCE AND ELECTRONIC ENGINEERING CONFERENCE (CEEC), 2016, : 140 - 145
  • [45] Autoregressive Moving Average Jointly-Diagonalizable Spatial Covariance Analysis for Joint Source Separation and Dereverberation
    Sekiguchi, Kouhei
    Bando, Yoshiaki
    Nugraha, Aditya Arie
    Fontaine, Mathieu
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2368 - 2382
  • [46] Joint source separation and dereverberation using constrained spectral divergence optimization
    Nathwani, Karan
    Hegde, Rajesh M.
    SIGNAL PROCESSING, 2015, 106 : 266 - 281
  • [47] AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION
    Li, Xiaofei
    Girin, Laurent
    Horaud, Radu
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 541 - 545
  • [48] Non-orthogonal joint block diagonalization based on the LU or QR factorizations for convolutive blind source separation
    Zhang, Lei
    Cao, Yueyun
    Yang, Zichun
    Weng, Lei
    JOURNAL OF VIBROENGINEERING, 2017, 19 (05) : 3380 - 3394
  • [49] Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function
    Li, Xiaofei
    Girin, Laurent
    Gannot, Sharon
    Horaud, Radu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (03) : 645 - 659
  • [50] FAJD blind source separation algorithm based on time-varying autoregressive model
    Ji C.
    Jin C.
    Zhang Y.
    Kongzhi yu Juece/Control and Decision, 2020, 35 (03): : 651 - 656