A frequency domain method for blind source separation of convolutive audio mixtures

Cited by: 82
Authors
Rahbar, K [1]
Reilly, JP [1]
Affiliations
[1] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4K1, Canada
Source
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
audio enhancement; frequency domain blind; source separation; joint diagonalization; permutation ambiguity
DOI
10.1109/TSA.2005.851925
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we propose a new frequency domain approach to blind source separation (BSS) of audio signals mixed in a reverberant environment. We propose a joint diagonalization procedure on the cross power spectral density matrices of the signals at the output of the mixing system to identify the mixing system at each frequency bin up to a scale and permutation ambiguity. The frequency domain joint diagonalization is performed using a new and quickly converging algorithm which uses an alternating least-squares (ALS) optimization method. The inverse of the mixing system is then used to separate the sources. An efficient dyadic algorithm to resolve the frequency dependent permutation ambiguities that exploits the inherent nonstationarity of the sources is presented. The effect of the unknown scaling ambiguities is partially resolved using an initialization procedure for the ALS algorithm. The performance of the proposed algorithm is demonstrated by experiments conducted in real reverberant rooms. Performance comparisons are made with previous methods.
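As a rough illustration of what the abstract describes, the per-bin joint diagonalization problem can be written in the standard frequency-domain form below. This is a sketch under stated assumptions, not the paper's own notation: the symbols $\mathbf{H}(\omega)$ (mixing system), $\mathbf{P}_x(\omega,k)$ (cross power spectral density matrix of the observations in epoch $k$), $\boldsymbol{\Lambda}_s(\omega,k)$ (diagonal source spectra), $\boldsymbol{\Pi}$ (permutation) and $\mathbf{D}$ (invertible diagonal scaling) are chosen here for illustration, and additive noise is ignored.

```latex
% Assumed narrowband model at frequency bin \omega and epoch k = 0, \dots, K-1
\mathbf{x}(\omega, k) = \mathbf{H}(\omega)\,\mathbf{s}(\omega, k)

% Mutually uncorrelated sources give diagonal source spectra, so
\mathbf{P}_x(\omega, k) = \mathbf{H}(\omega)\,\boldsymbol{\Lambda}_s(\omega, k)\,\mathbf{H}^{H}(\omega)

% Least-squares joint diagonalization over the K epochs
\min_{\mathbf{H}(\omega),\,\{\boldsymbol{\Lambda}(\omega,k)\}}
  \sum_{k=0}^{K-1}
  \left\| \mathbf{P}_x(\omega, k)
        - \mathbf{H}(\omega)\,\boldsymbol{\Lambda}(\omega, k)\,\mathbf{H}^{H}(\omega)
  \right\|_F^{2}

% Scale and permutation ambiguity: for any permutation \Pi and invertible diagonal D,
\mathbf{H}' = \mathbf{H}\,\boldsymbol{\Pi}\,\mathbf{D}, \qquad
\boldsymbol{\Lambda}' = \mathbf{D}^{-1}\boldsymbol{\Pi}^{T}\boldsymbol{\Lambda}\,\boldsymbol{\Pi}\,\mathbf{D}^{-H}
\;\Longrightarrow\;
\mathbf{H}'\boldsymbol{\Lambda}'\mathbf{H}'^{H} = \mathbf{H}\,\boldsymbol{\Lambda}\,\mathbf{H}^{H}
```

The last line shows why the mixing system is identifiable only up to scale and permutation at each bin: $\boldsymbol{\Lambda}'$ is again diagonal, so the data cannot distinguish $\mathbf{H}'$ from $\mathbf{H}$. Source nonstationarity is what supplies several distinct epochs $k$, and an alternating scheme would fix $\mathbf{H}$ to update the $\boldsymbol{\Lambda}(\omega,k)$ and then the reverse.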
Pages: 832-844
Number of pages: 13
Related papers
50 in total
  • [41] A narrowband approach to blind source separation in convolutive MIMO mixtures
    Affes, Sofiene
    Souden, Mehrez
    Benesty, Jacob
    2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 1377 - 1380
  • [42] AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION
    Li, Xiaofei
    Girin, Laurent
    Horaud, Radu
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 541 - 545
  • [43] A new method of solving permutation problem in blind source separation for convolutive acoustic signals in frequency-domain
    Wu, Wenyan
    Zhang, Liming
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1237 - 1242
  • [44] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
    Ozerov, Alexey
    Fevotte, Cedric
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563
  • [45] A probabilistic approach for blind source separation of underdetermined convolutive mixtures
    Peterson, JM
    Kadambe, S
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 861 - 864
  • [46] A Sparsity-Based Method to Solve Permutation Indeterminacy in Frequency-Domain Convolutive Blind Source Separation
    Sudhakar, Prasad
    Gribonval, Remi
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 338 - 345
  • [47] A Time-Frequency Domain Blind Source Separation Method for Underdetermined Instantaneous Mixtures
    Peng, Tianliang
    Chen, Yang
    Liu, Zengli
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (12) : 3883 - 3895
  • [48] An Adaptive Approach to Subband domain Convolutive Blind Source Separation
    Ayub, Sara
    Arslan, Muhammad
    Salman, Muhammad
    Mirza, Alina
    Asghar, Eram
    Ayub, Huma
    Amanat, Hiba
    Aziz, Lubna
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATIONS, AND CONTROL TECHNOLOGY (I4CT), 2015,
  • [49] A Method for Filter Equalization in Convolutive Blind Source Separation
    Mazur, Radoslaw
    Mertins, Alfred
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, 2010, 6365 : 328 - 336
  • [50] A Method for Filter Shaping in Convolutive Blind Source Separation
    Mazur, Radoslaw
    Mertins, Alfred
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 282 - 289