A frequency domain method for blind source separation of convolutive audio mixtures

被引:82
作者
Rahbar, K [1 ]
Reilly, JP [1 ]
机构
[1] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4K1, Canada
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2005年 / 13卷 / 05期
基金
加拿大自然科学与工程研究理事会;
关键词
audio enhancement; frequency domain blind; source separation; joint diagonalization; permutation ambiguity;
D O I
10.1109/TSA.2005.851925
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a new frequency domain approach to blind source separation (BSS) of audio signals mixed in a reverberant environment. We propose a joint diagonalization procedure on the cross power spectral density matrices of the signals at the output of the mixing system to identify the mixing system at each frequency bin up to a scale and permutation ambiguity. The frequency domain joint diagonalization is performed using a new and quickly converging algorithm which uses an alternating least-squares (ALS) optimization method. The inverse of the mixing system is then used to separate the sources. An efficient dyadic algorithm to resolve the frequency dependent permutation ambiguities that exploits the inherent nonstationarity of the sources is presented. The effect of the unknown scaling ambiguities is partially resolved using an initialization procedure for the ALS algorithm. The performance of the proposed algorithm is demonstrated by experiments conducted in real reverberant rooms. Performance comparisons are made with previous methods.
引用
收藏
页码:832 / 844
页数:13
相关论文
共 50 条
  • [31] Audio source separation with multiple microphones on time-frequency representations
    Sawada, Hiroshi
    INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING XI, 2013, 8750
  • [32] An Approach to Solving a Permutation Problem of Frequency Domain Independent Component Analysis for Blind Source Separation of Speech Signals
    Fujieda, Masaru
    Murakami, Takahiro
    Ishida, Yoshihisa
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 18, 2006, 18 : 64 - 68
  • [33] Frequency domain passive broadband speaker localization using a permutation-free blind source separation algorithm
    Visser, Erik
    2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol II, Pts 1-3, 2007, : 673 - 676
  • [34] Blind signal separation method and relationship between source separation and source localisation in the TF plane
    Cholnam, Om
    Chongil, Gwak
    Chol, Rim Kyong
    IET SIGNAL PROCESSING, 2018, 12 (09) : 1115 - 1122
  • [35] Blind Source Separation Approach for Audio Signals based on Support Vector Machine Classification
    Abouzid, H.
    Chakkor, O.
    ICCWCS'17: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND WIRELESS COMMUNICATION SYSTEMS, 2017,
  • [36] A new approach for blind source separation using time frequency distributions
    Belouchrani, A
    Amin, MG
    ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS VI, 1996, 2846 : 193 - 203
  • [37] Determined and overdetermined convolutive blind source extractions by approximate joint diagonalization
    Saito, Shinya
    Oishi, Kunio
    Furukawa, Toshihiro
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2019, 40 (05) : 302 - 312
  • [38] Local AM/FM Parameters Estimation: Application to Sinusoidal Modeling and Blind Audio Source Separation
    Fourer, Dominique
    Auger, Francois
    Peeters, Geoffroy
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (10) : 1600 - 1604
  • [39] Time-Domain Audio Source Separation With Neural Networks Based on Multiresolution Analysis
    Nakamura, Tomohiko
    Kozuka, Shihori
    Saruwatari, Hiroshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1687 - 1701
  • [40] Blind Source Separation of Radar Signals in Time Domain Using Deep Learning
    Hinderer, Sven
    2022 23RD INTERNATIONAL RADAR SYMPOSIUM (IRS), 2022, : 486 - 491