A frequency domain method for blind source separation of convolutive audio mixtures

被引：82

作者：

Rahbar, K ^{[1
]}

Reilly, JP ^{[1
]}

机构：

[1] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4K1, Canada

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2005年 / 13卷 / 05期

基金：

加拿大自然科学与工程研究理事会;

关键词：

audio enhancement; frequency domain blind; source separation; joint diagonalization; permutation ambiguity;

D O I：

10.1109/TSA.2005.851925

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we propose a new frequency domain approach to blind source separation (BSS) of audio signals mixed in a reverberant environment. We propose a joint diagonalization procedure on the cross power spectral density matrices of the signals at the output of the mixing system to identify the mixing system at each frequency bin up to a scale and permutation ambiguity. The frequency domain joint diagonalization is performed using a new and quickly converging algorithm which uses an alternating least-squares (ALS) optimization method. The inverse of the mixing system is then used to separate the sources. An efficient dyadic algorithm to resolve the frequency dependent permutation ambiguities that exploits the inherent nonstationarity of the sources is presented. The effect of the unknown scaling ambiguities is partially resolved using an initialization procedure for the ALS algorithm. The performance of the proposed algorithm is demonstrated by experiments conducted in real reverberant rooms. Performance comparisons are made with previous methods.

引用

页码：832 / 844

页数：13

共 50 条

[31] Audio source separation with multiple microphones on time-frequency representations
Sawada, Hiroshi
INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING XI, 2013, 8750
[32] An Approach to Solving a Permutation Problem of Frequency Domain Independent Component Analysis for Blind Source Separation of Speech Signals
Fujieda, Masaru
Murakami, Takahiro
Ishida, Yoshihisa
PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 18, 2006, 18 : 64 - 68
[33] Frequency domain passive broadband speaker localization using a permutation-free blind source separation algorithm
Visser, Erik
2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol II, Pts 1-3, 2007, : 673 - 676
[34] Blind signal separation method and relationship between source separation and source localisation in the TF plane
Cholnam, Om
Chongil, Gwak
Chol, Rim Kyong
IET SIGNAL PROCESSING, 2018, 12 (09) : 1115 - 1122
[35] Blind Source Separation Approach for Audio Signals based on Support Vector Machine Classification
Abouzid, H.
Chakkor, O.
ICCWCS'17: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND WIRELESS COMMUNICATION SYSTEMS, 2017,
[36] A new approach for blind source separation using time frequency distributions
Belouchrani, A
Amin, MG
ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS VI, 1996, 2846 : 193 - 203
[37] Determined and overdetermined convolutive blind source extractions by approximate joint diagonalization
Saito, Shinya
Oishi, Kunio
Furukawa, Toshihiro
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2019, 40 (05) : 302 - 312
[38] Local AM/FM Parameters Estimation: Application to Sinusoidal Modeling and Blind Audio Source Separation
Fourer, Dominique
Auger, Francois
Peeters, Geoffroy
IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (10) : 1600 - 1604
[39] Time-Domain Audio Source Separation With Neural Networks Based on Multiresolution Analysis
Nakamura, Tomohiko
Kozuka, Shihori
Saruwatari, Hiroshi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1687 - 1701
[40] Blind Source Separation of Radar Signals in Time Domain Using Deep Learning
Hinderer, Sven
2022 23RD INTERNATIONAL RADAR SYMPOSIUM (IRS), 2022, : 486 - 491

← 1 2 3 4 5 →