A Unifying View on Blind Source Separation of Convolutive Mixtures Based on Independent Component Analysis

被引:12
作者
Brendel, Andreas [1 ,2 ]
Haubner, Thomas [1 ]
Kellermann, Walter
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Chair Multimedia Communcat & Signal Proc, D-91058 Erlangen, Germany
[2] Fraunhofer Inst Integrated Circuits IIS, D-91058 Erlangen, Germany
关键词
Cost function; Microphones; Signal processing algorithms; Time-frequency analysis; Acoustics; Convolution; Probability density function; Blind source separation; independent component analysis; convolutive mixtures; indpendent vector analysis; trinicon; VECTOR ANALYSIS; PERMUTATION PROBLEM; SIGNAL SEPARATION; ALGORITHMS; SPEECH; IDENTIFICATION; EXTRACTION; ROBUST; ICA;
D O I
10.1109/TSP.2023.3255552
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In many daily-life scenarios, acoustic sources recorded in an enclosure can only be observed with other interfering sources. Hence, convolutive Blind Source Separation (BSS) is a central problem in audio signal processing. Methods based on Independent Component Analysis (ICA) are especially important in this field as they require only few and weak assumptions and allow for blindness regarding the original source signals and the acoustic propagation path. Most of the currently used algorithms belong to one of the following three families: Frequency Domain ICA (FD-ICA), Independent Vector Analysis (IVA), and TRIple-N Independent component analysis for CONvolutive mixtures (TRINICON). While the relation between ICA, FD-ICA and IVA becomes apparent due to their construction, the relation to TRINICON is not well established yet. This paper fills this gap by providing an in-depth treatment of the common building blocks of these algorithms and their differences, and thus provides a common framework for all considered algorithms.
引用
收藏
页码:816 / 830
页数:15
相关论文
共 73 条
[1]   A real-time blind source separation scheme and its application to reverberant and noisy acoustic environments [J].
Aichner, R ;
Buchner, H ;
Yan, F ;
Kellermann, W .
SIGNAL PROCESSING, 2006, 86 (06) :1260-1277
[2]  
Aichner R, 2002, NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, P445, DOI 10.1109/NNSP.2002.1030056
[3]   Nonholonomic orthogonal learning algorithms for blind source separation [J].
Amari, S ;
Chen, TP ;
Cichocki, A .
NEURAL COMPUTATION, 2000, 12 (06) :1463-1484
[4]   Natural gradient works efficiently in learning [J].
Amari, S .
NEURAL COMPUTATION, 1998, 10 (02) :251-276
[5]  
Anderson CA, 2014, 2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), P102, DOI 10.1109/HSCMA.2014.6843260
[6]   Independent Vector Analysis: Identification Conditions and Performance Bounds [J].
Anderson, Matthew ;
Fu, Geng-Shen ;
Phlypo, Ronald ;
Adali, Tulay .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (17) :4399-4410
[7]   Joint Blind Source Separation With Multivariate Gaussian Model: Algorithms and Performance Analysis [J].
Anderson, Matthew ;
Adali, Tuelay ;
Li, Xi-Lin .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (04) :1672-1683
[8]   The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech [J].
Araki, S ;
Mukai, R ;
Makino, S ;
Nishikawa, T ;
Saruwatari, H .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (02) :109-116
[9]  
Baumann D., 2003, P IEEE INT C AC SPEE, pV
[10]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159