A Unifying View on Blind Source Separation of Convolutive Mixtures Based on Independent Component Analysis

被引:12
作者
Brendel, Andreas [1 ,2 ]
Haubner, Thomas [1 ]
Kellermann, Walter
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Chair Multimedia Communcat & Signal Proc, D-91058 Erlangen, Germany
[2] Fraunhofer Inst Integrated Circuits IIS, D-91058 Erlangen, Germany
关键词
Cost function; Microphones; Signal processing algorithms; Time-frequency analysis; Acoustics; Convolution; Probability density function; Blind source separation; independent component analysis; convolutive mixtures; indpendent vector analysis; trinicon; VECTOR ANALYSIS; PERMUTATION PROBLEM; SIGNAL SEPARATION; ALGORITHMS; SPEECH; IDENTIFICATION; EXTRACTION; ROBUST; ICA;
D O I
10.1109/TSP.2023.3255552
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In many daily-life scenarios, acoustic sources recorded in an enclosure can only be observed with other interfering sources. Hence, convolutive Blind Source Separation (BSS) is a central problem in audio signal processing. Methods based on Independent Component Analysis (ICA) are especially important in this field as they require only few and weak assumptions and allow for blindness regarding the original source signals and the acoustic propagation path. Most of the currently used algorithms belong to one of the following three families: Frequency Domain ICA (FD-ICA), Independent Vector Analysis (IVA), and TRIple-N Independent component analysis for CONvolutive mixtures (TRINICON). While the relation between ICA, FD-ICA and IVA becomes apparent due to their construction, the relation to TRINICON is not well established yet. This paper fills this gap by providing an in-depth treatment of the common building blocks of these algorithms and their differences, and thus provides a common framework for all considered algorithms.
引用
收藏
页码:816 / 830
页数:15
相关论文
共 73 条
[51]   Independent Deeply Learned Matrix Analysis for Determined Audio Source Separation [J].
Makishima, Naoki ;
Mogami, Shinichi ;
Takamune, Norihiro ;
Kitamura, Daichi ;
Sumino, Hayato ;
Takamichi, Shinnosuke ;
Saruwatari, Hiroshi ;
Ono, Nobutaka .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (10) :1601-1615
[52]   A NEURAL-NET FOR BLIND SEPARATION OF NONSTATIONARY SIGNALS [J].
MATSUOKA, K ;
OHYA, M ;
KAWAMOTO, M .
NEURAL NETWORKS, 1995, 8 (03) :411-419
[53]  
Matsuoka K, 2002, SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5, P2138, DOI 10.1109/SICE.2002.1195729
[54]   Blind separation of convolutive mixtures by decorrelation [J].
Mei, TM ;
Yin, FL .
SIGNAL PROCESSING, 2004, 84 (12) :2297-2313
[55]  
Meier S, 2015, EUR SIGNAL PR CONF, P414, DOI 10.1109/EUSIPCO.2015.7362416
[56]  
Moreau E., 2013, Blind Identification and Separation of Complex-valued Signals
[57]   An approach to blind source separation based on temporal structure of speech signals [J].
Murata, N ;
Ikeda, S ;
Ziehe, A .
NEUROCOMPUTING, 2001, 41 :1-24
[58]  
Ono N, 2012, ASIAPAC SIGN INFO PR
[59]  
Ono N, 2011, 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), P189, DOI 10.1109/ASPAA.2011.6082320
[60]  
Ono S., LATENT VARIABLE ANAL, V6365