Consistent ICA: Determined BSS Meets Spectrogram Consistency

被引:12
作者
Yatabe, Kohei [1 ]
机构
[1] Waseda Univ, Dept Intermedia Art & Sci, Tokyo 1698555, Japan
关键词
Spectrogram; Time-frequency analysis; Time-domain analysis; Matrix converters; Blind source separation; Smoothing methods; Linear source separation; multichannel acoustic signal processing; demixing filter estimation; independent component analysis (ICA); short-time Fourier transform; INDEPENDENT COMPONENT ANALYSIS; SOURCE SEPARATION; PERMUTATION PROBLEM; ALGORITHMS; MIXTURES;
D O I
10.1109/LSP.2020.2996904
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multichannel audio blind source separation (BSS) in the determined situation (the number of microphones is equal to that of the sources), or determined BSS, is performed by multichannel linear filtering in the time-frequency domain to handle the convolutive mixing process. Ordinarily, the filter treats each frequency independently, which causes the well-known permutation problem, i.e., the problem of how to align the frequency-wise filters so that each separated component is correctly assigned to the corresponding sources. In this paper, it is shown that the general property of the time-frequency-domain representation called spectrogram consistency can be an assistant for solving the permutation problem.
引用
收藏
页码:870 / 874
页数:5
相关论文
共 34 条
[1]  
Anemuller J., 2000, Second International Workshop on Independent Component Analysis and Blind Signal Separation. Proceedings, P215
[2]  
Araki Shoko, 2012, Latent Variable Analysis and Signal Separation. Proceedings 10th International Conference, LVA/ICA 2012, P414, DOI 10.1007/978-3-642-28551-6_51
[3]   A generalization of blind source separation algorithms for convolutive mixtures based on second-order statistics [J].
Buchner, H ;
Aichner, R ;
Kellermann, W .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (01) :120-134
[4]   INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT [J].
COMON, P .
SIGNAL PROCESSING, 1994, 36 (03) :287-314
[5]   Phase Processing for Single-Channel Speech Enhancement [J].
Gerkmann, Timo ;
Krawczyk-Becker, Martin ;
Le Roux, Jonathan .
IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (02) :55-66
[6]  
Hiroe A, 2006, LECT NOTES COMPUT SC, V3889, P601
[7]   Independent component analysis:: algorithms and applications [J].
Hyvärinen, A ;
Oja, E .
NEURAL NETWORKS, 2000, 13 (4-5) :411-430
[8]   Blind source separation exploiting higher-order frequency dependencies [J].
Kim, Taesu ;
Attias, Hagai T. ;
Lee, Soo-Young ;
Lee, Te-Won .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01) :70-79
[9]   Generalized independent low-rank matrix analysis using heavy-tailed distributions for blind source separation [J].
Kitamura, Daichi ;
Mogami, Shinichi ;
Mitsui, Yoshiki ;
Takamune, Norihiro ;
Saruwatari, Hiroshi ;
Ono, Nobutaka ;
Takahashi, Yu ;
Kondo, Kazunobu .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2018,
[10]   Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization [J].
Kitamura, Daichi ;
Ono, Nobutaka ;
Sawada, Hiroshi ;
Kameoka, Hirokazu ;
Saruwatari, Hiroshi .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) :1626-1641