Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization

被引:307
作者
Kitamura, Daichi [1 ]
Ono, Nobutaka [2 ]
Sawada, Hiroshi [3 ]
Kameoka, Hirokazu [3 ]
Saruwatari, Hiroshi [4 ]
机构
[1] SOKENDAI Grad Univ Adv Studies, Sch Multidisciplinary Sci, Dept Informat, Hayama, Kanagawa 2400193, Japan
[2] Natl Inst Informat, Tokyo 1018430, Japan
[3] NTT Corp, NTT Commun Sci Labs, Kyoto 6190237, Japan
[4] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138656, Japan
关键词
Blind source separation; determined; independent vector analysis; nonnegative matrix factorization; CONVOLUTIVE MIXTURES;
D O I
10.1109/TASLP.2016.2577880
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the determined blind source separation problem and proposes a new effective method unifying independent vector analysis (IVA) and nonnegative matrix factorization (NMF). IVA is a state-of-the-art technique that utilizes the statistical independence between sources in a mixture signal, and an efficient optimization scheme has been proposed for IVA. However, since the source model in IVA is based on a spherical multivariate distribution, IVA cannot utilize specific spectral structures such as the harmonic structures of pitched instrumental sounds. To solve this problem, we introduce NMF decomposition as the source model in IVA to capture the spectral structures. The formulation of the proposed method is derived from conventional multichannel NMF(MNMF), which reveals the relationship between MNMF and IVA. The proposed method can be optimized by the update rules of IVA and single-channel NMF. Experimental results show the efficacy of the proposed method compared with IVA and MNMF in terms of separation accuracy and convergence speed.
引用
收藏
页码:1626 / 1641
页数:16
相关论文
共 40 条
[1]  
Araki Shoko, 2012, Latent Variable Analysis and Signal Separation. Proceedings 10th International Conference, LVA/ICA 2012, P414, DOI 10.1007/978-3-642-28551-6_51
[2]   The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech [J].
Araki, S ;
Mukai, R ;
Makino, S ;
Nishikawa, T ;
Saruwatari, H .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (02) :109-116
[3]   Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors [J].
Araki, Shoko ;
Sawada, Hiroshi ;
Mukai, Ryo ;
Makino, Shoji .
SIGNAL PROCESSING, 2007, 87 (08) :1833-1847
[4]  
Arberet S., 2010, 2010 10th International Conference on Information Sciences, Signal Processing and their Applications (ISSPA 2010), P1, DOI 10.1109/ISSPA.2010.5605570
[5]   A generalization of blind source separation algorithms for convolutive mixtures based on second-order statistics [J].
Buchner, H ;
Aichner, R ;
Kellermann, W .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (01) :120-134
[6]   INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT [J].
COMON, P .
SIGNAL PROCESSING, 1994, 36 (03) :287-314
[7]   Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model [J].
Duong, Ngoc Q. K. ;
Vincent, Emmanuel ;
Gribonval, Remi .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07) :1830-1840
[8]   Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].
Fevotte, Cedric ;
Bertin, Nancy ;
Durrieu, Jean-Louis .
NEURAL COMPUTATION, 2009, 21 (03) :793-830
[9]  
FitzGerald D., 2005, IEE Irish Signals and Systems Conference 2005, P8, DOI 10.1049/cp:20050279
[10]  
Hiroe A, 2006, LECT NOTES COMPUT SC, V3889, P601