MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION IN CONVOLUTIVE MIXTURES. WITH APPLICATION TO BLIND AUDIO SOURCE SEPARATION.

被引:13
作者
Ozerov, Alexey [1 ]
Fevotte, Cedric [2 ]
机构
[1] TELECOM ParisTech, CNRS LTCI, Inst TELECOM, 37-39 Rue Dareau, F-75014 Paris, France
[2] TELECOM ParisTech, CNRS LTCI, F-75014 Paris, France
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年
关键词
Multichannel audio; nonnegative matrix factorization; nonnegative tensor factorization; underdetermined convolutive blind source separation;
D O I
10.1109/ICASSP.2009.4960289
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We consider inference in a general data-driven object-based model of multichannel audio data, assumed generated as a possibly underdetermined convolutive mixture of source signals. Each source is given a model inspired from nonnegative matrix factorization (NMF) with the Itakura-Saito divergence, which underlies a statistical model of superimposed Gaussian components. We address estimation of the mixing and source parameters using two methods. The first one consists of maximizing the exact joint likelihood of the multichannel data using an expectation-maximization algorithm. The second method consists of maximizing the sum of individual likelihoods of all channels using a multiplicative update algorithm inspired from NMF methodology. Our decomposition algorithms were applied to stereo music and assessed in terms of blind source separation performance.
引用
收藏
页码:3137 / +
页数:2
相关论文
共 10 条
[1]  
Abdallah S.A., 2004, P INT C MUS INF RETR, P318
[2]  
ATTIAS H, 2003, P IEEE INT C AC SPEE
[3]  
CARDOSO JF, 2002, P EUSIPCO, V1, P561
[4]  
CICHOCKI A, 2006, P 6 INT C IND COMP A, P32
[5]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[6]  
FEVOTTE C, 2009, NEURAL COMPUTATION, V21
[7]  
FitzGerald D., 2005, P IR SIGN SYST C DUB
[8]  
Makino S, 2007, SIGNALS COMMUN TECHN, P1, DOI 10.1007/978-1-4020-6479-1
[9]  
MOULINES E, 1997, P IEEE INT C AC SPEE
[10]  
Vincent E, 2007, LECT NOTES COMPUT SC, V4666, P552