FASTMNMF: JOINT DIAGONALIZATION BASED ACCELERATED ALGORITHMS FOR MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION

被引:0
作者
Ito, Nobutaka [1 ]
Nakatani, Tomohiro [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Kyoto, Japan
来源
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年
关键词
Nonnegative matrix factorization; joint diagonalization; source separation; microphone arrays; BLIND SOURCE SEPARATION; MIXTURES;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A multichannel extension of nonnegative matrix factorization (NMF) for audio/music data, called multichannel NMF (MNMF), has been proposed by Sawada et al. ["Multichannel extensions of non-negative matrix factorization with complex-valued data," IEEE Trans. ASLP, vol. 21, no. 5, pp. 971-982, May 2013]. However, conventional MNMF algorithms have a major drawback of a heavy computational load due to numerous matrix operations, such as matrix inversions and matrix multiplications. Here we propose FastMNMF, accelerated algorithms for the MNMF based on joint diagonalization of matrices. It is well known that, for diagonal matrices, matrix operations reduce to mere scalar operations on diagonal entries. Because of this property, the joint diagonalization results in a significantly reduced computational load compared to conventional MNMF algorithms. This makes the proposed FastMNMF even applicable to a situation with a large database or restricted computational resources.
引用
收藏
页码:371 / 375
页数:5
相关论文
共 29 条
[1]   A blind source separation technique using second-order statistics [J].
Belouchrani, A ;
AbedMeraim, K ;
Cardoso, JF ;
Moulines, E .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (02) :434-444
[2]   GSVD-based optimal filtering for single and multimicrophone speech enhancement [J].
Doclo, S ;
Moonen, M .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (09) :2230-2244
[3]   Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model [J].
Duong, Ngoc Q. K. ;
Vincent, Emmanuel ;
Gribonval, Remi .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07) :1830-1840
[4]   Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].
Fevotte, Cedric ;
Bertin, Nancy ;
Durrieu, Jean-Louis .
NEURAL COMPUTATION, 2009, 21 (03) :793-830
[5]  
Ikeshita R, 2018, INT WORKSH ACOUSTIC, P520, DOI 10.1109/IWAENC.2018.8521340
[6]  
Ito N., 2018, P IWAENC SEP
[7]  
Ito N., 2018, P EUSIPCO SEP
[8]  
Ito N., 2018, P SPRING M AC SOC JA, P427
[9]  
Ito N, 2018, IEEE GLOB CONF SIG, P231, DOI 10.1109/GlobalSIP.2018.8646336
[10]  
Ito N, 2018, EUR SIGNAL PR CONF, P1662, DOI 10.23919/EUSIPCO.2018.8553410