Joint-Diagonalizability-Constrained Multichannel Nonnegative Matrix Factorization Based on Multivariate Complex Student's t-distribution

Cited by: 0
Authors
Kamo, Keigo [1]
Kubo, Yuki [1]
Takamune, Norihiro [1]
Kitamura, Daichi [2]
Saruwatari, Hiroshi [1]
Takahashi, Yu [3]
Kondo, Kazunobu [3]
Affiliations
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
[2] Kagawa Coll, Natl Inst Technol, Mitoya, Kagawa, Japan
[3] Yamaha Corp, Shizuoka, Japan
Source
2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC) | 2020
Keywords
INDEPENDENT VECTOR ANALYSIS; SEPARATION; ICA; MIXTURES;
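DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract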
In this paper, we propose a generalization of the generative model underlying FastMNMF, a fast version of multichannel nonnegative matrix factorization. FastMNMF is a blind source separation (BSS) method that assumes the spatial covariance matrices of multiple sources are jointly diagonalizable. To further improve its source-separation performance, we introduce a multivariate complex Student's t-distribution as the generative model; this distribution includes, as a special case, the multivariate complex Gaussian distribution used in conventional FastMNMF. We derive a new parameter update rule using the auxiliary-function-based method and demonstrate the validity of the proposed method through BSS experiments on music sources.
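
The joint-diagonalizability assumption mentioned in the abstract can be illustrated with a small NumPy sketch. It is only an illustration under stated assumptions, not the authors' implementation: the abstract gives no formulas, so the parameterization R_n = Q^{-1} diag(g_n) Q^{-H} with a single shared matrix Q, and all names used here (`make_jointly_diagonalizable_scms`, `off_diagonal_energy`, `Q`, `g`), are hypothetical choices made for this example.

```python
import numpy as np


def make_jointly_diagonalizable_scms(num_sources, num_channels, seed=0):
    """Build toy spatial covariance matrices (SCMs) sharing one diagonalizer.

    Assumed parameterization (not taken from the abstract):
        R_n = Q^{-1} diag(g_n) Q^{-H},
    where Q is common to all sources and g_n holds nonnegative weights.
    """
    rng = np.random.default_rng(seed)
    shape = (num_channels, num_channels)
    # Shared complex matrix Q; a random draw is invertible with probability 1.
    Q = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
    Q_inv = np.linalg.inv(Q)
    # One vector of positive diagonal weights per source.
    g = rng.uniform(0.1, 1.0, size=(num_sources, num_channels))
    scms = np.stack([Q_inv @ np.diag(g_n) @ Q_inv.conj().T for g_n in g])
    return Q, g, scms


def off_diagonal_energy(Q, scm):
    """Frobenius norm of the off-diagonal part of Q R Q^H."""
    D = Q @ scm @ Q.conj().T
    return np.linalg.norm(D - np.diag(np.diag(D)))


if __name__ == "__main__":
    Q, g, scms = make_jointly_diagonalizable_scms(num_sources=3, num_channels=2)
    for n, scm in enumerate(scms):
        # Every SCM becomes (numerically) diagonal under the same Q.
        print(n, off_diagonal_energy(Q, scm))
```

Because every source shares the same Q in this sketch, transforming the observation by Q once turns all full covariance matrices into diagonal ones. This kind of structure is what lets a model avoid per-source matrix inversions.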
Pages: 869-874
Page count: 6
Related papers
25 in total
[1]   Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model [J].
Duong, Ngoc Q. K. ;
Vincent, Emmanuel ;
Gribonval, Remi .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07) :1830-1840
[2]  
Hiroe A, 2006, LECT NOTES COMPUT SC, V3889, P601
[3]  
Hunter DR, 2000, J COMPUT GRAPH STAT, V9, P60
[4]  
Ito N, 2019, INT CONF ACOUST SPEE, P371, DOI [10.1109/ICASSP.2019.8682291, 10.1109/icassp.2019.8682291]
[5]  
Kim T, 2006, LECT NOTES COMPUT SC, V3889, P165
[6]   Blind source separation exploiting higher-order frequency dependencies [J].
Kim, Taesu ;
Attias, Hagai T. ;
Lee, Soo-Young ;
Lee, Te-Won .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01) :70-79
[7]  
Kitamura D, OPEN DATASET SONGKIT
[8]  
Kitamura D, 2018, SIGNALS COMMUN TECHN, P125, DOI 10.1007/978-3-319-73031-8_6
[9]   Generalized independent low-rank matrix analysis using heavy-tailed distributions for blind source separation [J].
Kitamura, Daichi ;
Mogami, Shinichi ;
Mitsui, Yoshiki ;
Takamune, Norihiro ;
Saruwatari, Hiroshi ;
Ono, Nobutaka ;
Takahashi, Yu ;
Kondo, Kazunobu .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2018,
[10]   Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization [J].
Kitamura, Daichi ;
Ono, Nobutaka ;
Sawada, Hiroshi ;
Kameoka, Hirokazu ;
Saruwatari, Hiroshi .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) :1626-1641