Joint-diagonalizability-constrained multichannel nonnegative matrix factorization based on time-variant multivariate complex sub-Gaussian distribution

被引:3
作者
Kamo, Keigo [1 ]
Mitsui, Yoshiki [1 ]
Kubo, Yuki [1 ]
Takamune, Norihiro [1 ]
Kitamura, Daichi [2 ]
Saruwatari, Hiroshi [1 ]
Takahashi, Yu [3 ]
Kondo, Kazunobu [3 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138656, Japan
[2] Natl Inst Technol Kagawa Coll, Dept Elect & Comp Engn, Takamatsu, Kagawa 7618058, Japan
[3] Yamaha Corp, Shizuoka 4308650, Japan
关键词
Blind source separation; Spatial covariance matrix; Joint diagonalizability; Multichannel nonnegative matrix; factorization; Sub-Gaussian distribution; INDEPENDENT VECTOR ANALYSIS; AUDIO SOURCE SEPARATION; MIXTURES; ICA; ALGORITHMS;
D O I
10.1016/j.sigpro.2021.108183
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multichannel nonnegative matrix factorization (MNMF) is a common blind source separation technique that employs full-rank spatial covariance matrices (SCMs). The full-rank SCMs can simulate reverberant mixing systems where the sources are spatially spread. In conventional MNMF, spectrograms of observed signals are modeled by some types of distribution, e.g., the Gaussian distribution and Student's t distribution. However, MNMF based on the sub-Gaussian distribution has not been proposed because its cost function is difficult to minimize. In this paper, we address the statistical model extension of MNMF to the sub-Gaussian distribution to improve the source separation accuracy. In the proposed method, the generalized Gaussian distribution is utilized as the sub-Gaussian model. Moreover, to design an auxiliary function for the proposed cost function, we introduce the joint-diagonalizability constraint to SCMs similarly to FastMNMF. Two types of update rule for the proposed MNMF are derived on the basis of the majorization-minimization (MM) and majorization-equalization (ME) algorithms. Since the optimization speed of each parameter affects the source separation performance, we experimentally analyze the best combination of MM-and ME-algorithm-based update rules in the proposed method. Experiments of blind source separation reveal that the proposed MNMF based on the sub-Gaussian model can outperform conventional methods. (c) 2021 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license ( http://creativecommons.org/licenses/by-nc-nd/4.0/ )
引用
收藏
页数:10
相关论文
共 40 条
[1]   Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model [J].
Duong, Ngoc Q. K. ;
Vincent, Emmanuel ;
Gribonval, Remi .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07) :1830-1840
[2]   Algorithms for Nonnegative Matrix Factorization with the β-Divergence [J].
Fevotte, Cedric ;
Idier, Jerome .
NEURAL COMPUTATION, 2011, 23 (09) :2421-2456
[3]  
Fontaine M., 2020, P INT 2020 P INT 2020, P4541
[4]   A multivariate generalization of the power exponential family of distributions [J].
Gomez, E ;
Gomez-Villegas, MA ;
Marin, JM .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1998, 27 (03) :589-600
[5]  
Hiroe A, 2006, LECT NOTES COMPUT SC, V3889, P601
[6]  
Hunter DR, 2000, J COMPUT GRAPH STAT, V9, P60
[7]  
Ikeshita R, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P741, DOI 10.1109/ICASSP.2018.8461387
[8]  
Ito N, 2019, INT CONF ACOUST SPEE, P371, DOI [10.1109/icassp.2019.8682291, 10.1109/ICASSP.2019.8682291]
[9]  
Kamo K., 2020, P EUSIPCO P EUSIPCO, P890
[10]  
Kamo K, 2020, ASIAPAC SIGN INFO PR, P869