TRANSDUCTIVE NONNEGATIVE MATRIX FACTORIZATION FOR SEMI-SUPERVISED HIGH-PERFORMANCE SPEECH SEPARATION

被引:9
作者
Guan, Naiyang [1 ]
Lan, Long [1 ]
Tao, Dacheng [2 ]
Luo, Zhigang [1 ]
Yang, Xuejun [3 ]
机构
[1] Natl Univ Def Technol, Sch Comp Sci, Sci & Technol Parallel & Distributed Proc Lab, Changsha 410073, Hunan, Peoples R China
[2] Univ Technol Sydney, Ctr Quantum Computat & Intelligent Syst, FEIT, Sydney, NSW 2007, Australia
[3] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年
关键词
Nonnegative matrix factorization; transductive learning; speech separation;
D O I
10.1109/ICASSP.2014.6854057
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Regarding the non-negativity property of the magnitude spectrogram of speech signals, nonnegative matrix factorization (NMF) has obtained promising performance for speech separation by independently learning a dictionary on the speech signals of each known speaker. However, traditional NMF fails to represent the mixture signals accurately because the dictionaries for speakers are learned in the absence of mixture signals. In this paper, we propose a new transductive NMF algorithm (TNMF) to jointly learn a dictionary on both speech signals of each speaker and the mixture signals to be separated. Since TNMF learns a more descriptive dictionary by encoding the mixture signals than that learned by NMF, it significantly boosts the separation performance. Experiments results on a popular TIMIT dataset show that the proposed TNMF-based methods outperform traditional NMF-based methods for separating the monophonic mixtures of speech signals of known speakers.
引用
收藏
页数:5
相关论文
共 19 条
[1]  
Fevotte C., 2011, MAJORIZATION MINIMIZ
[2]   Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].
Fevotte, Cedric ;
Bertin, Nancy ;
Durrieu, Jean-Louis .
NEURAL COMPUTATION, 2009, 21 (03) :793-830
[3]  
Guan N., 2012, ARXIV12073438
[4]   Online Nonnegative Matrix Factorization With Robust Stochastic Approximation [J].
Guan, Naiyang ;
Tao, Dacheng ;
Luo, Zhigang ;
Yuan, Bo .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (07) :1087-1099
[5]   NeNMF: An Optimal Gradient Method for Nonnegative Matrix Factorization [J].
Guan, Naiyang ;
Tao, Dacheng ;
Luo, Zhigang ;
Yuan, Bo .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (06) :2882-2898
[6]   Non-negative Patch Alignment Framework [J].
Guan, Naiyang ;
Tao, Dacheng ;
Luo, Zhigang ;
Yuan, Bo .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (08) :1218-1230
[7]   Manifold Regularized Discriminative Nonnegative Matrix Factorization With Fast Gradient Descent [J].
Guan, Naiyang ;
Tao, Dacheng ;
Luo, Zhigang ;
Yuan, Bo .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (07) :2030-2048
[8]  
Joachims T, 1999, MACHINE LEARNING, PROCEEDINGS, P200
[9]  
Joder C., 2012, P 10 INT C LAT VAR A, V7191, P323
[10]  
Joder C, 2013, INT CONF ACOUST SPEE, P131, DOI 10.1109/ICASSP.2013.6637623