Nonnegative matrix factorization 2D with the flexible β-Divergence for Single Channel Source Separation

被引：0

作者：

Yu, Kaiwen ^{[1
]}

Woo, W. L. ^{[1
]}

Dlay, S. S. ^{[1
]}

机构：

[1] Newcastle Univ, Sch Elect & Elect Engn, Newcastle Upon Tyne, Tyne & Wear, England

来源：

2015 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2015) | 2015年

关键词：

Single channel source separation; audio processing; non-negative matrix factorization; beta-Divergence; maximization-minimization; FEATURES;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

This paper presents an algorithm for nonnegative matrix factorization 2D (NMF-2D) with the flexible beta-Divergence. The beta-Divergence is a group of cost functions parametrized by a single parameter beta. The Least Squares divergence, Kullback-Leibler divergence and the Itakura-Saito divergence are special cases (beta=2,1,0). This paper presents a more complete algorithm which uses a flexible range of beta, instead of be limited to just special cases. We describe a maximization-minimization (MM) algorithm lead to multiplicative updates. The proposed factorization decomposes an information-bearing matrix into two-dimensional convolution of factor matrices that represent the spectral dictionary and temporal codes with enhanced performance. The method is demonstrated on the separation of audio mixtures recorded from a single channel. Experimental tests and comparisons with other factorization methods have been conducted to verify the efficacy of the proposed method.

引用

页数：5

共 19 条

[1] Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription
Bertin, Nancy
Badeau, Roland
Vincent, Emmanuel
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 538 - 549
[2] Nonnegative features of spectro-temporal sounds for classification
Cho, YC
Choi, SJ
[J]. PATTERN RECOGNITION LETTERS, 2005, 26 (09) : 1327 - 1336
[3] Cichocki A, 2006, LECT NOTES COMPUT SC, V3889, P32
[4] Fevotte C., NEURAL COMPUT, V23, P2421, DOI [10.1162/NECO_a_00168, DOI 10.1162/NEC0_A_00168]
[5] Cochleagram-based audio pattern separation using two-dimensional non-negative matrix factorization with automatic sparsity adaptation
Gao, Bin
Woo, W. L.
Khor, C.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (03) : 1171 - 1185
[6] Variational Regularized 2-D Nonnegative Matrix Factorization
Gao, Bin
Woo, W. L.
Dlay, S. S.
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (05) : 703 - 716
[7] Single-Channel Source Separation Using EMD-Subband Variable Regularized Sparse Features
Gao, Bin
Woo, W. L.
Dlay, S. S.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 961 - 976
[8] Goto M., 2003, P 4 INT C MUS INF RE, P229
[9] A generalized divergence measure for nonnegative matrix factorization
Kompass, Raul
[J]. NEURAL COMPUTATION, 2007, 19 (03) : 780 - 791
[10] Learning the parts of objects by non-negative matrix factorization
Lee, DD
Seung, HS
[J]. NATURE, 1999, 401 (6755) : 788 - 791

← 1 2 →