Nonnegative matrix factorization 2D with the flexible β-Divergence for Single Channel Source Separation

被引:0
作者
Yu, Kaiwen [1 ]
Woo, W. L. [1 ]
Dlay, S. S. [1 ]
机构
[1] Newcastle Univ, Sch Elect & Elect Engn, Newcastle Upon Tyne, Tyne & Wear, England
来源
2015 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2015) | 2015年
关键词
Single channel source separation; audio processing; non-negative matrix factorization; beta-Divergence; maximization-minimization; FEATURES;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents an algorithm for nonnegative matrix factorization 2D (NMF-2D) with the flexible beta-Divergence. The beta-Divergence is a group of cost functions parametrized by a single parameter beta. The Least Squares divergence, Kullback-Leibler divergence and the Itakura-Saito divergence are special cases (beta=2,1,0). This paper presents a more complete algorithm which uses a flexible range of beta, instead of be limited to just special cases. We describe a maximization-minimization (MM) algorithm lead to multiplicative updates. The proposed factorization decomposes an information-bearing matrix into two-dimensional convolution of factor matrices that represent the spectral dictionary and temporal codes with enhanced performance. The method is demonstrated on the separation of audio mixtures recorded from a single channel. Experimental tests and comparisons with other factorization methods have been conducted to verify the efficacy of the proposed method.
引用
收藏
页数:5
相关论文
共 19 条
  • [1] Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription
    Bertin, Nancy
    Badeau, Roland
    Vincent, Emmanuel
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 538 - 549
  • [2] Nonnegative features of spectro-temporal sounds for classification
    Cho, YC
    Choi, SJ
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (09) : 1327 - 1336
  • [3] Cichocki A, 2006, LECT NOTES COMPUT SC, V3889, P32
  • [4] Fevotte C., NEURAL COMPUT, V23, P2421, DOI [10.1162/NECO_a_00168, DOI 10.1162/NEC0_A_00168]
  • [5] Cochleagram-based audio pattern separation using two-dimensional non-negative matrix factorization with automatic sparsity adaptation
    Gao, Bin
    Woo, W. L.
    Khor, C.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (03) : 1171 - 1185
  • [6] Variational Regularized 2-D Nonnegative Matrix Factorization
    Gao, Bin
    Woo, W. L.
    Dlay, S. S.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (05) : 703 - 716
  • [7] Single-Channel Source Separation Using EMD-Subband Variable Regularized Sparse Features
    Gao, Bin
    Woo, W. L.
    Dlay, S. S.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 961 - 976
  • [8] Goto M., 2003, P 4 INT C MUS INF RE, P229
  • [9] A generalized divergence measure for nonnegative matrix factorization
    Kompass, Raul
    [J]. NEURAL COMPUTATION, 2007, 19 (03) : 780 - 791
  • [10] Learning the parts of objects by non-negative matrix factorization
    Lee, DD
    Seung, HS
    [J]. NATURE, 1999, 401 (6755) : 788 - 791