Machine Learning Source Separation Using Maximum A Posteriori Nonnegative Matrix Factorization

被引:50
作者
Gao, Bin [1 ]
Woo, W. L. [2 ]
Ling, Bingo W-K. [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 610054, Peoples R China
[2] Newcastle Univ, Sch Elect & Elect Engn, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
[3] Guangdong Univ Technol, Fac Engn, Guangzhou, Guangdong, Peoples R China
关键词
Blind signal separation; Itakura-Saito divergence; non-negative matrix factorization; single channel; signal processing; BLIND SOURCE SEPARATION; SIGNAL SEPARATION;
D O I
10.1109/TCYB.2013.2281332
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel unsupervised machine learning algorithm for single channel source separation is presented. The proposed method is based on nonnegative matrix factorization, which is optimized under the framework of maximum a posteriori probability and Itakura-Saito divergence. The method enables a generalized criterion for variable sparseness to be imposed onto the solution and prior information to be explicitly incorporated through the basis vectors. In addition, the method is scale invariant where both low and high energy components of a signal are treated with equal importance. The proposed algorithm is a more complete and efficient approach for matrix factorization of signals that exhibit temporal dependency of the frequency patterns. Experimental tests have been conducted and compared with other algorithms to verify the efficiency of the proposed method.
引用
收藏
页码:1169 / 1179
页数:11
相关论文
共 50 条
  • [21] SPARSENESS-BASED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION
    Higuchi, Takuya
    Yoshioka, Takuya
    Nakatani, Tomohiro
    [J]. 2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [22] Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (01) : 43 - 46
  • [23] Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization
    Carabias-Orti, Julio Jose
    Nikunen, Joonas
    Virtanen, Tuomas
    Vera-Candeas, Pedro
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1512 - 1527
  • [24] Regularized nonnegative matrix factorization using Gaussian mixture priors for supervised single channel source separation
    Grais, Emad M.
    Erdogan, Hakan
    [J]. COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03) : 746 - 762
  • [25] Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation
    Fontaine, Mathieu
    Sekiguchi, Kouhei
    Nugraha, Aditya Arie
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1734 - 1748
  • [26] Heart-lung sound separation by nonnegative matrix factorization and deep learning
    Wang, Weibo
    Wang, Shubo
    Qin, Dimei
    Fang, Yu
    Zheng, Yongkang
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [27] Convolutive Transfer Function-Based Multichannel Nonnegative Matrix Factorization for Overdetermined Blind Source Separation
    Wang, Taihui
    Yang, Feiran
    Yang, Jun
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 802 - 815
  • [28] Minimum-Volume Multichannel Nonnegative Matrix Factorization for Blind Audio Source Separation
    Wang, Jianyu
    Guan, Shanzheng
    Liu, Shupei
    Zhang, Xiao-Lei
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 3089 - 3103
  • [29] Ray-Space-Based Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Pezzoli, Mirco
    Carabias-Orti, Julio Jose
    Cobos, Maximo
    Antonacci, Fabio
    Sarti, Augusto
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 369 - 373
  • [30] Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1499 - 1511