Machine Learning Source Separation Using Maximum A Posteriori Nonnegative Matrix Factorization

被引:50
|
作者
Gao, Bin [1 ]
Woo, W. L. [2 ]
Ling, Bingo W-K. [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 610054, Peoples R China
[2] Newcastle Univ, Sch Elect & Elect Engn, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
[3] Guangdong Univ Technol, Fac Engn, Guangzhou, Guangdong, Peoples R China
关键词
Blind signal separation; Itakura-Saito divergence; non-negative matrix factorization; single channel; signal processing; BLIND SOURCE SEPARATION; SIGNAL SEPARATION;
D O I
10.1109/TCYB.2013.2281332
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel unsupervised machine learning algorithm for single channel source separation is presented. The proposed method is based on nonnegative matrix factorization, which is optimized under the framework of maximum a posteriori probability and Itakura-Saito divergence. The method enables a generalized criterion for variable sparseness to be imposed onto the solution and prior information to be explicitly incorporated through the basis vectors. In addition, the method is scale invariant where both low and high energy components of a signal are treated with equal importance. The proposed algorithm is a more complete and efficient approach for matrix factorization of signals that exhibit temporal dependency of the frequency patterns. Experimental tests have been conducted and compared with other algorithms to verify the efficiency of the proposed method.
引用
收藏
页码:1169 / 1179
页数:11
相关论文
共 50 条
  • [1] A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2033 - 2037
  • [2] Geometric Source Separation Method Using Nonnegative Matrix Factorization and Interference Suppression
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2011, E94A (11) : 2442 - 2447
  • [3] Dual-Transform Source Separation Using Sparse Nonnegative Matrix Factorization
    Hossain, Md. Imran
    Islam, Md. Shohidul
    Khatun, Mst. Titasa
    Ullah, Rizwan
    Masood, Asim
    Ye, Zhongfu
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (04) : 1868 - 1891
  • [4] Dual-Transform Source Separation Using Sparse Nonnegative Matrix Factorization
    Md. Imran Hossain
    Md. Shohidul Islam
    Mst. Titasa Khatun
    Rizwan Ullah
    Asim Masood
    Zhongfu Ye
    Circuits, Systems, and Signal Processing, 2021, 40 : 1868 - 1891
  • [5] Discriminative Nonnegative Matrix Factorization Using Cross-Reconstruction Error for Source Separation
    Kwon, Kisoo
    Shin, Jong Won
    Kim, Hyung Yong
    Kim, Nam Soo
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1513 - 1516
  • [6] Online Blind Source Separation Using Incremental Nonnegative Matrix Factorization with Volume Constraint
    Zhou, Guoxu
    Yang, Zuyuan
    Xie, Shengli
    Yang, Jun-Mei
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (04): : 550 - 560
  • [7] Underdetermined blind source separation using normalized spatial covariance matrix and multichannel nonnegative matrix factorization
    Oh, Son-hook
    Kim, Jung-Han
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (02): : 120 - 130
  • [8] Initialization of Nonnegative Matrix Factorization Dictionaries for Single Channel Source Separation
    Grais, Emad M.
    Erdogan, Hakan
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [9] BAYESIAN MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR AUDIO SOURCE SEPARATION AND LOCALIZATION
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 551 - 555
  • [10] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
    Ozerov, Alexey
    Fevotte, Cedric
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563