Machine Learning Source Separation Using Maximum A Posteriori Nonnegative Matrix Factorization

Cited by: 50

Authors:

Gao, Bin [1]

Woo, W. L. [2]

Ling, Bingo W-K. [3]

Affiliations:

[1] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 610054, Peoples R China

[2] Newcastle Univ, Sch Elect & Elect Engn, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England

[3] Guangdong Univ Technol, Fac Engn, Guangzhou, Guangdong, Peoples R China

Source:

IEEE TRANSACTIONS ON CYBERNETICS | 2014 / Vol. 44 / Issue 07

Keywords:

Blind signal separation; Itakura-Saito divergence; non-negative matrix factorization; single channel; signal processing; BLIND SOURCE SEPARATION; SIGNAL SEPARATION;

DOI:

10.1109/TCYB.2013.2281332

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

A novel unsupervised machine learning algorithm for single channel source separation is presented. The proposed method is based on nonnegative matrix factorization, which is optimized under the framework of maximum a posteriori probability and Itakura-Saito divergence. The method enables a generalized criterion for variable sparseness to be imposed onto the solution and prior information to be explicitly incorporated through the basis vectors. In addition, the method is scale invariant where both low and high energy components of a signal are treated with equal importance. The proposed algorithm is a more complete and efficient approach for matrix factorization of signals that exhibit temporal dependency of the frequency patterns. Experimental tests have been conducted and compared with other algorithms to verify the efficiency of the proposed method.
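
The scale-invariance property mentioned in the abstract can be illustrated with a minimal sketch. It assumes the standard scalar definition of the Itakura-Saito divergence, d_IS(x | y) = x/y - log(x/y) - 1, and compares it with a squared error; the function names and sample values are hypothetical, and this is not the paper's maximum a posteriori algorithm.

```python
import math


def is_divergence(x, y):
    """Itakura-Saito divergence d_IS(x | y) = x/y - log(x/y) - 1, for x, y > 0."""
    ratio = x / y
    return ratio - math.log(ratio) - 1.0


def squared_error(x, y):
    """Squared Euclidean cost, shown only for contrast (not scale invariant)."""
    return (x - y) ** 2


# Hypothetical pair: an observed power value and its model approximation.
observed, approximated = 4.0, 2.0

# Rescale both values to mimic low-energy and high-energy components.
for scale in (1e-3, 1.0, 1e3):
    d_is = is_divergence(scale * observed, scale * approximated)
    d_sq = squared_error(scale * observed, scale * approximated)
    print(f"scale={scale:g}  IS={d_is:.6f}  squared_error={d_sq:.6g}")
```

Because the divergence depends only on the ratio x/y, the printed IS value is the same at every scale, while the squared error grows with the square of the scale. This is one way to read the abstract's statement that low and high energy components are treated with equal importance.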

Pages: 1169 - 1179

Page count: 11

50 records in total

[21] SPARSENESS-BASED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION
Higuchi, Takuya
Yoshioka, Takuya
Nakatani, Tomohiro
[J]. 2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
[22] Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation
Lee, Seokjin
Park, Sang Ha
Sung, Koeng-Mo
[J]. IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (01) : 43 - 46
[23] Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization
Carabias-Orti, Julio Jose
Nikunen, Joonas
Virtanen, Tuomas
Vera-Candeas, Pedro
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1512 - 1527
[24] Regularized nonnegative matrix factorization using Gaussian mixture priors for supervised single channel source separation
Grais, Emad M.
Erdogan, Hakan
[J]. COMPUTER SPEECH AND LANGUAGE, 2013, 27 (03) : 746 - 762
[25] Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation
Fontaine, Mathieu
Sekiguchi, Kouhei
Nugraha, Aditya Arie
Bando, Yoshiaki
Yoshii, Kazuyoshi
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1734 - 1748
[26] Heart-lung sound separation by nonnegative matrix factorization and deep learning
Wang, Weibo
Wang, Shubo
Qin, Dimei
Fang, Yu
Zheng, Yongkang
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
[27] Convolutive Transfer Function-Based Multichannel Nonnegative Matrix Factorization for Overdetermined Blind Source Separation
Wang, Taihui
Yang, Feiran
Yang, Jun
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 802 - 815
[28] Minimum-Volume Multichannel Nonnegative Matrix Factorization for Blind Audio Source Separation
Wang, Jianyu
Guan, Shanzheng
Liu, Shupei
Zhang, Xiao-Lei
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 3089 - 3103
[29] Ray-Space-Based Multichannel Nonnegative Matrix Factorization for Audio Source Separation
Pezzoli, Mirco
Carabias-Orti, Julio Jose
Cobos, Maximo
Antonacci, Fabio
Sarti, Augusto
[J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 369 - 373
[30] Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation
Laroche, Clement
Kowalski, Matthieu
Papadopoulos, Helene
Richard, Gael
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1499 - 1511