Laplace Nonnegative Matrix Factorization with Application to Semi-supervised Audio Denoising

被引：3

作者：

Tanji, Hiroki ^{[1
]}

Murakami, Takahiro ^{[1
,2
]}

Kamata, Hiroyuki ^{[1
]}

机构：

[1] Meiji Univ, Dept Elect & Bioinformat, Sch Sci & Technol, Tokyo, Japan

[2] Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Guildford, Surrey, England

来源：

2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2019年

关键词：

complex Laplace distribution; nonnegative matrix factorization; majorization-minimization algorithm; source separation; SEPARATION;

D O I：

10.23919/eusipco.2019.8903074

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper proposes two statistical models for the nonnegative matrix factorization (NMF) based on heavy-tailed distributions. In the NMF for acoustic signals, previous works justify the additivity of an observed spectrogram using the reproductive property of a probability density function. However, the effectiveness of these properties is not clear. Consequently, to construct a model robust to noise, statistical models based on heavy-tailed distributions are recently growing up. In this paper, as heavy-tailed models for the NMF, we introduce statistical models based on the complex Laplace distributions, and call them Laplace-NMF. Moreover, we derive convergence-guaranteed optimization algorithms to estimate parameters. From our formulation, a statistical interpretation of the Itakura-Saito (IS) divergence-based NMF is newly revealed. We confirm the effectiveness of Laplace-NMF in semi-supervised audio denoising.

引用

页数：5

共 27 条

[21] Alpha-Stable Matrix Factorization [J].

Simsekli, Umut ;

Liutkus, Antoine ;

Cemgil, Ali Taylan .

IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (12) :2289-2293

[22] Non-negative matrix factorization for polyphonic music transcription [J].

Smaragdis, P ;

Brown, JC .

2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS, 2003, :177-180

[23]

Smaragdis P, 2007, LECT NOTES COMPUT SC, V4666, P414

[24] Convolutive speech bases and their application to supervised speech separation [J].

Smaragdis, Paris .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01) :1-12

[25] Performance measurement in blind audio source separation [J].

Vincent, Emmanuel ;

Gribonval, Remi ;

Févotte, Cedric .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04) :1462-1469

[26] Speech denoising using nonnegative matrix factorization with priors [J].

Wilson, Kevin W. ;

Raj, Bhiksha ;

Smaragdis, Paris ;

Divakaran, Ajay .

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :4029-+

[27]

Yoshii K, 2016, INT CONF ACOUST SPEE, P51, DOI 10.1109/ICASSP.2016.7471635

← 1 2 3 →