Laplace Nonnegative Matrix Factorization with Application to Semi-supervised Audio Denoising

被引:3
作者
Tanji, Hiroki [1 ]
Murakami, Takahiro [1 ,2 ]
Kamata, Hiroyuki [1 ]
机构
[1] Meiji Univ, Dept Elect & Bioinformat, Sch Sci & Technol, Tokyo, Japan
[2] Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Guildford, Surrey, England
来源
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2019年
关键词
complex Laplace distribution; nonnegative matrix factorization; majorization-minimization algorithm; source separation; SEPARATION;
D O I
10.23919/eusipco.2019.8903074
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes two statistical models for the nonnegative matrix factorization (NMF) based on heavy-tailed distributions. In the NMF for acoustic signals, previous works justify the additivity of an observed spectrogram using the reproductive property of a probability density function. However, the effectiveness of these properties is not clear. Consequently, to construct a model robust to noise, statistical models based on heavy-tailed distributions are recently growing up. In this paper, as heavy-tailed models for the NMF, we introduce statistical models based on the complex Laplace distributions, and call them Laplace-NMF. Moreover, we derive convergence-guaranteed optimization algorithms to estimate parameters. From our formulation, a statistical interpretation of the Itakura-Saito (IS) divergence-based NMF is newly revealed. We confirm the effectiveness of Laplace-NMF in semi-supervised audio denoising.
引用
收藏
页数:5
相关论文
共 27 条
[21]   Alpha-Stable Matrix Factorization [J].
Simsekli, Umut ;
Liutkus, Antoine ;
Cemgil, Ali Taylan .
IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (12) :2289-2293
[22]   Non-negative matrix factorization for polyphonic music transcription [J].
Smaragdis, P ;
Brown, JC .
2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS, 2003, :177-180
[23]  
Smaragdis P, 2007, LECT NOTES COMPUT SC, V4666, P414
[24]   Convolutive speech bases and their application to supervised speech separation [J].
Smaragdis, Paris .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01) :1-12
[25]   Performance measurement in blind audio source separation [J].
Vincent, Emmanuel ;
Gribonval, Remi ;
Févotte, Cedric .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04) :1462-1469
[26]   Speech denoising using nonnegative matrix factorization with priors [J].
Wilson, Kevin W. ;
Raj, Bhiksha ;
Smaragdis, Paris ;
Divakaran, Ajay .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :4029-+
[27]  
Yoshii K, 2016, INT CONF ACOUST SPEE, P51, DOI 10.1109/ICASSP.2016.7471635