ALPHA-STABLE LOW-RANK PLUS RESIDUAL DECOMPOSITION FOR SPEECH ENHANCEMENT

被引:0
作者
Simsekli, Umut [1 ]
Erdogan, Halil [2 ]
Leglaive, Simon [1 ]
Liutkus, Antoine [3 ,4 ]
Badeau, Roland [1 ]
Richard, Gael [1 ]
机构
[1] Univ Paris Saclay, LTCI, Telecom ParisTech, F-75013 Paris, France
[2] Sabanci Univ, Fac Engn & Nat Sci, Istanbul, Turkey
[3] INRIA, Montpellier, France
[4] LIRMM, Montpellier, France
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年
关键词
Alpha-stable distributions; Audio source separation; Speech enhancement; Monte Carlo Expectation-Maximization; NONNEGATIVE MATRIX FACTORIZATION; DIVERGENCE; ALGORITHMS; SEPARATION; MODELS; SPARSE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this study, we propose a novel probabilistic model for separating clean speech signals from noisy mixtures by decomposing the mixture spectra into a structured speech part and a more flexible residual part. The main novelty in our model is that it uses a family of heavy-tailed distributions, so called the alpha-stable distributions, for modeling the residual signal. We develop an expectation-maximization algorithm for parameter estimation and a Monte Carlo scheme for posterior estimation of the clean speech. Our experiments show that the proposed method outperforms relevant factorization-based algorithms by a significant margin.
引用
收藏
页码:651 / 655
页数:5
相关论文
共 34 条
[1]  
[Anonymous], 1994, STABLE NONGAUSSIAN R
[2]  
Bassiou N, 2013, INT SYMP IMAGE SIG, P382
[3]   METHOD FOR SIMULATING STABLE RANDOM-VARIABLES [J].
CHAMBERS, JM ;
MALLOWS, CL ;
STUCK, BW .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1976, 71 (354) :340-344
[4]  
Chen Z, 2013, P IEEE WORKSH APPL S, P1
[5]  
Cohen I, 2001, INT CONF ACOUST SPEE, P661, DOI 10.1109/ICASSP.2001.940918
[6]   Sparse Hidden Markov Models for Speech Enhancement in Non-Stationary Noise Environments [J].
Deng, Feng ;
Bao, Changchun ;
Kleijn, W. Bastiaan .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) :1973-1987
[7]  
Févotte C, 2013, INT CONF ACOUST SPEE, P3158, DOI 10.1109/ICASSP.2013.6638240
[8]   Algorithms for Nonnegative Matrix Factorization with the β-Divergence [J].
Fevotte, Cedric ;
Idier, Jerome .
NEURAL COMPUTATION, 2011, 23 (09) :2421-2456
[9]   Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].
Fevotte, Cedric ;
Bertin, Nancy ;
Durrieu, Jean-Louis .
NEURAL COMPUTATION, 2009, 21 (03) :793-830
[10]  
Fontaine M., 2017, WASPAA