ALPHA-STABLE LOW-RANK PLUS RESIDUAL DECOMPOSITION FOR SPEECH ENHANCEMENT

Cited by: 0

Authors:

Simsekli, Umut [1]

Erdogan, Halil [2]

Leglaive, Simon [1]

Liutkus, Antoine [3,4]

Badeau, Roland [1]

Richard, Gael [1]

Affiliations:

[1] Univ Paris Saclay, LTCI, Telecom ParisTech, F-75013 Paris, France

[2] Sabanci Univ, Fac Engn & Nat Sci, Istanbul, Turkey

[3] INRIA, Montpellier, France

[4] LIRMM, Montpellier, France

Source:

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018

Keywords:

Alpha-stable distributions; Audio source separation; Speech enhancement; Monte Carlo Expectation-Maximization; NONNEGATIVE MATRIX FACTORIZATION; DIVERGENCE; ALGORITHMS; SEPARATION; MODELS; SPARSE;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

In this study, we propose a novel probabilistic model for separating clean speech signals from noisy mixtures by decomposing the mixture spectra into a structured speech part and a more flexible residual part. The main novelty of our model is that it uses a family of heavy-tailed distributions, the so-called alpha-stable distributions, to model the residual signal. We develop an expectation-maximization algorithm for parameter estimation and a Monte Carlo scheme for posterior estimation of the clean speech. Our experiments show that the proposed method outperforms relevant factorization-based algorithms by a significant margin.
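The heavy-tailed behaviour that the abstract attributes to alpha-stable distributions can be illustrated with a minimal sketch. This is not the authors' code and not the model from the paper. It assumes the standard Chambers-Mallows-Stuck construction for the symmetric case only, and the names `sample_sas` and `tail_fraction` are hypothetical helpers introduced here for illustration.

```python
import math
import random


def sample_sas(alpha, rng):
    """Draw one standard symmetric alpha-stable sample, 0 < alpha <= 2.

    Assumption: Chambers-Mallows-Stuck construction, symmetric case
    (skewness 0, unit scale, zero location).
    """
    u = rng.uniform(-math.pi / 2, math.pi / 2)  # uniform angle
    w = rng.expovariate(1.0)                    # unit-rate exponential
    if alpha == 1.0:
        return math.tan(u)                      # Cauchy special case
    return (math.sin(alpha * u) / math.cos(u) ** (1.0 / alpha)
            * (math.cos(u - alpha * u) / w) ** ((1.0 - alpha) / alpha))


def tail_fraction(alpha, threshold, n, rng):
    """Fraction of n samples whose magnitude exceeds the threshold."""
    hits = sum(1 for _ in range(n) if abs(sample_sas(alpha, rng)) > threshold)
    return hits / n


if __name__ == "__main__":
    # alpha = 2 is the Gaussian case (variance 2); smaller alpha means heavier tails.
    for a in (2.0, 1.6, 1.2):
        frac = tail_fraction(a, 5.0, 20000, random.Random(0))
        print(f"alpha={a}: fraction with |x| > 5 is {frac:.4f}")
```

Running the script prints how often a sample exceeds a fixed magnitude for several values of alpha. Lower values of alpha should produce large outliers more often, which is the property that makes such a distribution a flexible model for a residual signal.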

Citation

Pages: 651-655

Page count: 5

34 references in total

