Sparse Reverberant Audio Source Separation via Reweighted Analysis

Cited by: 17

Authors:

Arberet, Simon [1]

Vandergheynst, Pierre [1]

Carrillo, Rafael E. [1]

Thiran, Jean-Philippe [1]

Wiaux, Yves [1]

Affiliations:

[1] Ecole Polytech Fed Lausanne, Dept Elect Engn, Signal Proc Lab, CH-1015 Lausanne, Switzerland

Source:

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013 / Vol. 21 / Issue 07

Keywords:

Convolutive mixture; convex optimization; source separation; sparsity; BLIND SOURCE SEPARATION; THRESHOLDING ALGORITHM; SPEECH; MODEL;

DOI:

10.1109/TASL.2013.2250962

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

We propose a novel algorithm for estimating source signals from an underdetermined convolutive mixture, assuming known mixing filters. Most state-of-the-art methods deal with anechoic or short reverberant mixtures, assuming a synthesis sparse prior in the time-frequency domain and a narrowband approximation of the convolutive mixing process. In this paper, we address the source estimation of convolutive mixtures with a new algorithm based on i) an analysis sparse prior, ii) a reweighting scheme so as to increase the sparsity, and iii) a wideband data-fidelity term in a constrained form. We show, through theoretical discussions and simulations, that this algorithm is particularly well suited for source separation of realistic reverberant mixtures. In particular, the proposed algorithm outperforms state-of-the-art methods on reverberant mixtures of audio sources by more than 2 dB of signal-to-distortion ratio on the BSS Oracle dataset.

Citation

Pages: 1391-1402

Number of pages: 12

References: 44 in total

[1]

Araki S, 2005, INT CONF ACOUST SPEE, P81

[2]

Arberet S., 2010, 2010 10th International Conference on Information Sciences, Signal Processing and their Applications (ISSPA 2010), P1, DOI 10.1109/ISSPA.2010.5605570

[3] A tractable framework for estimating and combining spectral source models for audio source separation [J].

Arberet, Simon ;

Ozerov, Alexey ;

Bimbot, Frederic ;

Gribonval, Remi .

SIGNAL PROCESSING, 2012, 92 (08) :1886-1901

[4]

Arberet S, 2011, INT CONF ACOUST SPEE, P2876

[5] A Robust Method to Count and Locate Audio Sources in a Multichannel Underdetermined Mixture [J].

Arberet, Simon ;

Gribonval, Remi ;

Bimbot, Frederic .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (01) :121-133

[6] A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems [J].

Beck, Amir ;

Teboulle, Marc .

SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01) :183-202

[7]

Belouchrani A, 1998, IEEE T SIGNAL PROCES, V46, P2888, DOI 10.1109/78.726803

[8] Underdetermined blind source separation using sparse representations [J].

Bofill, P ;

Zibulevsky, M .

SIGNAL PROCESSING, 2001, 81 (11) :2353-2362

[9]

Campbell D. R., 2005, COMPUT INF SYST, V9, P48

[10] Compressed sensing with coherent and redundant dictionaries [J].

Candes, Emmanuel J. ;

Eldar, Yonina C. ;

Needell, Deanna ;

Randall, Paige .

APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2011, 31 (01) :59-73
