Sparse Reverberant Audio Source Separation via Reweighted Analysis

被引:17
作者
Arberet, Simon [1 ]
Vandergheynst, Pierre [1 ]
Carrillo, Rafael E. [1 ]
Thiran, Jean-Philippe [1 ]
Wiaux, Yves [1 ]
机构
[1] Ecole Polytech Fed Lausanne, Dept Elect Engn, Signal Proc Lab, CH-1015 Lausanne, Switzerland
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013年 / 21卷 / 07期
关键词
Convolutive mixture; convex optimization; source separation; sparsity; BLIND SOURCE SEPARATION; THRESHOLDING ALGORITHM; SPEECH; MODEL;
D O I
10.1109/TASL.2013.2250962
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a novel algorithm for source signals estimation from an underdetermined convolutive mixture assuming known mixing filters. Most of the state-of-the-art methods are dealing with anechoic or short reverberant mixture, assuming a synthesis sparse prior in the time-frequency domain and a narrowband approximation of the convolutive mixing process. In this paper, we address the source estimation of convolutive mixtures with a new algorithm based on i) an analysis sparse prior, ii) a reweighting scheme so as to increase the sparsity, iii) a wideband data-fidelity term in a constrained form. We show, through theoretical discussions and simulations, that this algorithm is particularly well suited for source separation of realistic reverberation mixtures. Particularly, the proposed algorithm outperforms state-of-the-art methods on reverberant mixtures of audio sources by more than 2 dB of signal-to-distortion ratio on the BSS Oracle dataset.
引用
收藏
页码:1391 / 1402
页数:12
相关论文
共 44 条
[1]  
Araki S, 2005, INT CONF ACOUST SPEE, P81
[2]  
Arberet S., 2010, 2010 10th International Conference on Information Sciences, Signal Processing and their Applications (ISSPA 2010), P1, DOI 10.1109/ISSPA.2010.5605570
[3]   A tractable framework for estimating and combining spectral source models for audio source separation [J].
Arberet, Simon ;
Ozerov, Alexey ;
Bimbot, Frederic ;
Gribonval, Remi .
SIGNAL PROCESSING, 2012, 92 (08) :1886-1901
[4]  
Arberet S, 2011, INT CONF ACOUST SPEE, P2876
[5]   A Robust Method to Count and Locate Audio Sources in a Multichannel Underdetermined Mixture [J].
Arberet, Simon ;
Gribonval, Remi ;
Bimbot, Frederic .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (01) :121-133
[6]   A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems [J].
Beck, Amir ;
Teboulle, Marc .
SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01) :183-202
[7]  
Belouchrani A, 1998, IEEE T SIGNAL PROCES, V46, P2888, DOI 10.1109/78.726803
[8]   Underdetermined blind source separation using sparse representations [J].
Bofill, P ;
Zibulevsky, M .
SIGNAL PROCESSING, 2001, 81 (11) :2353-2362
[9]  
Campbell D. R., 2005, COMPUT INF SYST, V9, P48
[10]   Compressed sensing with coherent and redundant dictionaries [J].
Candes, Emmanuel J. ;
Eldar, Yonina C. ;
Needell, Deanna ;
Randall, Paige .
APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2011, 31 (01) :59-73