Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation

被引:43
作者
Kowalski, Matthieu [1 ]
Vincent, Emmanuel [2 ]
Gribonval, Remi [2 ]
机构
[1] Univ Paris Sud, Lab Signaux & Syst, CNRS, SUPELEC,UMR 8506, F-91192 Gif Sur Yvette, France
[2] INRIA, Ctr Inria Rennes Bretagne Atlantique, F-35042 Rennes, France
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2010年 / 18卷 / 07期
关键词
Convex optimization; convolutive mixture; mixed norms; narrowband approximation; source separation; THRESHOLDING ALGORITHM; BLIND SEPARATION; SHRINKAGE; MIXTURES;
D O I
10.1109/TASL.2010.2050089
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We consider the problem of extracting the source signals from an under-determined convolutive mixture assuming known mixing filters. State-of-the-art methods operate in the time-frequency domain and rely on narrowband approximation of the convolutive mixing process by complex-valued multiplication in each frequency bin. The source signals are then estimated by minimizing either a mixture fitting cost or a source sparsity cost, under possible constraints on the number of active sources. In this paper, we define a wideband l(2) mixture fitting cost circumventing the above approximation and investigate the use of a l(1, 2) mixed-norm cost promoting disjointness of the source time-frequency representations. We design a family of convex functionals combining these costs and derive suitable optimization algorithms. Experiments indicate that the proposed wideband methods result in a signal-to-distortion ratio improvement of 2 to 5 dB compared to the state-of-the-art on reverberant speech mixtures.
引用
收藏
页码:1818 / 1829
页数:12
相关论文
共 31 条
[1]   Blind separation of underdetermined convolutive mixtures using their time-frequency representation [J].
Aissa-El-Bey, Abdeldjalil ;
Abed-Meraim, Karim ;
Grenier, Yves .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05) :1540-1550
[2]  
[Anonymous], P EUR SIGN PROC C EU
[3]   A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems [J].
Beck, Amir ;
Teboulle, Marc .
SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01) :183-202
[4]   Atomic decomposition by basis pursuit [J].
Chen, SSB ;
Donoho, DL ;
Saunders, MA .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1998, 20 (01) :33-61
[5]   Signal recovery by proximal forward-backward splitting [J].
Combettes, PL ;
Wajs, VR .
MULTISCALE MODELING & SIMULATION, 2005, 4 (04) :1168-1200
[6]   An iterative thresholding algorithm for linear inverse problems with a sparsity constraint [J].
Daubechies, I ;
Defrise, M ;
De Mol, C .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 2004, 57 (11) :1413-1457
[7]   On Spatial Aliasing in Microphone Arrays [J].
Dmochowski, Jacek ;
Benesty, Jacob ;
Affes, Sofiene .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (04) :1383-1395
[8]   Piecewise linear source separation [J].
Gribonval, R .
WAVELETS: APPLICATIONS IN SIGNAL AND IMAGE PROCESSING X, PTS 1 AND 2, 2003, 5207 :297-310
[9]   FIXED-POINT CONTINUATION FOR l1-MINIMIZATION: METHODOLOGY AND CONVERGENCE [J].
Hale, Elaine T. ;
Yin, Wotao ;
Zhang, Yin .
SIAM JOURNAL ON OPTIMIZATION, 2008, 19 (03) :1107-1130
[10]  
KELLERMANN W, 2003, P AS C SIGN SYST COM, V2, P1278