UNDER-DETERMINED CONVOLUTIVE BLIND SOURCE SEPARATION USING SPATIAL COVARIANCE MODELS

被引:10
作者
Duong, Ngoc Q. K. [1 ]
Vincent, Emmanuel [1 ]
Gribonval, Remi [1 ]
机构
[1] IRISA INRIA, METISS Project Team, F-35042 Rennes, France
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
关键词
Convolutive blind source separation; under-determined mixtures; spatial covariance models; EM algorithm; permutation problem; MAXIMUM-LIKELIHOOD;
D O I
10.1109/ICASSP.2010.5496284
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper deals with the problem of under-determined convolutive blind source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random variable whose covariance encodes the spatial properties of the source. We consider two covariance models and address the estimation of their parameters from the recorded mixture by a suitable initialization scheme followed by an iterative expectation-maximization (EM) procedure in each frequency bin. We then align the order of the estimated sources across all frequency bins based on their estimated directions of arrival (DOA). Experimental results over a stereo reverberant speech mixture show the effectiveness of the proposed approach.
引用
收藏
页码:9 / 12
页数:4
相关论文
共 9 条
[1]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[2]  
Duong Ngoc Q. K., 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), P129, DOI 10.1109/ASPAA.2009.5346503
[3]   Maximum likelihood approach for blind audio source separation using time-frequency Gaussian source models [J].
Févotte, C ;
Cardoso, JF .
2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, :78-81
[4]   MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION IN CONVOLUTIVE MIXTURES. WITH APPLICATION TO BLIND AUDIO SOURCE SEPARATION. [J].
Ozerov, Alexey ;
Fevotte, Cedric .
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, :3137-+
[5]   Grouping separated frequency components by estimating propagation model parameters in frequency-domain blind source separation [J].
Sawada, Hiroshi ;
Araki, Shoko ;
Mukai, Ryo ;
Makino, Shoji .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05) :1592-1604
[6]  
Vincent E., MACHINE AUD IN PRESS
[7]  
Vincent E, 2007, LECT NOTES COMPUT SC, V4666, P552
[8]   MAP-based underdetermined blind source separation of convolutive mixtures by hierarchical clustering and l1-norm minimization [J].
Winter, Stefan ;
Kellermann, Walter ;
Sawada, Hiroshi ;
Makino, Shoji .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)
[9]   Blind separation of speech mixtures via time-frequency masking [J].
Yilmaz, Ö ;
Rickard, S .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (07) :1830-1847