Speech Source Separation in Convolutive Environments Using Space-Time-Frequency Analysis

Cited by: 0
Authors
Shlomo Dubnov
Joseph Tabrikian
Miki Arnon-Targan
Affiliations
[1] CALIT 2, Department of Electrical and Computer Engineering
[2] University of California
[3] Ben-Gurion University of the Negev
Source
EURASIP Journal on Advances in Signal Processing, Volume 2006
Keywords
Information Technology; Transfer Function; Correlation Matrix; Multiple Time; Quantum Information;
DOI
Not available
CLC classification number
Subject classification number
Abstract
We propose a new method for speech source separation that is based on directionally-disjoint estimation of the transfer functions between microphones and sources at different frequencies and at multiple times. The spatial transfer functions are estimated from eigenvectors of the microphones' correlation matrix. Smoothing and association of transfer function parameters across different frequencies are performed by simultaneous extended Kalman filtering of the amplitude and phase estimates. This approach allows transfer function estimation even if the number of sources is greater than the number of microphones, and it can operate for both wideband and narrowband sources. The performance of the proposed method was studied via simulations and the results show good performance.
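The eigenvector step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single frequency bin in which only one source is active (the directionally disjoint case), and the function name `estimate_relative_transfer`, the three-microphone setup, and the synthetic signal and noise levels are all hypothetical choices made for illustration. Under those assumptions the principal eigenvector of the microphones' correlation matrix is proportional to the source-to-microphone transfer vector, so normalizing by the first microphone gives a relative transfer function. The Kalman smoothing across frequencies described in the abstract is not shown.

```python
import numpy as np

def estimate_relative_transfer(snapshots):
    """Estimate a relative transfer vector for one frequency bin.

    snapshots: complex array of shape (n_mics, n_frames), assumed to
    contain a single dominant source plus weak noise (an assumption of
    this sketch, not a statement about the paper's algorithm).
    """
    n_frames = snapshots.shape[1]
    # Sample correlation matrix of the microphone signals.
    R = snapshots @ snapshots.conj().T / n_frames
    # eigh returns eigenvalues in ascending order for Hermitian matrices,
    # so the last column is the principal eigenvector.
    _, eigvecs = np.linalg.eigh(R)
    v = eigvecs[:, -1]
    # Remove the arbitrary complex scale by referencing microphone 0.
    return v / v[0]

# Synthetic check with a hypothetical 3-microphone transfer vector.
rng = np.random.default_rng(0)
true_a = np.array([1.0, 0.8 * np.exp(-1j * 0.6), 0.5 * np.exp(1j * 1.1)])
n_frames = 2000
s = rng.standard_normal(n_frames) + 1j * rng.standard_normal(n_frames)
noise = 0.01 * (rng.standard_normal((3, n_frames))
                + 1j * rng.standard_normal((3, n_frames)))
x = np.outer(true_a, s) + noise
est = estimate_relative_transfer(x)
```

In this synthetic setting `est` should lie close to `true_a`, because the noise is weak relative to the source. With several simultaneously active sources in the same bin, the principal eigenvector would no longer isolate one transfer vector.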