Time frequency source separation and direction of arrival estimation in a 3D soundscape environment

被引:5
作者
Bunting, O. [1 ]
Chesmore, D. [1 ]
机构
[1] Univ York, Dept Elect, York YO10 5DD, N Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
Sparse; Separation; Time-frequency; Coincident array; B-format; Soundscape; BLIND SEPARATION;
D O I
10.1016/j.apacoust.2011.05.018
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The DUET (Degenerative Unmixing and Estimation Technique) algorithm has been shown to be an effective method for the separation of multiple sources from a stereo mixture. We present a time-frequency masking technique based on DUET for the blind separation of N sources from a four channel B format audio mixture. This method is applicable where sources that are both radially sparse in three dimensions, and exhibit approximate W-disjoint orthogonality, i.e. where the signals are approximately sparse in the time-frequency domain. Using the B-format mixture, we generate a three dimensional power-weighted geodesic histogram and show the peak locations correspond to the direction of arrival of the sources. These peaks are then used to generate a time-frequency mask to separate the sources from the w-channel of the B-format mixture. Experimental separation results using speech signals are presented. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:264 / 268
页数:5
相关论文
共 10 条
[1]  
Bunting O, 2009, 8 EUR C NOIS CONTR E
[2]  
Gerzon M.A., 1975, AUD ENG SOC 50 CONV
[3]  
GERZON MA, 1973, J AUDIO ENG SOC, V21, P2
[4]  
Jourjine A, 2000, INT CONF ACOUST SPEE, P2985, DOI 10.1109/ICASSP.2000.861162
[5]  
Merimaa J., 2004, P 7 INT C DIGITAL AU, P139
[6]  
Pulkki V, 2007, J AUDIO ENG SOC, V55, P503
[7]  
Rickard S, 2002, INT CONF ACOUST SPEE, P529
[8]   DOA estimation of many W-disjoint orthogonal sources from two mixtures using DUET [J].
Rickard, S ;
Dietrich, F .
PROCEEDINGS OF THE TENTH IEEE WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, 2000, :311-314
[9]  
Rickard S., 2001, P ICA, P651
[10]   Blind separation of speech mixtures via time-frequency masking [J].
Yilmaz, Ö ;
Rickard, S .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (07) :1830-1847