Generalized State Coherence Transform for Multidimensional TDOA Estimation of Multiple Sources

被引：51

作者：

Nesta, Francesco ^{[1
]}

Omologo, Maurizio ^{[1
]}

机构：

[1] Fdn Bruno Kessler IRST, I-38123 Povo, TN, Italy

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2012年 / 20卷 / 01期

关键词：

Blind source separation (BSS); independent component analysis (ICA); multi-source localization; multidimensional localization; permutation problem; underdetermined source separation; LOCALIZATION; ICA;

D O I：

10.1109/TASL.2011.2160168

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

According to the physical meaning of the frequency-domain blind source separation (FD-BSS), each mixing matrix estimated by independent component analysis (ICA) contains information on the physical acoustic propagation related to each source and then can be used for localization purposes. In this paper, we analyze the Generalized State Coherence Transform (GSCT) which is a non-linear transform of the space represented by the whole demixing matrices. The transform enables an accurate estimation of the propagation time-delay of multiple sources in multiple dimensions. Furthermore, it is shown that with appropriate nonlinearities and a statistical model for the reverberation, GSCT can be considered an approximated kernel density estimator of the acoustic propagation time-delay. Experimental results confirm the good properties of the transform and its effectiveness in addressing multiple source TDOA detection (e.g., 2-D TDOA estimation of several sources with only three microphones).

引用

页码：246 / 260

页数：15

共 32 条

[1]

Ajmera J, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P605

[2]

[Anonymous], P INT WORKSH AC ECH

[3]

[Anonymous], MICROPHONE ARRAYS

[4]

[Anonymous], SPOKEN DIALOGUE COMP

[5]

[Anonymous], P IEEE INT C MULT FU

[6]

[Anonymous], P IEEE INT C AC SPEE

[7]

Araki S, 2009, LECT NOTES COMPUT SC, V5441, P742, DOI 10.1007/978-3-642-00599-2_93

[8]

BRUTTI A, 2006, P INT, P2606

[9] Multiple Source Localization Based on Acoustic Map De-Emphasis [J].

Brutti, Alessio ;

Omologo, Maurizio ;

Svaizer, Piergiorgio .

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,

[10] Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model [J].

Duong, Ngoc Q. K. ;

Vincent, Emmanuel ;

Gribonval, Remi .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07) :1830-1840

← 1 2 3 4 →