Multi-channel nonlinear phase analysis for time frequency data fusion

被引：2

作者：

Mavandadi, S ^{[1
]}

Aarabi, P ^{[1
]}

机构：

[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 3G4, Canada

来源：

MULTISENSOR, MULTISOURCE INFORMATION FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS 2003 | 2003年 / 5099卷

关键词：

delay-of-arrival estimation; time-frequency data fusion; microphone arrays; speech processing;

D O I：

10.1117/12.487298

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A general method for time delay of arrival (TDOA) estimation for time-frequency information fusion is analyzed. This technique, for which the generalized cross correlation method and histogram methods are special cases, results in a low TDOA estimation error And high efficiency in computation. The proposed method relies on a non-linear phase-error selector function, which acts as a reward and punish method for the phase error at each frequency. Three different selector function candidates, consisting of cosine, rectangular, and triangular functions are analyzed using simulations. In the presence of Gaussian noise, the rectangular selector function performs better than the cosine at signal-to-noise ratios (SNRs) higher than 10dB while for lower SNRs the cosine function performs better. With speech noise, the cosine function, which corresponds to the generalized cross correlation technique, has higher anomaly percentages and higher root-mean-square errors than the rectangular function. This suggests, that in general, the rectangular selector function, which can be computed more easily than the cosine selector function, is superior technique to the generalized cross correlation method for. real-time applications.

引用

页码：222 / 231

页数：10

共 18 条

[1]

Aarabi P., 2001, P 5 IEEE WORKSH NONL

[2]

AARABI P, 1999, THESIS U TORONTO

[3]

AARABI P, 2001, P SENS FUS ARCH AL 5

[4]

AARABI P, 2000, P 4 WORLD MULT C CIR

[5]

AARABI P, 2003, IN PRESS INFORMATION

[6]

Backman J., 1993, P IEEE INT C AC SPEE, P125

[7] A PRACTICAL TIME-DELAY ESTIMATOR FOR LOCALIZING SPEECH SOURCES WITH A MICROPHONE ARRAY [J].

BRANDSTEIN, MS ;

ADCOCK, JE ;

SILVERMAN, HF .

COMPUTER SPEECH AND LANGUAGE, 1995, 9 (02) :153-169

[8] A practical methodology for speech source localization with microphone arrays [J].

Brandstein, MS ;

Silverman, HF .

COMPUTER SPEECH AND LANGUAGE, 1997, 11 (02) :91-126

[9]

BRANDSTEIN MS, 1996, P IEEE C AC SPEECH S

[10] An artificial neural network for sound localization using binaural cues [J].

Datum, MS ;

Palmieri, F ;

Moiseff, A .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (01) :372-383

← 1 2 →