Broadband DOA Estimation Using Sensor Arrays on Complex-Shaped Rigid Bodies

Cited by: 14
Authors
Talagala, Dumidu S. [1 ]
Zhang, Wen [1 ]
Abhayapala, Thushara D. [1 ]
Affiliations
[1] Australian Natl Univ, Appl Signal Proc Grp, Res Sch Engn, Coll Engn & Comp Sci, Canberra, ACT 0200, Australia
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013 / Vol. 21 / Issue 08
Keywords
Arbitrary array; array signal processing; direction of arrival (DOA); head related transfer function (HRTF); MUSIC; source localization; OF-ARRIVAL ESTIMATION; SOURCE LOCALIZATION; SOUND LOCALIZATION; SIGNAL; ESPRIT; AUDIO; CUES;
DOI
10.1109/TASL.2013.2255282
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Sensor arrays mounted on complex-shaped rigid bodies are a common feature in many practical broadband direction of arrival (DOA) estimation applications. The scattering and reflections caused by these rigid bodies introduce complexity and diversity in the frequency domain of the channel transfer function, which presents several challenges to existing broadband DOA estimators. This paper presents a novel high resolution broadband DOA estimation technique based on signal subspace decomposition. We describe how broadband signals can be decomposed into narrow subband components, and combined such that the frequency domain diversity is retained. The DOA estimation performance is compared with existing techniques using a uniform circular array and a sensor array on a hypothetical rigid body. An improvement in closely spaced source resolution of up to 6 dB is observed for the sensor array on the hypothetical rigid body, in comparison to the uniform circular array. The results suggest that frequency domain diversity, introduced by complex-shaped rigid bodies, can provide higher resolution and clearer separation of closely spaced broadband sound sources.
Pages: 1573-1585
Number of pages: 13
Related papers
38 in total
[21]  
Morimoto M., 2003, Acoustical Science and Technology, V24, P267, DOI 10.1250/ast.24.267
[22]  
Oppenheim A.V., 2014, SIGNALS SYSTEMS
[23]   A TUTORIAL ON MPEG AUDIO COMPRESSION [J].
PAN, D .
IEEE MULTIMEDIA, 1995, 2 (02) :60-74
[24]  
Rabiner L. R., 1993, Fundamentals of Speech Recognition
[25]   Identification and localization of sound sources in the median sagittal plane [J].
Rakerd, B ;
Hartmann, WM ;
McCaskey, TL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (05) :2812-2820
[26]   Binaural Source Localization by Joint Estimation of ILD and ITD [J].
Raspaud, Martin ;
Viste, Harald ;
Evangelista, Gianpaolo .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01) :68-77
[27]   ESPRIT - ESTIMATION OF SIGNAL PARAMETERS VIA ROTATIONAL INVARIANCE TECHNIQUES [J].
ROY, R ;
KAILATH, T .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (07) :984-995
[28]   MULTIPLE EMITTER LOCATION AND SIGNAL PARAMETER-ESTIMATION [J].
SCHMIDT, RO .
IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 1986, 34 (03) :276-280
[29]   Tori of confusion: Binaural localization cues for sources within reach of a listener [J].
Shinn-Cunningham, BG ;
Santarelli, S ;
Kopco, N .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 107 (03) :1627-1636
[30]   Acoustic source detection and localization based on wavefield decomposition using circular microphone arrays [J].
Teutsch, Heinz ;
Kellermann, Walter .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05) :2724-2736