Performance of phase transform for detecting sound sources with microphone arrays in reverberant and noisy environments

被引:32
作者
Donohue, Kevin D. [1 ]
Hannemann, Jens [1 ]
Dietz, Henry G. [1 ]
机构
[1] Univ Kentucky, Ctr Visualizat & Virtual Environm, Lexington, KY 40507 USA
基金
美国国家科学基金会;
关键词
phase transform; microphone array; SOURCE LOCALIZATION;
D O I
10.1016/j.sigpro.2007.01.013
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The performance of sound source location (SSL) algorithms with microphone arrays can be enhanced by processing signals prior to the delay and sum operation. The phase transform (PHAT) has been shown to improve SSL images, especially in reverberant environments. This paper introduces a modification, referred to as the PHAT-beta transform, that varies the degree of spectral magnitude information used by the transform through a single parameter. Performance results are computed using a Monte Carlo simulation of an eight element perimeter array with a receiver operating characteristic (ROC) analysis for detecting single and multiple sound sources. In addition, a Fisher's criterion performance measure is also computed for target and noise peak separability and compared to the ROC results. Results show that the standard PHAT significantly improves detection performance for broadband signals especially in high levels of reverberation noise, and to a lesser degree for noise from other coherent sources. For narrowband targets the PHAT typically results in significant performance degradation; however, the PHAT-beta can achieve performance improvements for both narrowband and broadband signals. Finally, the performance for real speech signal samples is examined and shown to exhibit properties similar to both the simulated broad and narrowband cases, suggesting the use of beta values between 0.5 and 0.7 for array applications with general signals. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1677 / 1691
页数:15
相关论文
共 20 条