A Novel Multiple Sparse Source Localization Using Triangular Pyramid Microphone Array

被引:24
作者
Ren, Mengqi [1 ]
Zou, Yue Xian [1 ]
机构
[1] Peking Univ Shenzhen, Grad Sch, Adv Digital Signal Proc Lab, Shenzhen 518055, Peoples R China
关键词
Inter-sensor phase difference; source localization; time-frequency sparsity; triangular pyramid microphone array;
D O I
10.1109/LSP.2011.2179801
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Making use of the time-frequency spectra sparsity of the speech sources and the spatial and inter-relation information provided from a triangular pyramid microphone array (TPMA), the ratio of the inter-sensor phase difference (RIPD) is defined and a direct relationship between RIPD information and the direction of arrival (DOA) of each source is obtained. A novel multiple speech source localization algorithm (named as TPMA-RIPD) using the histogram clustering technique is proposed, which has been evaluated by several simulation experiments. Experimental results show that the TPMA-RIPD algorithm is able to provide high source localization accuracy in noisy environment for all angles. It is also able to estimate multiple speech sources when the number of sources is larger than that of the microphones used.
引用
收藏
页码:83 / 86
页数:4
相关论文
共 9 条
  • [1] IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS
    ALLEN, JB
    BERKLEY, DA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) : 943 - 950
  • [2] Araki S., 2006, IEEE International Conference on Acoustics, Speech and Signal Processing, P33
  • [3] A Sparsity-Based Approach to 3D Binaural Sound Synthesis Using Time-Frequency Array Processing
    Cobos, Maximo
    Lopez, Jose J.
    Spors, Sascha
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
  • [4] Localization of multiple sound sources with two microphones
    Liu, C
    Wheeler, BC
    O'Brien, WD
    Bilger, RC
    Lansing, CR
    Feng, AS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (04) : 1888 - 1905
  • [5] Mandel MichaelI., 2007, IEEE WORKSHOP APPL S, P275
  • [6] Matsuo M., 2005, P INT WORKSH AC ECH, P129
  • [7] Ren M., 2011, 4 IEEE INT C COMP SC
  • [8] Blind separation of speech mixtures via time-frequency masking
    Yilmaz, Ö
    Rickard, S
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (07) : 1830 - 1847
  • [9] A Two Microphone-Based Approach for Source Localization of Multiple Speech Sources
    Zhang, Wenyi
    Rao, Bhaskar D.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (08): : 1913 - 1928