PSD Estimation and Source Separation in a Noisy Reverberant Environment Using a Spherical Microphone Array

被引:21
作者
Fahim, Abdullah [1 ]
Samarasinghe, Prasanga N. [1 ]
Abhayapala, Thushara D. [1 ]
机构
[1] Australian Natl Univ, Coll Engn & Comp Sci, Res Sch Informat Sci & Engn, Canberra, ACT 2601, Australia
基金
澳大利亚研究理事会;
关键词
Noise suppression; power spectral density; source separation; speech dereverberation; spherical microphone array; SPATIAL CORRELATION; SPEECH ENHANCEMENT; WIENER FILTER; FIELDS;
D O I
10.1109/TASLP.2018.2835723
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose an efficient technique for estimating individual power spectral density (PSD) components, i.e., PSD of each desired sound source as well as of noise and reverberation, in a multisource reverberant sound scene with coherent background noise. We formulate the problem in the spherical harmonics domain to take the advantage of the inherent orthogonality of the spherical harmonics basis functions and extract the PSD components from the cross-correlation between the different sound field modes. We also investigate an implementation issue that occurs at the nulls of the Bessel functions and offer an engineering solution. The performance evaluation takes place in a practical environment with a commercial microphone array in order to (m)easure the robustness of the proposed algorithm against all the deviations incurred in practice. We also exhibit an application of the proposed PSD estimator through a source septation algorithm and compare the performance with a contemporary method in terms of different objective measures.
引用
收藏
页码:1594 / 1607
页数:14
相关论文
共 42 条
  • [1] Abhayapala T.D., 1999, Modal analysis and synthesis of broadband nearfield beamforming arrays
  • [2] Abhayapala TD, 2002, INT CONF ACOUST SPEE, P1949
  • [3] Spherical Harmonic Analysis of Wavefields Using Multiple Circular Sensor Arrays
    Abhayapala, Thushara D.
    Gupta, Aastha
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1655 - 1666
  • [4] [Anonymous], P INT WORKSH AC SIGN
  • [5] [Anonymous], 2012, Inverse acoustic and electromagnetic scattering theory
  • [6] [Anonymous], ACOUST SPEECH SIG PR
  • [7] [Anonymous], 2013, PROC IEEE INT C COMP
  • [8] Beh J, 2014, 2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), P47, DOI 10.1109/HSCMA.2014.6843249
  • [9] Benesty J, 2005, SIG COM TEC, P9, DOI 10.1007/3-540-27489-8_2
  • [10] Bourgeois J., 2010, Time-domain beamforming and blind source separation