Learning based method for near field acoustic range estimation in spherical harmonics domain using intensity vectors

被引:2
作者
Dwivedi, Priyadarshini [1 ]
Routray, Gyanajyoti [1 ]
Hegde, Rajesh M. [1 ]
机构
[1] Indian Inst Technol Kanpur, Dept Elect Engn, Kanpur, India
关键词
Spherical harmonics; Near-field; Range estimation; Spherical harmonics intensity; SOURCE LOCALIZATION; DECOMPOSITION; ALGORITHM;
D O I
10.1016/j.patrec.2022.11.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Near-field acoustic range estimation is considered one of the least explored research problems in digital signal processing under noise and reverberant conditions. This letter develops a new learning-based range estimation technique utilizing the spherical harmonics intensity (SH-INT) coefficients. The conventional range estimation in the spherical harmonics (SH) domain relies on the pressure coefficients. However, at high frequencies, these coefficients of different order and range overlap and hinder the accuracy of range estimation. On the contrary, the SH-INT coefficients are well distinguished at high frequencies for various orders and ranges, making these features favorable for accurate range estimation using learning algorithms. Since the SH-INT coefficients in the radial direction are independent of the source signal and vary with range, a convolutional neural network (CNN) model has been adopted to map the SH-INT coefficients with the range classes. The performance of the proposed spherical harmonic intensity (SH-INT) features in the context of near-field range estimation is validated by conducting exhaustive experiments on simulated and real data. Further, the error in near-field source range estimates is characterized using root mean square error (RMSE) criteria. The results are impactful and encourage the use of this method for practical near-field source range estimation applications.
引用
收藏
页码:17 / 24
页数:8
相关论文
共 25 条
  • [1] Brandstein MS, 1997, INT CONF ACOUST SPEE, P375, DOI 10.1109/ICASSP.1997.599651
  • [2] QUATERNION NEURAL NETWORKS FOR 3D SOUND SOURCE LOCALIZATION IN REVERBERANT ENVIRONMENTS
    Celsi, Michela Ricciardi
    Scardapane, Simone
    Comminiello, Danilo
    [J]. PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [3] Chakrabarty S., 2017, BROADBAND DOA ESTIMA
  • [4] The overlap integral of three associated Legendre polynomials
    Dong, SH
    Lemus, R
    [J]. APPLIED MATHEMATICS LETTERS, 2002, 15 (05) : 541 - 546
  • [5] The nearfieeld spherical nucrophone array
    Fisher, Etan
    Rafaely, Boaz
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5272 - 5275
  • [6] Near-Field Spherical Microphone Array Processing With Radial Filtering
    Fisher, Etan
    Rafaely, Boaz
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 256 - 265
  • [7] Adaptive eigenvalue decomposition algorithm for realtime acoustic source localization system
    Huang, YT
    Benesty, J
    Elko, GW
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 937 - 940
  • [8] Rigid sphere room impulse response simulation: Algorithm and applications
    Jarrett, D. P.
    Habets, E. A. P.
    Thomas, M. R. P.
    Naylor, P. A.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (03) : 1462 - 1472
  • [9] Jarrett DP, 2010, EUR SIGNAL PR CONF, P442
  • [10] Johansson A, 2004, TENCON IEEE REGION, pB629