Sound source localization method based time-domain signal feature using deep learning

Cited by: 5
Authors
Tang, Jun [1]
Sun, Xinmiao [1]
Yan, Lei [2]
Qu, Yang [1]
Wang, Tao [1]
Yue, Yuan [1]
Affiliations
[1] Tianjin Univ, Sch Civil Engn, Tianjin 300072, Peoples R China
[2] China Acad Launch Vehicle Technol, Beijing 100076, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
Sound source localization; Microphone array; Time-domain features; Convolutional neural network;
DOI
10.1016/j.apacoust.2023.109626
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Deep learning, one of the most widely used machine learning approaches, is applied across many fields. In acoustics, deep learning methods are typically combined with frequency-domain features of signals to locate sound sources. Commonly used frequency-domain features include the microphone-array cross-spectral matrix (CSM) and the short-time Fourier transform (STFT). However, frequency-domain features often lose part of the signal information and increase computational complexity. This paper proposes a novel sound source localization algorithm based on time-domain features, which uses a convolutional neural network (CNN) to map time-domain features to sound source locations. The method does not rely on any basic signal processing algorithm and feeds time-domain sampling points directly to the network as inputs for sound source localization. Application simulations show that the proposed method achieves precise localization with low side-lobe effects under different testing conditions. Once network training is complete, the testing accuracy under different conditions is above 95%, with a maximum of 100%.
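As a rough illustration of what "time-domain sampling points as network inputs" could look like, the sketch below builds one multichannel frame of raw samples and a grid-cell label for a simulated source. Everything in it is an assumption made for illustration and not taken from the paper: the free-field point-source model, the sine source signal, the function names `simulate_array_frame` and `nearest_grid_index`, the four-microphone geometry, the sampling rate, and the 3 x 3 scan grid. The CNN itself is omitted; the frame is simply the kind of array such a network might consume.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at roughly 20 degrees Celsius


def simulate_array_frame(source, mics, freq_hz=2000.0, fs=51200.0, n_samples=64):
    """Return one time-domain frame: one row of samples per microphone.

    Hypothetical model: each microphone records a delayed, 1/r-attenuated
    copy of a sine source (free field, no noise, no reflections).
    """
    frame = []
    for mic in mics:
        r = math.dist(source, mic)      # source-to-microphone distance in metres
        delay = r / SPEED_OF_SOUND      # propagation delay in seconds
        row = [
            math.sin(2.0 * math.pi * freq_hz * (n / fs - delay)) / r
            for n in range(n_samples)
        ]
        frame.append(row)
    return frame


def nearest_grid_index(source, grid_points):
    """Return the index of the candidate grid point closest to the source."""
    return min(
        range(len(grid_points)),
        key=lambda i: math.dist(source, grid_points[i]),
    )


# Hypothetical setup: four microphones in the z = 0 plane and a 3 x 3 grid
# of candidate source positions on a scan plane 1 m away.
mics = [(-0.1, 0.0, 0.0), (0.1, 0.0, 0.0), (0.0, 0.1, 0.0), (0.0, -0.1, 0.0)]
grid = [(x * 0.5, y * 0.5, 1.0) for y in (-1, 0, 1) for x in (-1, 0, 1)]
source = (0.5, 0.0, 1.0)

frame = simulate_array_frame(source, mics)   # 4 channels x 64 raw samples
label = nearest_grid_index(source, grid)     # class index a classifier would predict
```

In this sketch the location information is carried entirely by the inter-channel delays and amplitude differences in `frame`, with no Fourier transform or cross-spectral matrix computed beforehand; treating localization as classification over grid cells is likewise an assumption here.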
Pages: 10
Related papers
19 in total
  • [1] A neural network based microphone array approach to grid-less noise source localization
    Castellini, Paolo
    Giulietti, Nicola
    Falcionelli, Nicola
    Dragoni, Aldo Franco
    Chiariotti, Paolo
    [J]. APPLIED ACOUSTICS, 2021, 177
  • [2] A SIMPLE AND EFFICIENT ESTIMATOR FOR HYPERBOLIC LOCATION
    CHAN, YT
    HO, KC
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1994, 42 (08) : 1905 - 1915
  • [3] A Direct Position-Determination Approach for Multiple Sources Based on Neural Network Computation
    Chen, Xin
    Wang, Ding
    Yin, Jiexin
    Wu, Ying
    [J]. SENSORS, 2018, 18 (06)
  • [4] Acoustic beamforming for noise source localization - Reviews, methodology and applications
    Chiariotti, Paolo
    Martarelli, Milena
    Castellini, Paolo
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2019, 120 : 422 - 448
  • [5] Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition
    Dahl, George E.
    Yu, Dong
    Deng, Li
    Acero, Alex
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (01) : 30 - 42
  • [6] A double-step grid-free method for sound source identification using deep learning
    Feng, Luoyi
    Zan, Ming
    Huang, Linsen
    Xu, Zhongming
    [J]. APPLIED ACOUSTICS, 2022, 201
  • [7] Hannun A.Y., 2013, INT C MACHINE LEARNI
  • [8] Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 : 1026 - 1034
  • [9] Kingma Diederik P., 2015, ICLR POSTER
  • [10] GENERALIZED CORRELATION METHOD FOR ESTIMATION OF TIME-DELAY
    KNAPP, CH
    CARTER, GC
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (04) : 320 - 327