Sound source localization method based time-domain signal feature using deep learning

被引：5

作者：

Tang, Jun ^{[1
]}

Sun, Xinmiao ^{[1
]}

Yan, Lei ^{[2
]}

Qu, Yang ^{[1
]}

Wang, Tao ^{[1
]}

Yue, Yuan ^{[1
]}

机构：

[1] Tianjin Univ, Sch Civil Engn, Tianjin 300072, Peoples R China

[2] China Acad Launch Vehicle Technol, Beijing 100076, Peoples R China

来源：

APPLIED ACOUSTICS | 2023年 / 213卷

基金：

国家重点研发计划;

关键词：

Sound source localization; Microphone array; Time-domain features; Convolutional nerual network;

D O I：

10.1016/j.apacoust.2023.109626

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Deep learning, as the most commonly used machine learning algorithm, is widely used in various fields. In the field of acoustics, deep learning methods are combined with frequency-domain features of signals to locate sound sources. The commonly frequency domain features include microphones array Cross-spectral-Matrix(CSM) and Short Time Fourier Transform(STFT). However, the use of frequency-domain features often leads to the loss of partial signal information and increases the computational complexity. This paper proposed a novel sound source localization algorithm based on time-domain features, which uses convolutional neural network(CNN) as a medium to achieve mapping from time-domain features to sound source locations. This method does not rely on any basic signal processing algorithm, and directly uses time-domain sampling points as network inputs for sound source localization. The application simulation shows that the proposed method can achieve precise localization and low side-lobe effect under different testing conditions. Once the network training is completed, the testing accuracy under different conditions is above 95%, with a maximum of 100%.

引用

页数：10

共 19 条

[1] A neural network based microphone array approach to grid-less noise source localization
Castellini, Paolo
Giulietti, Nicola
Falcionelli, Nicola
Dragoni, Aldo Franco
Chiariotti, Paolo
[J]. APPLIED ACOUSTICS, 2021, 177 (177)
[2] A SIMPLE AND EFFICIENT ESTIMATOR FOR HYPERBOLIC LOCATION
CHAN, YT
HO, KC
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1994, 42 (08) : 1905 - 1915
[3] A Direct Position-Determination Approach for Multiple Sources Based on Neural Network Computation
Chen, Xin
Wang, Ding
Yin, Jiexin
Wu, Ying
[J]. SENSORS, 2018, 18 (06)
[4] Acoustic beamforming for noise source localization - Reviews, methodology and applications
Chiariotti, Paolo
Martarelli, Milena
Castellini, Paolo
[J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2019, 120 : 422 - 448
[5] Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition
Dahl, George E.
Yu, Dong
Deng, Li
Acero, Alex
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (01): : 30 - 42
[6] A double-step grid-free method for sound source identification using deep learning
Feng, Luoyi
Zan, Ming
Huang, Linsen
Xu, Zhongming
[J]. APPLIED ACOUSTICS, 2022, 201
[7] Hannun A.Y., 2013, INT C MACHINE LEARNI
[8] Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1026 - 1034
[9] Kingma Diederik P., 2015, ICLR POSTER
[10] GENERALIZED CORRELATION METHOD FOR ESTIMATION OF TIME-DELAY
KNAPP, CH
CARTER, GC
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (04): : 320 - 327

← 1 2 →