Sound source localization using deep learning models

被引：85

作者：

Yalta N. ^{[1
]}

Nakadai K. ^{[2
]}

Ogata T. ^{[1
,3
]}

机构：

[1] Intermedia Art and Science Department, Waseda University, 3-4-1 Ohkubo, Shinjuku, 169-8555, Tokyo

[2] Honda Research Institute Japan Co., Ltd, Tokyo Institute of Technology, 8-1 Honcho, Wako, 351-0188, Saitama

[3] Faculty of Science and Engineering, Waseda University, 3-4-1 Ohkubo, Shinjuku, 169-8555, Tokyo

来源：

| 2017年 / Fuji Technology Press卷 / 29期

关键词：

Deep learning; Deep residual networks; Sound source localization;

D O I：

10.20965/jrm.2017.p0037

中图分类号：

学科分类号：

摘要：

This study proposes the use of a deep neural network to localize a sound source using an array of microphones in a reverberant environment. During the last few years, applications based on deep neural networks have performed various tasks such as image classification or speech recognition to levels that exceed even human capabilities. In our study, we employ deep residual networks, which have recently shown remarkable performance in image classification tasks even when the training period is shorter than that of other models. Deep residual networks are used to process audio input similar to multiple signal classification (MUSIC) methods. We show that with end-to-end training and generic preprocessing, the performance of deep residual networks not only surpasses the block level accuracy of linear models on nearly clean environments but also shows robustness to challenging conditions by exploiting the time delay on power information. © 2017, Fuji Technology Press. All rights reserved.

引用

页码：37 / 48

页数：11

共 50 条

[21] MEG Source Localization via Deep Learning
Pantazis, Dimitrios
Adler, Amir
SENSORS, 2021, 21 (13)
[22] Localization of Steady Sound Source and Direction Detection of Moving Sound Source using CNN
Mane, Shubham S.
Mali, Swapnil G.
Mahajan, S. P.
2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
[23] Sound Source Localization Using a Convolutional Neural Network and Regression Model
Tan, Tan-Hsu
Lin, Yu-Tang
Chang, Yang-Lang
Alkhaleefah, Mohammad
SENSORS, 2021, 21 (23)
[24] Sound source localization
Risoud, M.
Hanson, J. -N.
Gauvrit, F.
Renard, C.
Lemesre, P. -E.
Bonne, N. -X.
Vincent, C.
EUROPEAN ANNALS OF OTORHINOLARYNGOLOGY-HEAD AND NECK DISEASES, 2018, 135 (04) : 259 - 264
[25] Sound source localization using a profile fitting method with sound reflectors
Ichikawa, O
Takiguchi, T
Nishimura, M
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05) : 1138 - 1145
[26] Acoustic Source Localization in the Circular Harmonic Domain Using Deep Learning Architecture
SongGong, Kunkun
Wang, Wenwu
Chen, Huawei
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2475 - 2491
[27] Source Localization Using Distributed Microphones in Reverberant Environments Based on Deep Learning and Ray Space Transform
Comanducci, Luca
Borra, Federico
Bestagini, Paolo
Antonacci, Fabio
Tubaro, Stefano
Sarti, Augusto
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2238 - 2251
[28] Learning Multiple Sound Source 2D Localization
Le Moing, Guillaume
Vinayavekhin, Phongtharin
Inoue, Tadanobu
Vongkulbhisal, Jayakorn
Munawar, Asim
Tachibana, Ryuki
Agravante, Don Joven
2019 IEEE 21ST INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2019), 2019,
[29] Sound source localization via distance metric learning with regularization
Liu, Mingmin
Lu, Zhihua
Wang, Xiaodong
da Costa, Joao Paulo J.
Fei, Tai
SIGNAL PROCESSING, 2025, 227
[30] Sound Source Localization Using Piezoelectric Acoustic Metasurfaces
Jin-Cheng Gu
Wei Lin
Cai-Xia Kan
Acoustics Australia, 2020, 48 : 455 - 461

← 1 2 3 4 5 →