Sound source localization using deep learning models

被引:85
|
作者
Yalta N. [1 ]
Nakadai K. [2 ]
Ogata T. [1 ,3 ]
机构
[1] Intermedia Art and Science Department, Waseda University, 3-4-1 Ohkubo, Shinjuku, 169-8555, Tokyo
[2] Honda Research Institute Japan Co., Ltd, Tokyo Institute of Technology, 8-1 Honcho, Wako, 351-0188, Saitama
[3] Faculty of Science and Engineering, Waseda University, 3-4-1 Ohkubo, Shinjuku, 169-8555, Tokyo
关键词
Deep learning; Deep residual networks; Sound source localization;
D O I
10.20965/jrm.2017.p0037
中图分类号
学科分类号
摘要
This study proposes the use of a deep neural network to localize a sound source using an array of microphones in a reverberant environment. During the last few years, applications based on deep neural networks have performed various tasks such as image classification or speech recognition to levels that exceed even human capabilities. In our study, we employ deep residual networks, which have recently shown remarkable performance in image classification tasks even when the training period is shorter than that of other models. Deep residual networks are used to process audio input similar to multiple signal classification (MUSIC) methods. We show that with end-to-end training and generic preprocessing, the performance of deep residual networks not only surpasses the block level accuracy of linear models on nearly clean environments but also shows robustness to challenging conditions by exploiting the time delay on power information. © 2017, Fuji Technology Press. All rights reserved.
引用
收藏
页码:37 / 48
页数:11
相关论文
共 50 条
  • [21] MEG Source Localization via Deep Learning
    Pantazis, Dimitrios
    Adler, Amir
    SENSORS, 2021, 21 (13)
  • [22] Localization of Steady Sound Source and Direction Detection of Moving Sound Source using CNN
    Mane, Shubham S.
    Mali, Swapnil G.
    Mahajan, S. P.
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [23] Sound Source Localization Using a Convolutional Neural Network and Regression Model
    Tan, Tan-Hsu
    Lin, Yu-Tang
    Chang, Yang-Lang
    Alkhaleefah, Mohammad
    SENSORS, 2021, 21 (23)
  • [24] Sound source localization
    Risoud, M.
    Hanson, J. -N.
    Gauvrit, F.
    Renard, C.
    Lemesre, P. -E.
    Bonne, N. -X.
    Vincent, C.
    EUROPEAN ANNALS OF OTORHINOLARYNGOLOGY-HEAD AND NECK DISEASES, 2018, 135 (04) : 259 - 264
  • [25] Sound source localization using a profile fitting method with sound reflectors
    Ichikawa, O
    Takiguchi, T
    Nishimura, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05) : 1138 - 1145
  • [26] Acoustic Source Localization in the Circular Harmonic Domain Using Deep Learning Architecture
    SongGong, Kunkun
    Wang, Wenwu
    Chen, Huawei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2475 - 2491
  • [27] Source Localization Using Distributed Microphones in Reverberant Environments Based on Deep Learning and Ray Space Transform
    Comanducci, Luca
    Borra, Federico
    Bestagini, Paolo
    Antonacci, Fabio
    Tubaro, Stefano
    Sarti, Augusto
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2238 - 2251
  • [28] Learning Multiple Sound Source 2D Localization
    Le Moing, Guillaume
    Vinayavekhin, Phongtharin
    Inoue, Tadanobu
    Vongkulbhisal, Jayakorn
    Munawar, Asim
    Tachibana, Ryuki
    Agravante, Don Joven
    2019 IEEE 21ST INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2019), 2019,
  • [29] Sound source localization via distance metric learning with regularization
    Liu, Mingmin
    Lu, Zhihua
    Wang, Xiaodong
    da Costa, Joao Paulo J.
    Fei, Tai
    SIGNAL PROCESSING, 2025, 227
  • [30] Sound Source Localization Using Piezoelectric Acoustic Metasurfaces
    Jin-Cheng Gu
    Wei Lin
    Cai-Xia Kan
    Acoustics Australia, 2020, 48 : 455 - 461