Sound Event Localization and Detection Based on Deep Learning

被引:0
|
作者
Zhao, Dada [1 ,2 ]
Ding, Kai [2 ]
Qi, Xiaogang [1 ]
Chen, Yu [2 ]
Feng, Hailin [1 ]
机构
[1] School of Mathematics and Statistics, Xidian University, Xi'an,710071, China
[2] Science and Technology on Near-Surface Detection Laboratory, Wuxi,214035, China
基金
中国国家自然科学基金;
关键词
Deep learning - Neural networks;
D O I
10.23919/JSEE.2023.000110
中图分类号
学科分类号
摘要
Acoustic source localization (ASL) and sound event detection (SED) are two widely pursued independent research fields. In recent years, in order to achieve a more complete spatial and temporal representation of sound field, sound event localization and detection (SELD) has become a very active research topic. This paper presents a deep learning-based multi-overlapping sound event localization and detection algorithm in three-dimensional space. Log-Mel spectrum and generalized cross-correlation spectrum are joined together in channel dimension as input features. These features are classified and regressed in parallel after training by a neural network to obtain sound recognition and localization results respectively. The channel attention mechanism is also introduced in the network to selectively enhance the features containing essential information and suppress the useless features. Finally, a thourough comparison confirms the efficiency and effectiveness of the proposed SELD algorithm. Field experiments show that the proposed algorithm is robust to reverberation and environment and can achieve higher recognition and localization accuracy compared with the baseline method. © 1990-2011 Beijing Institute of Aerospace Information.
引用
收藏
页码:294 / 301
相关论文
共 50 条
  • [1] Sound event localization and detection based on deep learning
    ZHAO Dada
    DING Kai
    QI Xiaogang
    CHEN Yu
    FENG Hailin
    JournalofSystemsEngineeringandElectronics, 2024, 35 (02) : 294 - 301
  • [2] Sound Event Localization and Detection Based on Deep Learning
    Zhao, Dada
    Ding, Kai
    Qi, Xiaogang
    Chen, Yu
    Feng, Hailin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (02) : 294 - 301
  • [3] A survey of Deep Learning for Polyphonic Sound event detection
    Dang, An
    Vu, Toan H.
    Wang, Jia-Ching
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT), 2017, : 75 - 78
  • [4] Filterbank Learning for Deep Neural Network Based Polyphonic Sound Event Detection
    Cakir, Emre
    Ozan, Ezgi Can
    Virtanen, Tuomas
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3399 - 3406
  • [5] Sound Event Localization and Detection Based on Dual Attention
    Xu, Chundong
    Liu, Hao
    Min, Yuan
    Zhen, Yadi
    Computer Engineering and Applications, 2023, 59 (19) : 99 - 105
  • [6] Novel sound event and sound activity detection framework based on intrinsic mode functions and deep learning
    Vahid Hajihashemi
    Abdorreza Alavigharahbagh
    J. J. M. Machado
    João Manuel R. S. Tavares
    Multimedia Tools and Applications, 2025, 84 (14) : 13515 - 13543
  • [7] Sound Event Localization and Detection Based on Multiple DOA Beamforming and Multi-task Learning
    Xue, Wei
    Tong, Ying
    Zhang, Chao
    Ding, Guohong
    He, Xiaodong
    Zhou, Bowen
    INTERSPEECH 2020, 2020, : 5091 - 5095
  • [8] Rare Sound Event Detection Using Deep Learning and Data Augmentation
    Chen, Yanping
    Jin, Hongxia
    INTERSPEECH 2019, 2019, : 619 - 623
  • [9] Abnormal Event Detection Based on Deep Learning
    Wen J.
    Wang H.-J.
    Deng J.
    Liu P.-F.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (02): : 308 - 313
  • [10] Robot Detection and Localization Based on Deep Learning
    Luo, Sha
    Lu, Huimin
    Xiao, Junhao
    Yu, Qinghua
    Zheng, Zhiqiang
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 7091 - 7095