Audio-visual based non-line-of-sight sound source localization: A feasibility study

被引:10
|
作者
King, E. A. [1 ]
Tatoglu, A. [2 ]
Iglesias, D. [1 ]
Matriss, A. [1 ]
机构
[1] Univ Hartford, Dept Mech Engn, Acoust Program & Lab, 200 Bloomfield, Hartford, CT 06117 USA
[2] Univ Hartford, Dept Mech Engn, Hartford, CT 06117 USA
关键词
Sound source localization; TDOA; Autonomous vehicles; Safety; Grid mapping; ROBOT; SLAM;
D O I
10.1016/j.apacoust.2020.107674
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper examines the feasibility of using an audio-visual methodology for sound source localization of acoustic sources hidden from direct view. A four channel microphone array is used in conjunction with LiDAR and 2D/3D mapping to merge estimated angles of arrival with room parameters for sound source localization. The time difference of arrival (TDOA) approach is used to estimate the angle(s) of arrival from an unknown sound source and a reverse ray-tracing approach is used to identify a possible source location behind a barrier/obstacle. Using a single-source zone approach based TDOA algorithm, a variety of estimated source directions are identified, each arising from multiple sound sources due to reflections in the room. Results are combined with a three-dimensional map of the space acquired from a ground robot equipped with a 2D SICK LMS LiDAR, and a reverse ray tracing approach is used to triangulate the likely position of the source. Tests were performed in both controlled and uncontrolled environments and the method was capable of finding the hidden source to within 0.5 m accuracy, which is approximately 10% of the length of an automobile. It is proposed that a methodology like this could be used to add sound source localization capabilities on autonomous vehicles to detect the position of emergency/warning sounds in traffic that may be shielded from the direct field of view. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Prompting Segmentation with Sound Is Generalizable Audio-Visual Source Localizer
    Wang, Yaoting
    Liu, Weisong
    Li, Guangyao
    Ding, Jian
    Hu, Di
    Li, Xi
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5669 - 5677
  • [32] Non-Line-of-Sight Footstep Localization for Autonomous Mobile Robots
    Yoshinari, Moeka
    Takahashi, Masaki
    Hioka, Yusuke
    2024 10TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND ROBOTICS ENGINEERING, ICMRE, 2024, : 114 - 119
  • [33] TDOA Fingerprinting for Localization in Non-Line-of-Sight and Multipath Environments
    Schmitz, Johannes
    Schroeder, Florian
    Mathar, Rudolf
    2015 INTERNATIONAL SYMPOSIUM ON ANTENNAS AND PROPAGATION (ISAP), 2015,
  • [34] SIGHT AND SOUND - SENSIBLE AND SENSITIVE USE OF AUDIO-VISUAL AIDS - EDGERTON,MF
    TORKELSON, GM
    MODERN LANGUAGE JOURNAL, 1970, 54 (04): : 278 - 280
  • [35] Span-based Audio-Visual Localization
    Wu, Yiling
    Zhang, Xinfeng
    Wang, Yaowei
    Huang, Qingming
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1252 - 1260
  • [36] Minimizing Indoor Localization Errors for Non-Line-of-Sight Propagation
    Pelka, Mathias
    Bartmann, Peter
    Leugner, Swen
    Hellbrtick, Horst
    2018 8TH INTERNATIONAL CONFERENCE ON LOCALIZATION AND GNSS (ICL-GNSS), 2018,
  • [37] Spatial Multi-Source Information Fusion Localization Algorithm in Non-Line-of-Sight Environments
    Wang, Jianhui
    Li, Jingjing
    Cui, Weijia
    Liu, Qing
    SENSORS, 2023, 23 (07)
  • [38] Non-Line-of-Sight Indoor Localization of Micro Air Vehicles Using First-Arrival Sound
    Mak, Lin Chi
    Furukawa, Tomonari
    INTERNATIONAL JOURNAL OF MICRO AIR VEHICLES, 2010, 2 (04) : 221 - 237
  • [39] Robust Audio-Visual Contrastive Learning for Proposal-Based Self-Supervised Sound Source Localization in Videos
    Xuan, Hanyu
    Wu, Zhiliang
    Yang, Jian
    Jiang, Bo
    Luo, Lei
    Alameda-Pineda, Xavier
    Yan, Yan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (07) : 4896 - 4907
  • [40] Visual interfaces for operation of non-line-of-sight mobile manipulation
    Will, Jeff
    Lynn, Ian
    Moore, Kevin L.
    UNMANNED SYSTEMS TECHNOLOGY X, 2008, 6962