ALCDNet: Loop Closure Detection Based on Acoustic Echoes

被引:0
作者
Liu, Guangyao [1 ,2 ]
Cui, Weimeng [1 ,2 ]
Jia, Naizheng [1 ,2 ]
Xi, Yuzhang [1 ,2 ]
Li, Shuyu [1 ,2 ]
Wang, Zhi [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab Ind Control Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Huzhou Inst, Huzhou 313000, Peoples R China
关键词
Feature extraction; Robots; Liquid crystal displays; Acoustics; Accuracy; Lighting; Laser radar; Ground penetrating radar; Geophysical measurement techniques; Interference; Acoustic; loop closure detection (LCD); PLACE RECOGNITION;
D O I
10.1109/LRA.2024.3519906
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Loop closure detection is a critical component of simultaneous localization and mapping (SLAM) systems, essential for mitigating the drift that accumulates over time. Traditional approaches utilizing light detection and ranging (LiDAR) and cameras have been developed to address this challenge. However, these methods can be ineffective when there is a lack of visual cues, such as smoke, poor lighting conditions, and textureless environments. In this letter, we propose an efficient loop closure detection method that employs a speaker and microphone array to gather spatial structure information. First, our method uses a microphone array to capture echoes from finely designed signals emitted by the speaker. Second, we apply momentum contrastive learning (MoCo) to train an echo feature encoder to learn the implicit spatial features embedded in the echo signals. Finally, loop closure detection is performed by computing the cosine similarity of features output by the encoding network from echo information at different locations. Experiments conducted in typical indoor environments demonstrate that our method outperforms vision-based methods in most cases and can still achieve accurate loop closure detection in smoky environments where both LiDAR and vision-based methods fail. This makes it a viable and cost-effective complementary solution in environments with sparse texture features, unstable lighting conditions or smoke.
引用
收藏
页码:1473 / 1480
页数:8
相关论文
共 32 条
[1]  
[Anonymous], 2011, MOBISYS 11
[2]   SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences [J].
Behley, Jens ;
Garbade, Martin ;
Milioto, Andres ;
Quenzel, Jan ;
Behnke, Sven ;
Stachniss, Cyrill ;
Gall, Juergen .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9296-9306
[3]   Rethinking Visual Geo-localization for Large-Scale Applications [J].
Berton, Gabriele ;
Masone, Carlo ;
Caputo, Barbara .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :4868-4878
[4]   LCDNet: Deep Loop Closure Detection and Point Cloud Registration for LiDAR SLAM [J].
Cattaneo, Daniele ;
Vaghi, Matteo ;
Valada, Abhinav .
IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (04) :2074-2093
[5]  
Christensen JH, 2020, IEEE INT CONF ROBOT, P1581, DOI [10.1109/icra40945.2020.9196934, 10.1109/ICRA40945.2020.9196934]
[6]   FAB-MAP: Probabilistic localization and mapping in the space of appearance [J].
Cummins, Mark ;
Newman, Paul .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (06) :647-665
[7]  
Cuturi M, 2017, PR MACH LEARN RES, V70
[8]  
Di Carlo D, 2020, INT CONF ACOUST SPEE, P156, DOI [10.1109/icassp40776.2020.9054647, 10.1109/ICASSP40776.2020.9054647]
[9]   Bags of Binary Words for Fast Place Recognition in Image Sequences [J].
Galvez-Lopez, Dorian ;
Tardos, Juan D. .
IEEE TRANSACTIONS ON ROBOTICS, 2012, 28 (05) :1188-1197
[10]   Momentum Contrast for Unsupervised Visual Representation Learning [J].
He, Kaiming ;
Fan, Haoqi ;
Wu, Yuxin ;
Xie, Saining ;
Girshick, Ross .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9726-9735