Cross transformer for LiDAR-based loop closure detection

被引:0
作者
Zheng, Rui [1 ]
Ren, Yang [1 ]
Zhou, Qi [2 ]
Ye, Yibin [1 ]
Zeng, Hui [1 ,3 ]
机构
[1] Univ Sci & Technol Beijing, Beijing Engn Res Ctr Ind Spectrum Imaging, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[2] Harbin Engn Univ, Southampton Ocean Engn Joint Inst, Harbin 150001, Peoples R China
[3] Univ Sci & Technol Beijing, Shunde Innovat Sch, Foshan 528399, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Loop closure; LiDAR; SLAM; Transformer; PLACE RECOGNITION; LOCALIZATION; DESCRIPTOR; VISION;
D O I
10.1007/s00138-024-01629-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Loop closure detection, also known as place recognition, a key component of simultaneous localization and mapping (SLAM) systems, aims to recognize previously visited locations and reduce the accumulated drift error caused by odometry. Current vision-based methods are susceptible to variations in illumination and perspective, limiting their generalization ability and robustness. Thus, in this paper, we propose CrossT-Net (Cross Transformer Net), a novel cross-attention based loop closure detection network for LiDAR. CrossT-Net directly estimates the similarity between two frames by leveraging multi-class information maps, including range, intensity, and normal maps, to comprehensively characterize environmental features. A Siamese Encoder Net with shared parameters extracts frame features, and a Cross Transformer module captures intra-frame context and inter-frame correlations through self-attention and cross-attention mechanisms. In the final stage, an Overlap Estimation Module predicts the point cloud overlap between two frames. Experimental results on several benchmark datasets demonstrate that our proposed method outperforms existing methods in precision and recall, and exhibits strong generalization performance in different road environments. The implementation of our approach is available at: https://github.com/Bryan-ZhengRui/CrossT-Net_Pytorch.
引用
收藏
页数:15
相关论文
共 45 条
[31]  
Tombari F., 2010, P ACM WORKSH 3D OBJ, P57, DOI DOI 10.1145/1877808.1877821
[32]   PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition [J].
Uy, Mikaela Angelina ;
Lee, Gim Hee .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4470-4479
[33]  
Wang H, 2020, IEEE INT CONF ROBOT, P2095, DOI [10.1109/ICRA40945.2020.9196764, 10.1109/icra40945.2020.9196764]
[34]   LiDAR Iris for Loop-Closure Detection [J].
Wang, Ying ;
Sun, Zezhou ;
Xu, Cheng-Zhong ;
Sarma, Sanjay E. ;
Yang, Jian ;
Kong, Hui .
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, :5769-5775
[35]   MinkLoc3D: Point Cloud Based Large-Scale Place Recognition [J].
Warsaw, Jacek Komorowski .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :1789-1798
[36]   Text2Loc: 3D Point Cloud Localization from Natural Language [J].
Xia, Yan ;
Shi, Letian ;
Ding, Zifeng ;
Henriques, Joao F. ;
Cremers, Daniel .
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, :14958-14967
[37]   CASSPR: Cross Attention Single Scan Place Recognition [J].
Xia, Yan ;
Gladkova, Mariia ;
Wang, Rui ;
Li, Qianyun ;
Stilla, Uwe ;
Henriques, Joao F. ;
Cremers, Daniel .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :8427-8438
[38]   SOE-Net: A Self-Attention and Orientation Encoding Network for Point Cloud based Place Recognition [J].
Xia, Yan ;
Xu, Yusheng ;
Li, Shuang ;
Wang, Rui ;
Du, Juan ;
Cremers, Daniel ;
Stilla, Uwe .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :11343-11352
[39]   ASFM-Net: Asymmetrical Siamese Feature Matching Network for Point Completion [J].
Xia, Yaqi ;
Xia, Yan ;
Li, Wei ;
Song, Rui ;
Cao, Kailang ;
Stilla, Uwe .
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :1938-1947
[40]  
Yin H, 2018, IEEE INT VEH SYM, P728, DOI 10.1109/IVS.2018.8500682