CROSS-MODAL 2D-3D LOCALIZATION WITH SINGLE-MODAL QUERY

被引:0
作者
Zhao, Zhipeng [1 ]
Yu, Huai [1 ]
Lyu, Chenwei [1 ]
Ji, Pengliang [2 ]
Yang, Xiangli [3 ]
Yang, Wen [1 ]
机构
[1] Wuhan Univ, Sch Elect Informat, Wuhan 430072, Peoples R China
[2] Beihang Univ, Sch Comp Sci, Beijing 100083, Peoples R China
[3] Chongqing Jiaotong Univ, Sch Informat Sci & Engn, Chongqing 400074, Peoples R China
来源
IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM | 2023年
关键词
Multimodal fusion; global localization; 2D-3D place recognition; spherical image; KITTI360;
D O I
10.1109/IGARSS52108.2023.10282358
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Global visual localization is an important task in geoscience with a plethora of applications such as SLAM and autonomous navigation. Current place recognition approaches restrict the modality of the query data which relies on the database data modality. However, real-world robots are equipped with different sensors in different application scenarios and it is difficult for data from a single fixed modality to accommodate all challenging environments. To overcome this limitation, we propose to build a generalized model that allows spherical images and point clouds to be retrieved under any single-modal query. Our 2D-3D dataset is created based on the KITTI360 dataset with spherical images and corresponding point clouds for training and evaluation. Extensive experimental results demonstrate the effectiveness of our proposed approach.
引用
收藏
页码:6171 / 6174
页数:4
相关论文
共 11 条
[1]  
Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
[2]  
Cattaneo D, 2020, IEEE INT CONF ROBOT, P4365, DOI [10.1109/icra40945.2020.9196859, 10.1109/ICRA40945.2020.9196859]
[3]  
Chechik G, 2010, J MACH LEARN RES, V11, P1109
[4]   SphereNet: Learning Spherical Representations for Detection and Classification in Omnidirectional Images [J].
Coors, Benjamin ;
Condurache, Alexandru Paul ;
Geiger, Andreas .
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 :525-541
[5]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[6]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[7]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
[8]   AdaFusion: Visual-LiDAR Fusion With Adaptive Weights for Place Recognition [J].
Lai, Haowen ;
Yin, Peng ;
Scherer, Sebastian .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) :12038-12045
[9]   KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D [J].
Liao, Yiyi ;
Xie, Jun ;
Geiger, Andreas .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) :3292-3310
[10]   PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition [J].
Uy, Mikaela Angelina ;
Lee, Gim Hee .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4470-4479