Geo-Localization With Transformer-Based 2D-3D Match Network

被引:6
作者
Li, Laijian [1 ]
Ma, Yukai [1 ]
Tang, Kai [1 ]
Zhao, Xiangrui [1 ]
Chen, Chao [1 ]
Huang, Jianxin [1 ]
Mei, Jianbiao [1 ]
Liu, Yong [1 ]
机构
[1] Zhejiang Univ, Inst Cyber Syst & Control, Hangzhou 310012, Peoples R China
基金
中国国家自然科学基金;
关键词
Laser radar; Point cloud compression; Feature extraction; Three-dimensional displays; Satellites; Location awareness; Global Positioning System; Geo-localization; 2D-3D match; SLAM; Index Terms;
D O I
10.1109/LRA.2023.3290526
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This letter presents a novel method for geographical localization by registering satellite maps with LiDAR point clouds. This method includes a Transformer-based 2D-3D matching network called D-GLSNet that directly matches the LiDAR point clouds and satellite images through end-to-end learning. Without the need for feature point detection, D-GLSNet provides accurate pixel-to-point association between the LiDAR point clouds and satellite images. And then, we can easily calculate the horizontal offset $(\Delta x, \Delta y)$ and angular deviation $\Delta \theta _{yaw}$ between them, thereby achieving accurate registration. To demonstrate our network's localization potential, we have designed a Geo-localization Node (GLN) that implements geographical localization and is plug-and-play in the SLAM system. Compared to GPS, GLN is less susceptible to external interference, such as building occlusion. In urban scenarios, our proposed D-GLSNet can output high-quality matching, enabling GLN to function stably and deliver more accurate localization results. Extensive experiments on the KITTI dataset show that our D-GLSNet method achieves a mean Relative Translation Error (RTE) of 1.43 m. Furthermore, our method outperforms state-of-the-art LiDAR-based geospatial localization methods when combined with odometry.
引用
收藏
页码:4855 / 4862
页数:8
相关论文
共 50 条
  • [21] Line-Based 2-D-3-D Registration and Camera Localization in Structured Environments
    Yu, Huai
    Zhen, Weikun
    Yang, Wen
    Scherer, Sebastian
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (11) : 8962 - 8972
  • [22] Mixed 2D-3D Information for Face Recognition
    Tang, Hengliang
    Sun, Yanfeng
    Yin, Baocai
    Ge, Yun
    TRANSACTIONS ON EDUTAINMENT V, 2011, 6530 : 251 - 258
  • [23] TFIENet: Transformer Fusion Information Enhancement Network for Multimodel 3-D Object Detection
    Cao, Feng
    Jin, Yufeng
    Tao, Chongben
    Luo, Xizhao
    Gao, Zhen
    Zhang, Zufeng
    Zheng, Sifa
    Zhu, Yuan
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [24] Dynamic Scene Understanding for Autonomous Driving Using 2D-3D Convolution With Voxel Key Points
    Liu, Kunhua
    Zheng, Yi
    Xie, Junkun
    Xie, Yuting
    Wang, Feiyang
    Ma, Longyan
    Dai, Chenggang
    Lu, Tao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024,
  • [25] A Simplified 2D-3D CNN Architecture for Hyperspectral Image Classification Based on Spatial-Spectral Fusion
    Yu, Chunyan
    Han, Rei
    Song, Meiping
    Liu, Caiyu
    Chang, Chein-, I
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 2485 - 2501
  • [26] F3-Net: Multiview Scene Matching for Drone-Based Geo-Localization
    Sun, Bo
    Liu, Ganchao
    Yuan, Yuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [27] MFFTNet: A Novel 3D Point Cloud Segmentation Network Based on Multi-Scale Feature Fusion and Transformer Architecture
    Bai, Hao
    Li, Xiongwei
    Meng, Qing
    Zhuo, Shulong
    Yan, Lili
    IEEE ACCESS, 2025, 13 : 9462 - 9472
  • [28] AMANet: Adaptive Multi-Path Aggregation for Learning Human 2D-3D Correspondences
    Wang, Xuanhan
    Guo, Yuyu
    Song, Jingkuan
    Gao, Lianli
    Shen, Heng Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 979 - 992
  • [29] 2D-3D Face Recognition via Restricted Boltzmann Machines
    Wang, Xiaolong
    Ly, Vincent
    Guo, Rui
    Kambhamettu, Chandra
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 574 - 580
  • [30] 3DCTN: 3D Convolution-Transformer Network for Point Cloud Classification
    Lu, Dening
    Xie, Qian
    Gao, Kyle
    Xu, Linlin
    Li, Jonathan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 24854 - 24865