I2D-Loc++: Camera Pose Tracking in LiDAR Maps With Multi-View Motion Flows

被引:0
|
作者
Yu, Huai [1 ]
Chen, Kuangyi [2 ]
Yang, Wen [1 ]
Scherer, Sebastian [3 ]
Xia, Gui-Song [4 ]
机构
[1] Wuhan Univ, Sch Elect Informat, Wuhan 430072, Peoples R China
[2] Graz Univ Technol, Inst Comp G & Vis, A-8010 Graz, Austria
[3] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
[4] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 09期
基金
中国国家自然科学基金;
关键词
Cameras; Laser radar; Three-dimensional displays; Optical flow; Location awareness; Image motion analysis; Visualization; Camera localization; lidar maps; 2d-3d correspondence; flow estimation; LOCALIZATION; CALIBRATION; ROBUST; LINE;
D O I
10.1109/LRA.2024.3440851
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Camera localization in LiDAR maps has become increasingly popular due to its promising ability to handle complex scenarios, surpassing the limitations of visual-only localization methods. However, existing approaches mostly focus on addressing the cross-modal 2D-3D gaps while overlooking the relationship between adjacent image frames, which results in fluctuations and unreliability of camera poses. To alleviate this, we introduce a novel camera pose tracking framework in LiDAR maps by coupling the 2D-3D correspondences with 2D-2D feature matching (I2D-Loc++), which establishes the multi-view geometric constraints to improve localization stability and trajectory smoothness. Specifically, the framework consists of a front-end hybrid flow estimation network and a non-linear least square pose optimization module. We further design a cross-modal consistency loss to integrate the multi-view motion flows for the network training and the back-end pose optimization. The pose tracking model is trained on the KITTI odometry dataset, and tested on the KITTI odometry, Argoverse, Waymo and Lyft5 datasets, which demonstrates that I2D-Loc++ has superior performance and good generalization ability in improving the accuracy and robustness of camera pose tracking.
引用
收藏
页码:8162 / 8169
页数:8
相关论文
共 4 条
  • [1] I2D-Loc: Camera localization via image to LiDAR depth flow
    Chen, Kuangyi
    Yu, Huai
    Yang, Wen
    Yu, Lei
    Scherer, Sebastian
    Xia, Gui-Song
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 194 : 209 - 221
  • [2] Simultaneous Multi-View Camera Pose Estimation and Object Tracking With Squared Planar Markers
    Sarmadi, Hamid
    Munoz-Salinas, Rafael
    Berbis, M. A.
    Medina-Carnicer, R.
    IEEE ACCESS, 2019, 7 : 22927 - 22940
  • [3] Multi-View Weakly-Supervised 3D Human Pose Estimation for Depth Maps via SoG With Semantic Segmentation Information
    Bei, Siyuan
    Zhou, Yu
    Yu, Yao
    Du, Sidan
    IEEE ACCESS, 2024, 12 : 149557 - 149568
  • [4] MulViMotion: Shape-Aware 3D Myocardial Motion Tracking From Multi-View Cardiac MRI
    Meng, Qingjie
    Qin, Chen
    Bai, Wenjia
    Liu, Tianrui
    de Marvao, Antonio
    O'Regan, Declan P.
    Rueckert, Daniel
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (08) : 1961 - 1974