I2D-Loc++: Camera Pose Tracking in LiDAR Maps With Multi-View Motion Flows

被引：0

作者：

Yu, Huai ^{[1
]}

Chen, Kuangyi ^{[2
]}

Yang, Wen ^{[1
]}

Scherer, Sebastian ^{[3
]}

Xia, Gui-Song ^{[4
]}

机构：

[1] Wuhan Univ, Sch Elect Informat, Wuhan 430072, Peoples R China

[2] Graz Univ Technol, Inst Comp G & Vis, A-8010 Graz, Austria

[3] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA

[4] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Cameras; Laser radar; Three-dimensional displays; Optical flow; Location awareness; Image motion analysis; Visualization; Camera localization; lidar maps; 2d-3d correspondence; flow estimation; LOCALIZATION; CALIBRATION; ROBUST; LINE;

D O I：

10.1109/LRA.2024.3440851

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Camera localization in LiDAR maps has become increasingly popular due to its promising ability to handle complex scenarios, surpassing the limitations of visual-only localization methods. However, existing approaches mostly focus on addressing the cross-modal 2D-3D gaps while overlooking the relationship between adjacent image frames, which results in fluctuations and unreliability of camera poses. To alleviate this, we introduce a novel camera pose tracking framework in LiDAR maps by coupling the 2D-3D correspondences with 2D-2D feature matching (I2D-Loc++), which establishes the multi-view geometric constraints to improve localization stability and trajectory smoothness. Specifically, the framework consists of a front-end hybrid flow estimation network and a non-linear least square pose optimization module. We further design a cross-modal consistency loss to integrate the multi-view motion flows for the network training and the back-end pose optimization. The pose tracking model is trained on the KITTI odometry dataset, and tested on the KITTI odometry, Argoverse, Waymo and Lyft5 datasets, which demonstrates that I2D-Loc++ has superior performance and good generalization ability in improving the accuracy and robustness of camera pose tracking.

引用

页码：8162 / 8169

页数：8

共 4 条

[1] I2D-Loc: Camera localization via image to LiDAR depth flow
Chen, Kuangyi
Yu, Huai
Yang, Wen
Yu, Lei
Scherer, Sebastian
Xia, Gui-Song
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 194 : 209 - 221
[2] Simultaneous Multi-View Camera Pose Estimation and Object Tracking With Squared Planar Markers
Sarmadi, Hamid
Munoz-Salinas, Rafael
Berbis, M. A.
Medina-Carnicer, R.
IEEE ACCESS, 2019, 7 : 22927 - 22940
[3] Multi-View Weakly-Supervised 3D Human Pose Estimation for Depth Maps via SoG With Semantic Segmentation Information
Bei, Siyuan
Zhou, Yu
Yu, Yao
Du, Sidan
IEEE ACCESS, 2024, 12 : 149557 - 149568
[4] MulViMotion: Shape-Aware 3D Myocardial Motion Tracking From Multi-View Cardiac MRI
Meng, Qingjie
Qin, Chen
Bai, Wenjia
Liu, Tianrui
de Marvao, Antonio
O'Regan, Declan P.
Rueckert, Daniel
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (08) : 1961 - 1974

← 1 →