Multimodal Features and Accurate Place Recognition With Robust Optimization for Lidar-Visual-Inertial SLAM

Cited by: 8

Authors:

Zhao, Xiongwei [1]

Wen, Congcong [2]

Manoj Prakhya, Sai [3]

Yin, Hongpei [4]

Zhou, Rundong [5]

Sun, Yijiao [1]

Xu, Jie [6]

Bai, Haojie [1]

Wang, Yang [1]

Affiliations:

[1] Harbin Inst Technol Shenzhen, Sch Elect & Informat Engn, Shenzhen 518071, Peoples R China

[2] NYU, Tandon Sch Engn, New York, NY 10012 USA

[3] Huawei Munich Res Ctr, D-80992 Munich, Germany

[4] Guangdong Inst Artificial Intelligence & Adv Comp, Guangzhou 510535, Peoples R China

[5] Harbin Inst Technol, Sch Elect & Informat Engn, Harbin 150001, Peoples R China

[6] Harbin Inst Technol, Sch Mech & Elect Engn, Harbin 150001, Peoples R China

Source:

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2024 / Vol. 73

Keywords:

Laser radar; Simultaneous localization and mapping; Visualization; Feature extraction; Optimization; Robot sensing systems; Three-dimensional displays; 3-D lidar loop closure descriptor; lidar-visual-inertial simultaneous localization and mapping (SLAM) (LVINS); robust iterative optimization; state estimation; two-stage loop detection; LINE SEGMENT DETECTOR; REAL-TIME; DESCRIPTOR;

DOI:

10.1109/TIM.2024.3370762

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Lidar-visual-inertial simultaneous localization and mapping (SLAM) (LVINS) provides a compelling solution for accurate and robust state estimation and mapping by integrating complementary information from multisensor data. However, in the front-end processing of existing LVINS systems, methods based on visual line feature matching typically suffer from low accuracy and are time-consuming. In addition, the back-end optimization of current multisensor fusion SLAM systems is adversely affected by feature association outliers, which constrains further enhancements in localization precision. In the loop closure process, the existing lidar loop closure descriptors, relying primarily on 2-D information from point clouds, often fall short in complex environments. To effectively tackle these challenges, we introduce the multimodal feature-based LVINS framework, abbreviated as MMF-LVINS. Our framework consists of three major innovations. First, we propose a novel coarse-to-fine (CTF) visual line matching method that utilizes geometric descriptor similarity and optical flow verification, substantially improving both the efficiency and the accuracy of line feature matching. Second, we present a robust iterative optimization approach featuring a newly proposed adaptive loss function. This function is tailored based on the quality of feature association and incorporates graduated nonconvexity, thereby reducing the impact of outliers on system accuracy. Third, to augment the precision of lidar-based loop closure detection, we introduce an innovative 3-D lidar descriptor that captures spatial, height, and intensity information from the point cloud. We also propose a two-stage place recognition module that synergistically combines both visual descriptors and this new lidar descriptor, significantly diminishing cumulative drift.
Extensive experimental evaluations on six real-world datasets, including EuRoC, KITTI, NCLT, M2DGR, UrbanNav, and UrbanLoco, demonstrate that our MMF-LVINS system achieves superior state estimation accuracy compared with existing state-of-the-art methods. These experiments also validate the effectiveness of our advanced techniques in visual line matching, robust iterative optimization, and enhanced lidar loop closure detection.
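
The abstract mentions graduated nonconvexity (GNC) as one ingredient of the robust iterative optimization. As a rough illustration of that general idea only, the sketch below applies GNC with a Geman-McClure surrogate to a toy linear least-squares problem. It is not the adaptive loss function proposed in the paper, whose details are not given in this record; the function name `gnc_geman_mcclure`, the inlier scale `c`, the annealing factor `mu_factor`, and the toy line-fitting data are all assumptions made for illustration.

```python
import numpy as np

def gnc_geman_mcclure(A, b, c=1.0, mu_factor=1.4, max_iters=100):
    """Robustly solve A x ~ b using graduated nonconvexity (GNC).

    Hypothetical helper for illustration. The surrogate loss starts
    close to a plain quadratic (large mu) and is annealed toward the
    nonconvex Geman-McClure loss (mu = 1), so gross outliers are
    down-weighted gradually rather than all at once.
    """
    # Start from the ordinary (non-robust) least-squares solution.
    x = np.linalg.lstsq(A, b, rcond=None)[0]
    r2 = (A @ x - b) ** 2
    # Large initial mu makes the surrogate nearly convex.
    mu = max(1.0, 2.0 * r2.max() / c**2)
    w = np.ones(len(b))
    for _ in range(max_iters):
        # Closed-form weight update for the Geman-McClure surrogate.
        w = (mu * c**2 / (r2 + mu * c**2)) ** 2
        # Weighted least squares with the current weights.
        sw = np.sqrt(w)
        x = np.linalg.lstsq(A * sw[:, None], b * sw, rcond=None)[0]
        r2 = (A @ x - b) ** 2
        if mu <= 1.0:
            break
        # Anneal toward the original nonconvex loss.
        mu = max(1.0, mu / mu_factor)
    return x, w

# Toy data (assumed): a line y = 2 t + 1 with every fifth sample corrupted.
t = np.linspace(0.0, 10.0, 50)
A = np.column_stack([t, np.ones_like(t)])
b = 2.0 * t + 1.0 + 0.05 * np.sin(np.arange(50))
b[::5] += 30.0
x_robust, weights = gnc_geman_mcclure(A, b)
print("estimated slope and intercept:", x_robust)
```

In this toy setting the corrupted samples end up with weights near zero, so the final weighted fit is driven by the uncorrupted samples. In a SLAM back end the same reweighting idea would be applied to feature association residuals rather than to a line fit.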


Pages: 16

