HI-SLAM: Hierarchical implicit neural representation for SLAM

被引：0

作者：

Li, Jingbo ^{[1
]}

Firkat, Eksan ^{[4
,5
]}

Zhu, Jingyu ^{[3
]}

Zhu, Bin ^{[2
]}

Zhu, Jihong ^{[3
]}

Hamdulla, Askar ^{[1
]}

机构：

[1] Xinjiang Univ, Sch Informat Sci & Engn, 666 Shengli Rd, Urumqi, Xinjiang, Peoples R China

[2] Tsinghua Univ, Dept Automat, 33 Shuangqing Rd, Beijing, Peoples R China

[3] Tsinghua Univ, Dept Precis Instrument, 33 Shuangqing Rd, Beijing, Peoples R China

[4] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China

[5] Great Bay Univ, Dongguan, Guangdong, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2025年 / 271卷

关键词：

Dense visual SLAM; Neural implicit representations; Localization; RGB-D camera; FEATURE FUSION; VERSATILE;

D O I：

10.1016/j.eswa.2025.126487

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Implicit neural representation can improve the expressive ability and performance of the model by learning the representation of high-dimensional feature space and has a wide range of applications in many fields and an exciting performance. Dense visual SLAM is one of the beneficiaries of the development of implicit neural representations. Still, the current methods are based on simple fully connected network architectures, resulting in poor generalization ability, insufficient real-time performance and inability to balance global and local optimization. This paper propose a hierarchical scene representation that treats color information and geometric information as equally important, one that encodes geometric and color information into different resolution grid sizes and combines multiple corresponding multi-layer perceptron decoders. The coarse-level grid captures the general shape and structure of the global scene and makes reasonable predictions for unobserved regions.In contrast, the medium-fine-level grid finely represents geometric details and color information. Rich and comprehensive high-fidelity reconstructions can be obtained in large-scale scenes by using meshes of different resolutions to encode geometric and color information. In this study, selectable keyframes are used to ensure that the local information of the scene is optimized while reducing redundant information preservation. Compared with recent dense visual SLAM systems via implicit neural representations, our method generalizes and operates more robustly, efficiently, and precisely in large-scale scenes.

引用

页数：10

共 35 条

[21] Hierarchical Multi-Level Information Fusion for Robust and Consistent Visual SLAM [J].

Yu, Jingrui ;

Xiang, ZhenZhen ;

Su, Jianbo .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) :250-259

[22] Improve generalization for neural visual-SLAM with Bayes online learning [J].

Liu, Jun ;

Deng, Haihang .

ROBOTICA, 2025,

[23] A New Hybrid Metric Map Representation by Using Voronoi Diagram and its Application to SLAM [J].

Guo, Shuai ;

Ma, Shugen ;

Li, Bin ;

Wang, Minghui ;

Wang, Yuechao .

PROCEEDING OF THE IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2012, :400-405

[24] Loop Closure Detection for Visual SLAM Using Simplified Convolution Neural Network [J].

Xu, Bingbing ;

Yang, Jinfu ;

Li, Mingai ;

Wu, Suishuo ;

Shan, Yi .

ADVANCES IN HARMONY SEARCH, SOFT COMPUTING AND APPLICATIONS, 2020, 1063 :54-62

[25] LRSLAM: Low-Rank Representation of Signed Distance Fields in Dense Visual SLAM System [J].

Park, Hongbeen ;

Park, Minjeong ;

Nam, Giljoo ;

Kim, Jinkyu .

COMPUTER VISION - ECCV 2024, PT LXXX, 2025, 15138 :225-240

[26] Continuous Pose for Monocular Cameras in Neural Implicit Representation [J].

Ma, Qi ;

Paudel, Danda Pani ;

Chhatkuli, Ajad ;

Van Gool, Luc .

2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, :5291-5301

[27] 3D EKF-SLAM Using Cylindrical and Planar Features With Multilevel Mapping Representation [J].

Mariga, Leonardo ;

Nascimento Junior, Cairo Lucio ;

Barros dos Santos, Sergio Ronaldo .

18TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON 2024, 2024,

[28] Loop closure detection using supervised and unsupervised deep neural networks for monocular SLAM systems [J].

Memon, Azam Rafique ;

Wang, Hesheng ;

Hussain, Abid .

ROBOTICS AND AUTONOMOUS SYSTEMS, 2020, 126

[29] Neural Network-Based Recent Research Developments in SLAM for Autonomous Ground Vehicles: A Review [J].

Saleem, Hajira ;

Malekian, Reza ;

Munir, Hussan .

IEEE SENSORS JOURNAL, 2023, 23 (13) :13829-13858

[30] Implicit neural representation model for camera relocalization in multiple scenes [J].

Yao, Shun ;

Cheng, Yongmei ;

Yang, Fei ;

Mozerov, Mikhail G. .

PATTERN RECOGNITION, 2025, 168

← 1 2 3 4 →