HI-SLAM: Hierarchical implicit neural representation for SLAM

被引:0
作者
Li, Jingbo [1 ]
Firkat, Eksan [4 ,5 ]
Zhu, Jingyu [3 ]
Zhu, Bin [2 ]
Zhu, Jihong [3 ]
Hamdulla, Askar [1 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, 666 Shengli Rd, Urumqi, Xinjiang, Peoples R China
[2] Tsinghua Univ, Dept Automat, 33 Shuangqing Rd, Beijing, Peoples R China
[3] Tsinghua Univ, Dept Precis Instrument, 33 Shuangqing Rd, Beijing, Peoples R China
[4] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[5] Great Bay Univ, Dongguan, Guangdong, Peoples R China
关键词
Dense visual SLAM; Neural implicit representations; Localization; RGB-D camera; FEATURE FUSION; VERSATILE;
D O I
10.1016/j.eswa.2025.126487
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Implicit neural representation can improve the expressive ability and performance of the model by learning the representation of high-dimensional feature space and has a wide range of applications in many fields and an exciting performance. Dense visual SLAM is one of the beneficiaries of the development of implicit neural representations. Still, the current methods are based on simple fully connected network architectures, resulting in poor generalization ability, insufficient real-time performance and inability to balance global and local optimization. This paper propose a hierarchical scene representation that treats color information and geometric information as equally important, one that encodes geometric and color information into different resolution grid sizes and combines multiple corresponding multi-layer perceptron decoders. The coarse-level grid captures the general shape and structure of the global scene and makes reasonable predictions for unobserved regions.In contrast, the medium-fine-level grid finely represents geometric details and color information. Rich and comprehensive high-fidelity reconstructions can be obtained in large-scale scenes by using meshes of different resolutions to encode geometric and color information. In this study, selectable keyframes are used to ensure that the local information of the scene is optimized while reducing redundant information preservation. Compared with recent dense visual SLAM systems via implicit neural representations, our method generalizes and operates more robustly, efficiently, and precisely in large-scale scenes.
引用
收藏
页数:10
相关论文
共 30 条
  • [21] Loop Closure Detection for Visual SLAM Using Simplified Convolution Neural Network
    Xu, Bingbing
    Yang, Jinfu
    Li, Mingai
    Wu, Suishuo
    Shan, Yi
    ADVANCES IN HARMONY SEARCH, SOFT COMPUTING AND APPLICATIONS, 2020, 1063 : 54 - 62
  • [22] LRSLAM: Low-Rank Representation of Signed Distance Fields in Dense Visual SLAM System
    Park, Hongbeen
    Park, Minjeong
    Nam, Giljoo
    Kim, Jinkyu
    COMPUTER VISION - ECCV 2024, PT LXXX, 2025, 15138 : 225 - 240
  • [23] Continuous Pose for Monocular Cameras in Neural Implicit Representation
    Ma, Qi
    Paudel, Danda Pani
    Chhatkuli, Ajad
    Van Gool, Luc
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 5291 - 5301
  • [24] 3D EKF-SLAM Using Cylindrical and Planar Features With Multilevel Mapping Representation
    Mariga, Leonardo
    Nascimento Junior, Cairo Lucio
    Barros dos Santos, Sergio Ronaldo
    18TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON 2024, 2024,
  • [25] Loop closure detection using supervised and unsupervised deep neural networks for monocular SLAM systems
    Memon, Azam Rafique
    Wang, Hesheng
    Hussain, Abid
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2020, 126
  • [26] Neural Network-Based Recent Research Developments in SLAM for Autonomous Ground Vehicles: A Review
    Saleem, Hajira
    Malekian, Reza
    Munir, Hussan
    IEEE SENSORS JOURNAL, 2023, 23 (13) : 13829 - 13858
  • [27] NeurNCD: Novel Class Discovery via Implicit Neural Representation
    Wang, Junming
    Shi, Yi
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 257 - 265
  • [28] AN END-TO-END SIAMESE CONVOLUTIONAL NEURAL NETWORK FOR LOOP CLOSURE DETECTION IN VISUAL SLAM SYSTEM
    Liu, Hong
    Zhao, Chenyang
    Huang, Weipeng
    Shi, Wei
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 3121 - 3125
  • [29] 3D Convolutional Neural Network for Low-Light Image Sequence Enhancement in SLAM
    Quan, Yizhuo
    Fu, Dong
    Chang, Yuanfei
    Wang, Chengbo
    REMOTE SENSING, 2022, 14 (16)
  • [30] Fast and robust loop-closure detection using deep neural networks and matrix transformation for a visual SLAM system
    Chen, Yan
    Zhong, Yang
    Wang, Wenxiang
    Peng, Hongxing
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06) : 61816