Patchlpr: a multi-level feature fusion transformer network for LiDAR-based place recognition

被引:0
|
作者
Sun, Yang [1 ,2 ]
Guo, Jianhua [1 ,3 ]
Wang, Haiyang [4 ]
Zhang, Yuhang [1 ,3 ]
Zheng, Jiushuai [1 ,3 ]
Tian, Bin [5 ]
机构
[1] Hebei Univ Engn, Coll Mech & Equipment Engn, Handan 056038, Peoples R China
[2] Key Lab Intelligent Ind Equipment Technol Hebei Pr, Handan, Hebei, Peoples R China
[3] Handan Key Lab Intelligent Vehicles, Handan, Hebei, Peoples R China
[4] Jizhong Energy Fengfeng Grp Co Ltd, 16 Unicom South Rd, Handan, Hebei, Peoples R China
[5] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
关键词
SLAM; LiDAR Place recognition; Deep learning; Patch; VISION; DEEP;
D O I
10.1007/s11760-024-03138-9
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
LiDAR-based place recognition plays a crucial role in autonomous vehicles, enabling the identification of locations in GPS-invalid environments that were previously accessed. Localization in place recognition can be achieved by searching for nearest neighbors in the database. Two common types of place recognition features are local descriptors and global descriptors. Local descriptors typically compactly represent regions or points, while global descriptors provide an overarching view of the data. Despite the significant progress made in recent years by both types of descriptors, any representation inevitably involves information loss. To overcome this limitation, we have developed PatchLPR, a Transformer network employing multi-level feature fusion for robust place recognition. PatchLPR integrates global and local feature information, focusing on meaningful regions on the feature map to generate an environmental representation. We propose a patch feature extraction module based on the Vision Transformer to fully leverage the information and correlations of different features. We evaluated our approach on the KITTI dataset and a self-collected dataset covering over 4.2 km. The experimental results demonstrate that our method effectively utilizes multi-level features to enhance place recognition performance.
引用
收藏
页码:157 / 165
页数:9
相关论文
共 50 条
  • [1] Multi-level Feature Fusion Facial Expression Recognition Network
    Hu, Qian
    Wu, Chengdong
    Chi, Jianning
    Yu, Xiaosheng
    Wang, Huan
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 5267 - 5272
  • [2] Stabilize an Unsupervised Feature Learning for LiDAR-based Place Recognition
    Yin, Peng
    Xu, Lingyun
    Liu, Zhe
    Li, Lu
    Salman, Hadi
    He, Yuqing
    Xu, Weiliang
    Wang, Hesheng
    Choset, Howie
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 1162 - 1167
  • [3] Road Recognition Based on Multi-scale Convolutional Network with Multi-level Feature Fusion
    Li, Ye
    Guo, Lili
    Xu, Lele
    Wang, Xianfeng
    Jin, Shan
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [4] OverlapTransformer: An Efficient and Yaw-Angle-Invariant Transformer Network for LiDAR-Based Place Recognition
    Ma, Junyi
    Zhang, Jun
    Xu, Jintao
    Ai, Rui
    Gu, Weihao
    Chen, Xieyuanli
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 6958 - 6965
  • [5] Human Action Recognition Based On Multi-level Feature Fusion
    Xu, Y. Y.
    Xiao, G. Q.
    Tang, X. Q.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL APPLICATIONS (CISIA 2015), 2015, 18 : 353 - 355
  • [6] Context for LiDAR-based Place Recognition
    Li, Jiahao
    Qian, Hui
    Du, Xin
    2023 21ST INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS, ICAR, 2023, : 107 - 112
  • [7] CCTNet: A Circular Convolutional Transformer Network for LiDAR-Based Place Recognition Handling Movable Objects Occlusion
    Wang, Gang
    Zhu, Chaoran
    Xu, Qian
    Zhang, Tongzhou
    Zhang, Hai
    Fan, Xiaopeng
    Hu, Jue
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (04) : 3276 - 3289
  • [8] CVTNet: A Cross-View Transformer Network for LiDAR-Based Place Recognition in Autonomous Driving Environments
    Ma, Junyi
    Xiong, Guangming
    Xu, Jingyi
    Chen, Xieyuanli
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (03) : 4039 - 4048
  • [9] TransVPR: Transformer-Based Place Recognition with Multi-Level Attention Aggregation
    Wang, Ruotong
    Shen, Yanqing
    Zuo, Weiliang
    Zhou, Sanping
    Zheng, Nanning
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13638 - 13647
  • [10] Multi-level feature fusion based Locality-Constrained Spatial Transformer network for video crowd counting
    Fang Y.
    Gao S.
    Li J.
    Luo W.
    He L.
    Hu B.
    Neurocomputing, 2020, 392 : 98 - 107