Pyramid Point Cloud Transformer for Large-Scale Place Recognition

被引:77
|
作者
Hui, Le [1 ]
Yang, Hang [1 ]
Cheng, Mingmei [1 ]
Xie, Jin [1 ]
Yang, Jian [1 ]
机构
[1] Nanjing Univ Sci & Technol, PCA Lab, Key Lab Intelligent Percept & Syst High Dimens In, Minist Educ, Nanjing, Jiangsu, Peoples R China
关键词
SIMULTANEOUS LOCALIZATION; SLAM; HISTOGRAMS; ROBUST;
D O I
10.1109/ICCV48922.2021.00604
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, deep learning based point cloud descriptors have achieved impressive results in the place recognition task. Nonetheless, due to the sparsity of point clouds, how to extract discriminative local features of point clouds to efficiently form a global descriptor is still a challenging problem. In this paper, we propose a pyramid point cloud transformer network (PPT-Net) to learn the discriminative global descriptors from point clouds for efficient retrieval. Specifically, we first develop a pyramid point transformer module that adaptively learns the spatial relationship of the different k-NN neighboring points of point clouds, where the grouped self-attention is proposed to extract discriminative local features of the point clouds. The grouped self-attention not only enhances long-term dependencies of the point clouds, but also reduces the computational cost. In order to obtain discriminative global descriptors, we construct a pyramid VLAD module to aggregate the multi-scale feature maps of point clouds into the global descriptors. By applying VLAD pooling on multi-scale feature maps, we utilize the context gating mechanism on the multiple global descriptors to adaptively weight the multi-scale global context information into the final global descriptor. Experimental results on the Oxford dataset and three in-house datasets show that our method achieves the state-of-the-art on the point cloud based place recognition task.
引用
收藏
页码:6078 / 6087
页数:10
相关论文
共 50 条
  • [1] PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition
    Uy, Mikaela Angelina
    Lee, Gim Hee
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4470 - 4479
  • [2] MinkLoc3D: Point Cloud Based Large-Scale Place Recognition
    Warsaw, Jacek Komorowski
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1789 - 1798
  • [3] Efficient 3D Point Cloud Feature Learning for Large-Scale Place Recognition
    Hui, Le
    Cheng, Mingmei
    Xie, Jin
    Yang, Jian
    Cheng, Ming-Ming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1258 - 1270
  • [4] Efficient 3D Point Cloud Feature Learning for Large-Scale Place Recognition
    Hui, Le
    Cheng, Mingmei
    Xie, Jin
    Yang, Jian
    Cheng, Ming-Ming
    IEEE Transactions on Image Processing, 2022, 31 : 1258 - 1270
  • [5] Reflective Noise Filtering of Large-Scale Point Cloud Using Transformer
    Gao, Rui
    Li, Mengyu
    Yang, Seung-Jun
    Cho, Kyungeun
    REMOTE SENSING, 2022, 14 (03)
  • [6] HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud
    Hou, Zhixing
    Yan, Yan
    Xu, Chengzhong
    Kong, Hui
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2612 - 2618
  • [7] Hierarchical Bidirected Graph Convolutions for Large-Scale 3-D Point Cloud Place Recognition
    Shu, Dong Wook
    Kwon, Junseok
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9651 - 9662
  • [8] MSAPVT: a multi-scale attention pyramid vision transformer network for large-scale fruit recognition
    Rao, Yao
    Li, Chaofeng
    Xu, Feiran
    Guo, Ya
    JOURNAL OF FOOD MEASUREMENT AND CHARACTERIZATION, 2024, 18 (11) : 9233 - 9251
  • [9] Radial Transformer for Large-Scale Outdoor LiDAR Point Cloud Semantic Segmentation
    He, Xiang
    Li, Xu
    Ni, Peizhou
    Xu, Wang
    Xu, Qimin
    Liu, Xixiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [10] TransLoc3D: point cloud based large-scale place recognition using adaptive receptive fields
    Xu, Tian-xing
    Guo, Yuan-chen
    Li, Zhiqiang
    Lai, Yu-kun
    Zhang, Song-hai
    Yu, Ge
    COMMUNICATIONS IN INFORMATION AND SYSTEMS, 2023, 23 (01) : 57 - 83