Pyramid Point Cloud Transformer for Large-Scale Place Recognition

Cited by: 77
Authors
Hui, Le [1]
Yang, Hang [1]
Cheng, Mingmei [1]
Xie, Jin [1]
Yang, Jian [1]
Affiliations
[1] Nanjing Univ Sci & Technol, PCA Lab, Key Lab Intelligent Percept & Syst High Dimens In, Minist Educ, Nanjing, Jiangsu, Peoples R China
Keywords
SIMULTANEOUS LOCALIZATION; SLAM; HISTOGRAMS; ROBUST;
DOI
10.1109/ICCV48922.2021.00604
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recently, deep learning based point cloud descriptors have achieved impressive results in the place recognition task. Nonetheless, due to the sparsity of point clouds, how to extract discriminative local features of point clouds to efficiently form a global descriptor is still a challenging problem. In this paper, we propose a pyramid point cloud transformer network (PPT-Net) to learn the discriminative global descriptors from point clouds for efficient retrieval. Specifically, we first develop a pyramid point transformer module that adaptively learns the spatial relationship of the different k-NN neighboring points of point clouds, where the grouped self-attention is proposed to extract discriminative local features of the point clouds. The grouped self-attention not only enhances long-term dependencies of the point clouds, but also reduces the computational cost. In order to obtain discriminative global descriptors, we construct a pyramid VLAD module to aggregate the multi-scale feature maps of point clouds into the global descriptors. By applying VLAD pooling on multi-scale feature maps, we utilize the context gating mechanism on the multiple global descriptors to adaptively weight the multi-scale global context information into the final global descriptor. Experimental results on the Oxford dataset and three in-house datasets show that our method achieves the state-of-the-art on the point cloud based place recognition task.
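The grouped self-attention mentioned in the abstract can be illustrated with a minimal sketch. It assumes that "grouped" means splitting the feature channels into equal groups and running scaled dot-product self-attention inside each group before concatenating the results; the abstract does not spell this out. Learned query/key/value projections are omitted (identity projections are used), and the names `self_attention`, `grouped_self_attention`, and `num_groups` are hypothetical. This is not the authors' PPT-Net implementation.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(feats):
    """Scaled dot-product self-attention with identity projections.

    feats: list of N point features, each a list of d floats.
    Assumption: Q = K = V = feats; a real network would learn projections.
    """
    d = len(feats[0])
    out = []
    for q in feats:
        # Similarity of this point to every point, scaled by sqrt(d).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in feats]
        weights = softmax(scores)
        # Each output feature is a weighted average of all point features.
        out.append([sum(w * v[c] for w, v in zip(weights, feats)) for c in range(d)])
    return out


def grouped_self_attention(feats, num_groups):
    """Split channels into num_groups slices, attend per slice, concatenate."""
    d = len(feats[0])
    if d % num_groups != 0:
        raise ValueError("feature dimension must be divisible by num_groups")
    size = d // num_groups
    out = [[] for _ in feats]
    for g in range(num_groups):
        group = [p[g * size:(g + 1) * size] for p in feats]
        attended = self_attention(group)
        for row, a in zip(out, attended):
            row.extend(a)
    return out


# Three points with 4-dimensional features, attended in 2 channel groups.
points = [[1.0, 0.0, 0.5, 0.5], [0.0, 1.0, 0.5, 0.5], [1.0, 1.0, 0.0, 1.0]]
result = grouped_self_attention(points, num_groups=2)
```

With learned projections, each group would need its own projection matrices of size (d/G) x (d/G) rather than one d x d matrix, which is one plausible source of the reduced cost the abstract refers to; the sketch above does not model that saving.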
Pages: 6078-6087
Number of pages: 10
Related papers
50 in total
  • [41] DSC: Deep Scan Context Descriptor for Large-Scale Place Recognition
    Cui, Jiafeng
    Cai, Yingfeng
    Huang, Tengfei
    Zhao, Junqiao
    Xiong, Lu
    Yu, Zhuoping
    IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2022, 2022-September
  • [42] Place Recognition of Large-Scale Unstructured Orchards With Attention Score Maps
    Ou, Fang
    Li, Yunhui
    Miao, Zhonghua
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 958 - 965
  • [43] DSC: Deep Scan Context Descriptor for Large-Scale Place Recognition
    Cui, Jiafeng
    Cai, Yingfeng
    Huang, Tengfei
    Zhao, Junqiao
    Xiong, Lu
    Yu, Zhuoping
    2022 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), 2022,
  • [44] TopSPR-Net: Topology Aware Segment-Level Point Cloud Learning Descriptors for Three-Dimensional Place Recognition in Large-Scale Environments
    Kong, Dong
    Li, Xu
    Ni, Peizhou
    Hu, Yue
    Hu, Jinchao
    Hu, Weiming
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (10) : 13406 - 13416
  • [45] A Survey on Processing of Large-Scale 3D Point Cloud
    Liu, Xinying
    Meng, Weiliang
    Guo, Jianwei
    Zhang, Xiaopeng
    E-LEARNING AND GAMES, 2016, 9654 : 267 - 279
  • [46] Cascaded Contextual Reasoning for Large-Scale Point Cloud Semantic Segmentation
    Zhang, Fengyi
    Xia, Xiuyu
    IEEE ACCESS, 2023, 11 : 20755 - 20768
  • [47] Efficient Large-Scale Point Cloud Registration Using Loop Closures
    Shiratori, Takaaki
    Berclaz, Jerome
    Harville, Michael
    Shah, Chintan
    Li, Taoyu
    Matsushita, Yasuyuki
    Shiller, Stephen
    2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, : 232 - 240
  • [48] AN ADAPTIVE FILTER FOR DEEP LEARNING NETWORKS ON LARGE-SCALE POINT CLOUD
    Zhao, Wang
    Yi, Ran
    Liu, Yong-Jin
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1620 - 1624
  • [49] Point cloud transformers for parameter inference of large-scale tissue simulations
    Herold, Julian M.
    Schug, Alexander H.
    Behle, Eric
    BIOPHYSICAL JOURNAL, 2024, 123 (03) : 325A - 325A
  • [50] Robust Point Cloud Based Reconstruction of Large-Scale Outdoor Scenes
    Lan, Ziquan
    Yew, Zi Jian
    Lee, Gim Hee
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9682 - 9690