Pyramid Point Cloud Transformer for Large-Scale Place Recognition

Cited by: 77

Authors:

Hui, Le [1]

Yang, Hang [1]

Cheng, Mingmei [1]

Xie, Jin [1]

Yang, Jian [1]

Affiliations:

[1] Nanjing Univ Sci & Technol, PCA Lab, Key Lab Intelligent Percept & Syst High Dimens In, Minist Educ, Nanjing, Jiangsu, Peoples R China

Source:

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021

Keywords:

SIMULTANEOUS LOCALIZATION; SLAM; HISTOGRAMS; ROBUST;

DOI:

10.1109/ICCV48922.2021.00604

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recently, deep learning based point cloud descriptors have achieved impressive results in the place recognition task. Nonetheless, due to the sparsity of point clouds, how to extract discriminative local features of point clouds to efficiently form a global descriptor is still a challenging problem. In this paper, we propose a pyramid point cloud transformer network (PPT-Net) to learn the discriminative global descriptors from point clouds for efficient retrieval. Specifically, we first develop a pyramid point transformer module that adaptively learns the spatial relationship of the different k-NN neighboring points of point clouds, where the grouped self-attention is proposed to extract discriminative local features of the point clouds. The grouped self-attention not only enhances long-term dependencies of the point clouds, but also reduces the computational cost. In order to obtain discriminative global descriptors, we construct a pyramid VLAD module to aggregate the multi-scale feature maps of point clouds into the global descriptors. By applying VLAD pooling on multi-scale feature maps, we utilize the context gating mechanism on the multiple global descriptors to adaptively weight the multi-scale global context information into the final global descriptor. Experimental results on the Oxford dataset and three in-house datasets show that our method achieves the state-of-the-art on the point cloud based place recognition task.
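The grouped self-attention mentioned in the abstract can be illustrated with a minimal sketch. It assumes the common reading of "grouped": the feature channels are split into equal groups, dot-product self-attention runs inside each group, and the results are concatenated. The function names, the identity query/key/value projections, and the scaling factor are assumptions made for illustration, not the authors' implementation, and the sketch does not reproduce the learned projections or the cost savings the abstract reports.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(feats):
    """Scaled dot-product self-attention over a list of point features.

    Assumption: queries, keys, and values are the features themselves
    (identity projections), which keeps the sketch dependency-free.
    """
    n = len(feats)
    d = len(feats[0])
    scale = math.sqrt(d)
    out = []
    for i in range(n):
        scores = [
            sum(a * b for a, b in zip(feats[i], feats[j])) / scale
            for j in range(n)
        ]
        weights = softmax(scores)
        out.append(
            [sum(weights[j] * feats[j][c] for j in range(n)) for c in range(d)]
        )
    return out


def grouped_self_attention(feats, num_groups):
    """Split channels into equal groups, attend within each group, concatenate."""
    d = len(feats[0])
    if d % num_groups != 0:
        raise ValueError("channel count must be divisible by num_groups")
    size = d // num_groups
    outputs = [[] for _ in feats]
    for g in range(num_groups):
        # Slice out this group's channels for every point.
        chunk = [row[g * size:(g + 1) * size] for row in feats]
        attended = self_attention(chunk)
        for i, row in enumerate(attended):
            outputs[i].extend(row)
    return outputs


if __name__ == "__main__":
    # Three hypothetical points with four feature channels each.
    points = [[1.0, 0.0, 2.0, 1.0], [0.0, 1.0, 0.5, 3.0], [1.0, 1.0, 1.0, 1.0]]
    for row in grouped_self_attention(points, num_groups=2):
        print([round(v, 3) for v in row])
```

Each output feature is a convex combination of the input features within its group, so the output keeps the input shape and every channel stays within the range of the corresponding input channel.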


Pages: 6078-6087

Page count: 10

50 items in total

[41] DSC: Deep Scan Context Descriptor for Large-Scale Place Recognition
Cui, Jiafeng
Cai, Yingfeng
Huang, Tengfei
Zhao, Junqiao
Xiong, Lu
Yu, Zhuoping
IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2022, 2022-September
[42] Place Recognition of Large-Scale Unstructured Orchards With Attention Score Maps
Ou, Fang
Li, Yunhui
Miao, Zhonghua
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 958 - 965
[43] DSC: Deep Scan Context Descriptor for Large-Scale Place Recognition
Cui, Jiafeng
Cai, Yingfeng
Huang, Tengfei
Zhao, Junqiao
Xiong, Lu
Yu, Zhuoping
2022 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), 2022,
[44] TopSPR-Net: Topology Aware Segment-Level Point Cloud Learning Descriptors for Three-Dimensional Place Recognition in Large-Scale Environments
Kong, Dong
Li, Xu
Ni, Peizhou
Hu, Yue
Hu, Jinchao
Hu, Weiming
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (10) : 13406 - 13416
[45] A Survey on Processing of Large-Scale 3D Point Cloud
Liu, Xinying
Meng, Weiliang
Guo, Jianwei
Zhang, Xiaopeng
E-LEARNING AND GAMES, 2016, 9654 : 267 - 279
[46] Cascaded Contextual Reasoning for Large-Scale Point Cloud Semantic Segmentation
Zhang, Fengyi
Xia, Xiuyu
IEEE ACCESS, 2023, 11 : 20755 - 20768
[47] Efficient Large-Scale Point Cloud Registration Using Loop Closures
Shiratori, Takaaki
Berclaz, Jerome
Harville, Michael
Shah, Chintan
Li, Taoyu
Matsushita, Yasuyuki
Shiller, Stephen
2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, : 232 - 240
[48] AN ADAPTIVE FILTER FOR DEEP LEARNING NETWORKS ON LARGE-SCALE POINT CLOUD
Zhao, Wang
Yi, Ran
Liu, Yong-Jin
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1620 - 1624
[49] Point cloud transformers for parameter inference of large-scale tissue simulations
Herold, Julian M.
Schug, Alexander H.
Behle, Eric
BIOPHYSICAL JOURNAL, 2024, 123 (03) : 325A - 325A
[50] Robust Point Cloud Based Reconstruction of Large-Scale Outdoor Scenes
Lan, Ziquan
Yew, Zi Jian
Lee, Gim Hee
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9682 - 9690
