GeoTransformer: Fast and Robust Point Cloud Registration With Geometric Transformer

被引:86
作者
Qin, Zheng [1 ]
Yu, Hao [2 ]
Wang, Changjian [1 ]
Guo, Yulan [1 ,3 ]
Peng, Yuxing [1 ]
Ilic, Slobodan [2 ,4 ]
Hu, Dewen [1 ]
Xu, Kai [1 ]
机构
[1] Natl Univ Def Technol, Changsha 410073, Peoples R China
[2] Tech Univ Munich, D-80333 Munich, Germany
[3] SunYat Sen Univ, Guangzhou 510275, Peoples R China
[4] Siemens AG, D-80333 Munich, Germany
基金
中国国家自然科学基金;
关键词
Point cloud compression; Transformers; Feature extraction; Three-dimensional displays; Benchmark testing; Convergence; Task analysis; Coarse-to-fine correspondence; geometric consistency; point cloud matching; point cloud registration; transformer;
D O I
10.1109/TPAMI.2023.3259038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of extracting accurate correspondences for point cloud registration. Recent keypoint-free methods have shown great potential through bypassing the detection of repeatable keypoints which is difficult to do especially in low-overlap scenarios. They seek correspondences over downsampled superpoints, which are then propagated to dense points. Superpoints are matched based on whether their neighboring patches overlap. Such sparse and loose matching requires contextual features capturing the geometric structure of the point clouds. We propose Geometric Transformer, or GeoTransformer for short, to learn geometric feature for robust superpoint matching. It encodes pair-wise distances and triplet-wise angles, making it invariant to rigid transformation and robust in low-overlap cases. The simplistic design attains surprisingly high matching accuracy such that no RANSAC is required in the estimation of alignment transformation, leading to 100 times acceleration. Extensive experiments on rich benchmarks encompassing indoor, outdoor, synthetic, multiway and non-rigid demonstrate the efficacy of GeoTransformer. Notably, our method improves the inlier ratio by 18 similar to 31 percentage points and the registration recall by over 7 points on the challenging 3DLoMatch benchmark.
引用
收藏
页码:9806 / 9821
页数:16
相关论文
共 63 条
[1]   SpinNet: Learning a General Surface Descriptor for 3D Point Cloud Registration [J].
Ao, Sheng ;
Hu, Qingyong ;
Yang, Bo ;
Markham, Andrew ;
Guo, Yulan .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :11748-11757
[2]   PointNetLK: Robust & Efficient Point Cloud Registration using PointNet [J].
Aoki, Yasuhiro ;
Goforth, Hunter ;
Srivatsan, Rangaprasad Arun ;
Lucey, Simon .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7156-7165
[3]   PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency [J].
Bai, Xuyang ;
Luo, Zixin ;
Zhou, Lei ;
Chen, Hongkai ;
Li, Lei ;
Hu, Zeyu ;
Fu, Hongbo ;
Tai, Chiew-Lan .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15854-15864
[4]   D3Feat: Joint Learning of Dense Detection and Description of 3D Local Features [J].
Bai, Xuyang ;
Luo, Zixin ;
Zhou, Lei ;
Fu, Hongbo ;
Quan, Long ;
Tai, Chiew-Lan .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6358-6366
[5]   A METHOD FOR REGISTRATION OF 3-D SHAPES [J].
BESL, PJ ;
MCKAY, ND .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (02) :239-256
[6]   PCAM: Product of Cross-Attention Matrices for Rigid Registration of Point Clouds [J].
Cao, Anh-Quan ;
Puy, Gilles ;
Boulch, Alexandre ;
Marlet, Renaud .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :13209-13218
[7]  
Choi S, 2015, PROC CVPR IEEE, P5556, DOI 10.1109/CVPR.2015.7299195
[8]   Deep Global Registration [J].
Choy, Christopher ;
Dong, Wei ;
Koltun, Vladlen .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2511-2520
[9]   Fully Convolutional Geometric Features [J].
Choy, Christopher ;
Park, Jaesik ;
Koltun, Vladlen .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8957-8965
[10]   PPF-FoldNet: Unsupervised Learning of Rotation Invariant 3D Local Descriptors [J].
Deng, Haowen ;
Birdal, Tolga ;
Ilic, Slobodan .
COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 :620-638