Point Transformer

被引:186
作者
Engel, Nico [1 ]
Belagiannis, Vasileios [1 ]
Dietmayer, Klaus [1 ]
机构
[1] Ulm Univ, Inst Measurement Control & Microtechnol, D-89081 Ulm, Germany
关键词
Shape; Transformers; Three-dimensional displays; Standards; Task analysis; Feature extraction; Computer vision; 3D point processing; artificial neural networks; computer vision; feedforward neural networks; transformer;
D O I
10.1109/ACCESS.2021.3116304
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we present Point Transformer, a deep neural network that operates directly on unordered and unstructured point sets. We design Point Transformer to extract local and global features and relate both representations by introducing the local-global attention mechanism, which aims to capture spatial point relations and shape information. For that purpose, we propose SortNet, as part of the Point Transformer, which induces input permutation invariance by selecting points based on a learned score. The output of Point Transformer is a sorted and permutation invariant feature list that can directly be incorporated into common computer vision applications. We evaluate our approach on standard classification and part segmentation benchmarks to demonstrate competitive results compared to the prior work.
引用
收藏
页码:134826 / 134840
页数:15
相关论文
共 45 条
[1]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[2]  
Chang A.X., 2015, Technical Report
[3]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[4]  
Engel N, 2019, IEEE INT C INTELL TR, P926, DOI [10.1109/itsc.2019.8917005, 10.1109/ITSC.2019.8917005]
[5]  
Engel N, 2018, IEEE INT C INTELL TR, P3852, DOI 10.1109/ITSC.2018.8569234
[6]  
Fischer K., 2020, ARXIV200203983
[7]   Deep Learning for 3D Point Clouds: A Survey [J].
Guo, Yulan ;
Wang, Hanyun ;
Hu, Qingyong ;
Liu, Hao ;
Liu, Li ;
Bennamoun, Mohammed .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (12) :4338-4364
[8]   DeepCLR: Correspondence-Less Architecture for Deep End-to-End Point Cloud Registration [J].
Horn, Markus ;
Engel, Nico ;
Belagiannis, Vasileios ;
Buchholz, Michael ;
Dietmayer, Klaus .
2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
[9]   Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1026-1034
[10]   RotationNet: Joint Object Categorization and Pose Estimation Using Multiviews from Unsupervised Viewpoints [J].
Kanezaki, Asako ;
Matsushita, Yasuyuki ;
Nishida, Yoshifumi .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5010-5019