Cross self-attention network for 3D point cloud

被引:59
作者
Wang, Gaihua [1 ,2 ]
Zhai, Qianyu [1 ]
Liu, Hong [1 ]
机构
[1] Hubei Univ Technol, Sch Elect & Elect Engn, Wuhan 430068, Peoples R China
[2] Hubei Univ Technol, Hubei Key Lab High efficiency Utilizat Solar Energ, Wuhan 430068, Peoples R China
关键词
Deep learning; Point cloud; Self-attention; Semantic segmentation; Shape classification; Multi-scale fusion;
D O I
10.1016/j.knosys.2022.108769
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is a challenge to design a deep neural network for raw point cloud, which is disordered and unstructured data. In this paper, we introduce a cross self-attention network (CSANet) to solve raw point cloud classification and segmentation tasks. It has permutation invariance and can learn the coordinates and features of point cloud at the same time. To better capture features of different scales, a multi-scale fusion (MF) module is proposed, which can adaptively consider the information of different scales and establish a fast descent branch to bring richer gradient information. Extensive experiments on ModelNet40, ShapeNetPart, and S3DIS demonstrate that the proposed method can achieve competitive results. (C)& nbsp;2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:8
相关论文
共 42 条
[1]  
[Anonymous], 2018, arXiv
[2]  
[Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298801
[3]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.170
[4]  
Ashraf K., 2016, SQUEEZENET ALEXNET L
[5]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[6]   Multi-View 3D Object Detection Network for Autonomous Driving [J].
Chen, Xiaozhi ;
Ma, Huimin ;
Wan, Ji ;
Li, Bo ;
Xia, Tian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534
[7]  
Cicek Ozgun, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P424, DOI 10.1007/978-3-319-46723-8_49
[8]  
Dosovitskiy A, 2020, ARXIV
[9]   3D mixed CNNs with edge-point feature learning [J].
Du, Zijin ;
Ye, Hailiang ;
Cao, Feilong .
KNOWLEDGE-BASED SYSTEMS, 2021, 221
[10]   Point Transformer [J].
Engel, Nico ;
Belagiannis, Vasileios ;
Dietmayer, Klaus .
IEEE ACCESS, 2021, 9 :134826-134840