Direction-induced convolution for point cloud analysis

Cited by: 3
Authors
Fang, Yuan [1]
Xu, Chunyan [1]
Zhou, Chuanwei [1]
Cui, Zhen [1]
Hu, Chunlong [2]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Comp Sci & Engn, Zhenjiang 212003, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Point cloud; Convolution; Semantic segmentation; Classification; SEGMENTATION; NETWORKS;
DOI
10.1007/s00530-021-00770-0
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Point cloud analysis has become a fundamental but challenging problem in 3D scene understanding. To deal with unstructured and unordered point clouds embedded in 3D space, we propose a novel direction-induced convolution (DIConv) that obtains hierarchical representations of point clouds and thereby improves the performance of point cloud analysis. Specifically, we first construct a direction set as the basis of spatial direction information, whose entries denote the latent direction components of 3D points. For each neighbor point, we project its direction information onto the constructed direction set to obtain an array of direction-dependent weights, and then transform its features into the canonical, ordered direction-set space. After that, a standard image-like convolution can be applied to encode the unordered neighborhood regions of point cloud data. We further develop a residual DIConv (Res_DIConv) module and a farthest point sampling residual DIConv (FPS_Res_DIConv) module to jointly capture the hierarchical features of input point clouds. By alternately stacking Res_DIConv and FPS_Res_DIConv modules, a direction-induced convolution network (DICNet) can be built to perform point cloud analysis in an end-to-end fashion. Comprehensive experiments on three benchmark datasets (ModelNet40, ShapeNet Part, and S3DIS) demonstrate that the proposed DIConv method achieves encouraging performance on both point cloud classification and semantic segmentation tasks.
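The projection-then-convolution idea described in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the abstract does not specify the direction set, the weighting function, or the kernel shape, so the six axis-aligned directions, the softmax over cosine similarities, the `temperature` parameter, and the names `AXIS_DIRECTIONS`, `direction_weights`, and `direction_induced_conv` are all assumptions made here for illustration.

```python
import math

# ASSUMPTION: a hypothetical direction set made of the six axis-aligned unit vectors.
AXIS_DIRECTIONS = [
    (1.0, 0.0, 0.0), (-1.0, 0.0, 0.0),
    (0.0, 1.0, 0.0), (0.0, -1.0, 0.0),
    (0.0, 0.0, 1.0), (0.0, 0.0, -1.0),
]


def direction_weights(center, neighbor, directions, temperature=0.1):
    """Project the center-to-neighbor direction onto each entry of the direction set.

    ASSUMPTION: weights are a softmax over cosine similarities; the paper may
    use a different weighting function.
    """
    offset = [n - c for n, c in zip(neighbor, center)]
    norm = math.sqrt(sum(v * v for v in offset)) or 1.0  # guard against a zero offset
    unit = [v / norm for v in offset]
    sims = [sum(u * d for u, d in zip(unit, direction)) for direction in directions]
    peak = max(sims)  # subtract the maximum for numerical stability
    exps = [math.exp((s - peak) / temperature) for s in sims]
    total = sum(exps)
    return [e / total for e in exps]


def direction_induced_conv(center, neighbors, feats, directions, kernel):
    """Encode an unordered neighborhood with an ordered, per-direction kernel.

    kernel[k][c][o] is the weight for direction slot k, input channel c,
    and output channel o (a hypothetical layout).
    """
    num_dirs = len(directions)
    in_ch = len(feats[0])
    out_ch = len(kernel[0][0])

    # Step 1: accumulate neighbor features into K ordered direction slots.
    ordered = [[0.0] * in_ch for _ in range(num_dirs)]
    for point, feat in zip(neighbors, feats):
        weights = direction_weights(center, point, directions)
        for k, w in enumerate(weights):
            for c, value in enumerate(feat):
                ordered[k][c] += w * value

    # Step 2: apply an ordinary convolution-style weighted sum over the slots.
    out = [0.0] * out_ch
    for k in range(num_dirs):
        for c in range(in_ch):
            for o in range(out_ch):
                out[o] += ordered[k][c] * kernel[k][c][o]
    return out
```

Because the neighbor features are summed into fixed direction slots before the kernel is applied, the result in this sketch does not depend on the order in which neighbors are listed, which is the property that lets an ordered kernel be used on an unordered neighborhood.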
Pages: 457-468
Page count: 12