Spherical Kernel for Efficient Graph Convolution on 3D Point Clouds

被引:139
作者
Lei, Huan [1 ]
Akhtar, Naveed [1 ]
Mian, Ajmal [1 ]
机构
[1] Univ Western Australia, Dept Comp Sci & Software Engn, 35 Stirling Highway, Crawley, WA 6009, Australia
基金
澳大利亚研究理事会;
关键词
Three-dimensional displays; Kernel; Convolution; Neural networks; Feature extraction; Semantics; Computer architecture; 3D point cloud; spherical kernel; graph neural network; semantic segmentation; HISTOGRAMS; NETWORKS;
D O I
10.1109/TPAMI.2020.2983410
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a spherical kernel for efficient graph convolution of 3D point clouds. Our metric-based kernels systematically quantize the local 3D space to identify distinctive geometric relationships in the data. Similar to the regular grid CNN kernels, the spherical kernel maintains translation-invariance and asymmetry properties, where the former guarantees weight sharing among similar local structures in the data and the latter facilitates fine geometric learning. The proposed kernel is applied to graph neural networks without edge-dependent filter generation, making it computationally attractive for large point clouds. In our graph networks, each vertex is associated with a single point location and edges connect the neighborhood points within a defined range. The graph gets coarsened in the network with farthest point sampling. Analogous to the standard CNNs, we define pooling and unpooling operations for our network. We demonstrate the effectiveness of the proposed spherical kernel with graph neural networks for point cloud classification and semantic segmentation using ModelNet, ShapeNet, RueMonge2014, ScanNet and S3DIS datasets. The source code and the trained models can be downloaded from https://github.com/hlei-ziyan/SPH3D-GCN.
引用
收藏
页码:3664 / 3680
页数:17
相关论文
共 93 条
[91]   A Scalable Active Framework for Region Annotation in 3D Shape Collections [J].
Yi, Li ;
Kim, Vladimir G. ;
Ceylan, Duygu ;
Shen, I-Chao ;
Yan, Mengyan ;
Su, Hao ;
Lu, Cewu ;
Huang, Qixing ;
Sheffer, Alla ;
Guibas, Leonidas .
ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06)
[92]   3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions [J].
Zeng, Andy ;
Song, Shuran ;
Niessner, Matthias ;
Fisher, Matthew ;
Xiao, Jianxiong ;
Funkhouser, Thomas .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :199-208
[93]   DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding [J].
Zhang, Yinda ;
Bai, Mingru ;
Kohli, Pushmeet ;
Izadi, Shahram ;
Xiao, Jianxiong .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1201-1210