Spherical Kernel for Efficient Graph Convolution on 3D Point Clouds

被引：139

作者：

Lei, Huan ^{[1
]}

Akhtar, Naveed ^{[1
]}

Mian, Ajmal ^{[1
]}

机构：

[1] Univ Western Australia, Dept Comp Sci & Software Engn, 35 Stirling Highway, Crawley, WA 6009, Australia

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2021年 / 43卷 / 10期

基金：

澳大利亚研究理事会;

关键词：

Three-dimensional displays; Kernel; Convolution; Neural networks; Feature extraction; Semantics; Computer architecture; 3D point cloud; spherical kernel; graph neural network; semantic segmentation; HISTOGRAMS; NETWORKS;

D O I：

10.1109/TPAMI.2020.2983410

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a spherical kernel for efficient graph convolution of 3D point clouds. Our metric-based kernels systematically quantize the local 3D space to identify distinctive geometric relationships in the data. Similar to the regular grid CNN kernels, the spherical kernel maintains translation-invariance and asymmetry properties, where the former guarantees weight sharing among similar local structures in the data and the latter facilitates fine geometric learning. The proposed kernel is applied to graph neural networks without edge-dependent filter generation, making it computationally attractive for large point clouds. In our graph networks, each vertex is associated with a single point location and edges connect the neighborhood points within a defined range. The graph gets coarsened in the network with farthest point sampling. Analogous to the standard CNNs, we define pooling and unpooling operations for our network. We demonstrate the effectiveness of the proposed spherical kernel with graph neural networks for point cloud classification and semantic segmentation using ModelNet, ShapeNet, RueMonge2014, ScanNet and S3DIS datasets. The source code and the trained models can be downloaded from https://github.com/hlei-ziyan/SPH3D-GCN.

引用

页码：3664 / 3680

页数：17

共 93 条

[91] A Scalable Active Framework for Region Annotation in 3D Shape Collections [J].

Yi, Li ;

Kim, Vladimir G. ;

Ceylan, Duygu ;

Shen, I-Chao ;

Yan, Mengyan ;

Su, Hao ;

Lu, Cewu ;

Huang, Qixing ;

Sheffer, Alla ;

Guibas, Leonidas .

ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06)

[92] 3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions [J].

Zeng, Andy ;

Song, Shuran ;

Niessner, Matthias ;

Fisher, Matthew ;

Xiao, Jianxiong ;

Funkhouser, Thomas .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :199-208

[93] DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding [J].

Zhang, Yinda ;

Bai, Mingru ;

Kohli, Pushmeet ;

Izadi, Shahram ;

Xiao, Jianxiong .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1201-1210

← 1 2 3 4 5 6 7 8 9 10 →