VV-NET: Voxel VAE Net with Group Convolutions for Point Cloud Segmentation

Cited by: 229

Authors:

Meng, Hsien-Yu [1,4]

Gao, Lin [2]

Lai, Yu-Kun [3]

Manocha, Dinesh [1]

Affiliations:

[1] Univ Maryland, College Pk, MD 20742 USA

[2] Chinese Acad Sci, Inst Comp Technol, Beijing Key Lab Mobile Comp & Pervas Device, Beijing, Peoples R China

[3] Cardiff Univ, Sch Comp Sci & Informat, Cardiff, S Glam, Wales

[4] Tsinghua Univ, Beijing, Peoples R China

Source:

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019

Funding:

National Natural Science Foundation of China; Beijing Natural Science Foundation;

Keywords:

NETWORKS;

DOI:

10.1109/ICCV.2019.00859

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present a novel algorithm for point cloud segmentation. Our approach transforms unstructured point clouds into regular voxel grids, and further uses a kernel-based interpolated variational autoencoder (VAE) architecture to encode the local geometry within each voxel. Traditionally, the voxel representation only comprises Boolean occupancy information which fails to capture the sparsely distributed points within voxels in a compact manner. In order to handle sparse distributions of points, we further employ radial basis functions (RBF) to compute a local, continuous representation within each voxel. Our approach results in a good volumetric representation that effectively tackles noisy point cloud datasets and is more robust for learning. Moreover, we further introduce group equivariant CNN to 3D, by defining the convolution operator on a symmetry group acting on Z^3 and its isomorphic sets. This improves the expressive capacity without increasing parameters, leading to more robust segmentation results. We highlight the performance on standard benchmarks and show that our approach outperforms state-of-the-art segmentation algorithms on the ShapeNet and S3DIS datasets.
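The contrast the abstract draws between Boolean occupancy and an RBF-based continuous representation can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `rbf_voxel_features`, the grid size, the number of sample sites per voxel, the Gaussian width `sigma`, and the use of a maximum over points are all assumptions made for illustration. The VAE that the paper applies on top of such a representation is not shown.

```python
import numpy as np


def rbf_voxel_features(points, grid_size=4, samples_per_axis=2, sigma=0.1):
    """Voxelize points in the unit cube and compute Gaussian RBF responses.

    Hypothetical helper, not taken from the paper's code.
    Returns:
      occupancy: (G, G, G) bool array, True where a voxel holds any point.
      features:  (G, G, G, S**3) float array, RBF response at S**3 sample
                 sites inside each voxel (max over the voxel's points).
    """
    points = np.asarray(points, dtype=float)
    G, S = grid_size, samples_per_axis
    voxel_size = 1.0 / G

    # Integer voxel index of each point, clipped so a coordinate of 1.0 stays in range.
    idx = np.clip(np.floor(points / voxel_size).astype(int), 0, G - 1)

    # Sample-site offsets inside one voxel: centres of an S x S x S sub-grid.
    ticks = (np.arange(S) + 0.5) / S * voxel_size
    offsets = np.stack(
        np.meshgrid(ticks, ticks, ticks, indexing="ij"), axis=-1
    ).reshape(-1, 3)

    occupancy = np.zeros((G, G, G), dtype=bool)
    features = np.zeros((G, G, G, S ** 3))
    for p, (i, j, k) in zip(points, idx):
        occupancy[i, j, k] = True
        sites = np.array([i, j, k]) * voxel_size + offsets  # (S**3, 3)
        d2 = np.sum((sites - p) ** 2, axis=1)
        response = np.exp(-d2 / (2.0 * sigma ** 2))  # Gaussian RBF (assumed kernel)
        # Keep the strongest response per site (aggregation rule is an assumption).
        features[i, j, k] = np.maximum(features[i, j, k], response)
    return occupancy, features


# Two single-point clouds that fall in the same voxel but at different positions.
occ_a, feat_a = rbf_voxel_features(np.array([[0.05, 0.05, 0.05]]))
occ_b, feat_b = rbf_voxel_features(np.array([[0.20, 0.20, 0.20]]))
print(np.array_equal(occ_a, occ_b))   # occupancy grids are compared here
print(np.allclose(feat_a, feat_b))    # RBF features are compared here
```

In this sketch the two occupancy grids are identical because both points land in the same voxel, while the RBF features differ because they depend on where the point sits inside the voxel. That difference is the extra local geometry a plain occupancy grid discards.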

Citation

Pages: 8499-8507

Page count: 9
