O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

被引:739
作者
Wang, Peng-Shuai [1 ,2 ]
Liu, Yang [2 ]
Guo, Yu-Xiao [2 ,3 ]
Sun, Chun-Yu [1 ,2 ]
Tong, Xin [2 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
[3] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China
来源
ACM TRANSACTIONS ON GRAPHICS | 2017年 / 36卷 / 04期
关键词
octree; convolutional neural network; object classification; shape retrieval; shape segmentation;
D O I
10.1145/3072959.3073608
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present O-CNN, an Octree-based Convolutional Neural Network (CNN) for 3D shape analysis. Built upon the octree representation of 3D shapes, our method takes the average normal vectors of a 3D model sampled in the finest leaf octants as input and performs 3D CNN operations on the octants occupied by the 3D shape surface. We design a novel octree data structure to efficiently store the octant information and CNN features into the graphics memory and execute the entire O-CNN training and evaluation on the GPU. O-CNN supports various CNN structures and works for 3D shapes in different representations. By restraining the computations on the octants occupied by 3D surfaces, the memory and computational costs of the O-CNN grow quadratically as the depth of the octree increases, which makes the 3D CNN feasible for high-resolution 3D models. We compare the performance of the O-CNN with other existing 3D CNN solutions and demonstrate the efficiency and efficacy of O-CNN in three shape analysis tasks, including object classification, shape retrieval, and shape segmentation.
引用
收藏
页数:11
相关论文
共 36 条
  • [1] [Anonymous], 2017, Computer Vision and Pattern Recognition CVPR
  • [2] [Anonymous], BRIT MACH VIS C BMVC
  • [3] [Anonymous], 2016, P EUR WORKSH 3D OBJ
  • [4] [Anonymous], COMPUTER VISION PATT
  • [5] [Anonymous], 2013, P INT C MACH LEARN
  • [6] Learning class-specific descriptors for deformable shapes using localized spectral convolutional networks
    Boscaini, D.
    Masci, J.
    Mezi, S.
    Bronstein, M. M.
    Castellani, U.
    Vandergheynst, P.
    [J]. COMPUTER GRAPHICS FORUM, 2015, 34 (05) : 13 - 23
  • [7] Boscaini Davide, 2016, Neural Information Processing Systems
  • [8] Brock Andrew, 2016, 3D DEEP NIPS LEARN W
  • [9] Bronstein M. M., 2017, IEEE SIG P MAG
  • [10] Chang Angel X., 2015, arXiv