View-GCN: View-based Graph Convolutional Network for 3D Shape Analysis

被引:267
作者
Wei, Xin [1 ]
Yu, Ruixuan [1 ]
Sun, Jian [1 ]
机构
[1] Xi An Jiao Tong Univ, Xian 710049, Peoples R China
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年
关键词
NEURAL-NETWORK;
D O I
10.1109/CVPR42600.2020.00192
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
View-based approach that recognizes 3D shape through its projected 2D images has achieved state-of-the-art results for 3D shape recognition. The major challenge for view-based approach is how to aggregate multi-view features to be a global shape descriptor. In this work, we propose a novel view-based Graph Convolutional Neural Network, dubbed as view-GCN, to recognize 3D shape based on graph representation of multiple views in flexible view configurations. We first construct view-graph with multiple views as graph nodes, then design a graph convolutional neural network over view-graph to hierarchically learn discriminative shape descriptor considering relations of multiple views. The view-GCN is a hierarchical network based on local and non-local graph convolution for feature transform, and selective view-sampling for graph coarsening. Extensive experiments on benchmark datasets show that view-GCN achieves state-of-the-art results for 3D shape classification and retrieval.
引用
收藏
页码:1847 / 1856
页数:10
相关论文
共 55 条
[1]  
[Anonymous], 2016, NEURIPS 3D DEEP LEAR
[2]  
[Anonymous], 2016, NEURIPS
[3]  
[Anonymous], 2017, NEURIPS WORKSH
[4]   A Multi-Modal, Discriminative and Spatially Invariant CNN for RGB-D Object Labeling [J].
Asif, Umar ;
Bennamoun, Mohammed ;
Sohel, Ferdous A. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (09) :2051-2065
[5]   GIFT: Towards Scalable 3D Shape Retrieval [J].
Bai, Song ;
Bai, Xiang ;
Zhou, Zhichao ;
Zhang, Zhaoxiang ;
Tian, Qi ;
Latecki, Longin Jan .
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (06) :1257-1271
[6]   GIFT: A Real-time and Scalable 3D Shape Search Engine [J].
Bai, Song ;
Bai, Xiang ;
Zhou, Zhichao ;
Zhang, Zhaoxiang ;
Latecki, Longin Jan .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5023-5032
[7]  
Bruna J, 2013, 2 INT C LEARN REPR I
[8]   VERAM: View-Enhanced Recurrent Attention Model for 3D Shape Classification [J].
Chen, Songle ;
Zheng, Lintao ;
Zhang, Yan ;
Sun, Zhixin ;
Xu, Kai .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (12) :3244-3257
[9]   Convolutional Fisher Kernels for RGB-D Object Recognition [J].
Cheng, Yanhua ;
Cai, Rui ;
Zhao, Xin ;
Huang, Kaiqi .
2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, :135-143
[10]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554