Learning representative viewpoints in 3D shape recognition

被引：0

作者：

Huazhen Chu

Chao Le

Rongquan Wang

Xi Li

Huimin Ma

机构：

[1] University of Science and Technology Beijing,

[2] Tsinghua University,undefined

来源：

The Visual Computer | 2022年 / 38卷

关键词：

3D shape recognition; View structure; Representative viewpoints; Deep neural network;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Adopting many viewpoints and mining the relationship between them, 3D shape recognition inferring the object’s category from 2D rendered images has proven effective. However, using a limited number of general representative viewpoints to form a reasonable expression of the object is a task with both practical and theoretical significance. This paper proposes a multi-view CNN architecture with independent viewpoint feature extraction and the unity of importance weights, which can dramatically decrease the number of viewpoints by learning the representative ones. First, the view-based and independent view features are extracted by a deep neural network. Second, the network automatically learns relativity between these viewpoints and outputs the importance weights of views. Finally, view features are aggregated to predict the category of objects. Through iterative learning of these critical weights in instances, global representative viewpoints are selected. We assess our method on two challenging datasets, ModelNet and ShapeNet. Rigorous experiments show that our strategy is competitive with the latest method using only six viewpoints and RGB information as input. Meanwhile, our approach also achieves state-of-the-art performance by using 20 viewpoints as input. Specifically, the proposed approach achieves 99.34% and 97.49% accuracy on the ModelNet10 and ModelNet40, and 80.0% mAP on ShapeNet.

引用

页码：3703 / 3718

页数：15

共 37 条

[1]

Ren S(2017)Faster r-cnn: Towards real-time object detection with region proposal networks IEEE Trans. Pattern Anal. Mach. Intell. 39 1137-1149

[2]

He K(2014)Fully convolutional networks for semantic segmentation IEEE Trans. Pattern Anal. Mach. Intell. 39 640-651

[3]

Girshick R(2018)Point-wise saliency detection on 3d point clouds via covariance descriptors Visual Comput. 34 1325-1338

[4]

Sun J(2017)3d shape recognition and retrieval based on multi-modality deep learning Neurocomputing 259 183-193

[5]

Long J(2019)3d2seqviews: aggregating sequential views for 3d global feature learning by cnn with hierarchical attention aggregation IEEE Trans. Image Process. 28 3986-3999

[6]

Shelhamer E(2018)Seqviews2seqlabels: learning 3d global features via aggregating sequential views by rnn with attention IEEE Trans. Image Process. 28 658-672

[7]

Darrell T(2017)Gift: towards scalable 3d shape retrieval IEEE Trans. Multimedia 19 1257-1271

[8]

Guo Y(undefined)undefined undefined undefined undefined-undefined

[9]

Wang F(undefined)undefined undefined undefined undefined-undefined

[10]

Xin J(undefined)undefined undefined undefined undefined-undefined

← 1 2 3 4 →