Pixels, voxels, and views: A study of shape representations for single view 3D object shape prediction

Cited by: 57

Authors:

Shin, Daeyun [1]

Fowlkes, Charless C. [1]

Hoiem, Derek [2]

Affiliations:

[1] Univ Calif Irvine, Irvine, CA 92697 USA

[2] Univ Illinois, Urbana, IL 61801 USA

Source:

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) | 2018

Funding:

National Science Foundation (USA);

Keywords:

RECOGNITION;

DOI:

10.1109/CVPR.2018.00323

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The goal of this paper is to compare surface-based and volumetric 3D object shape representations, as well as viewer-centered and object-centered reference frames for single-view 3D shape prediction. We propose a new algorithm for predicting depth maps from multiple viewpoints, with a single depth or RGB image as input. By modifying the network and the way models are evaluated, we can directly compare the merits of voxels vs. surfaces and viewer-centered vs. object-centered for familiar vs. unfamiliar objects, as predicted from RGB or depth images. Among our findings, we show that surface-based methods outperform voxel representations for objects from novel classes and produce higher resolution outputs. We also find that using viewer-centered coordinates is advantageous for novel objects, while object-centered representations are better for more familiar objects. Interestingly, the coordinate frame significantly affects the shape representation learned, with object-centered placing more importance on implicitly recognizing the object category and viewer-centered producing shape representations with less dependence on category recognition.

Citation

Pages: 3061-3069

Page count: 9
