DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

被引：23

作者：

Zhang, Yinda ^{[1
]}

Bai, Mingru ^{[1
]}

Kohli, Pushmeet ^{[2
,5
]}

Izadi, Shahram ^{[3
,5
]}

Xiao, Jianxiong ^{[1
,4
]}

机构：

[1] Princeton Univ, Princeton, NJ 08544 USA

[2] DeepMind, London, England

[3] PerceptiveIO, San Francisco, CA USA

[4] AutoX, Saratoga, CA USA

[5] Microsoft Res, Bengaluru, India

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年

关键词：

D O I：

10.1109/ICCV.2017.135

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D context has been shown to be extremely important for scene understanding, yet very little research has been done on integrating context information with deep neural network architectures. This paper presents an approach to embed 3D context into the topology of a neural network trained to perform holistic scene understanding. Given a depth image depicting a 3D scene, our network aligns the observed scene with a predefined 3D scene template, and then reasons about the existence and location of each object within the scene template. In doing so, our model recognizes multiple objects in a single forward pass of a 3D convolutional neural network, capturing both global scene and local object information simultaneously. To create training data for this 3D network, we generate partially synthetic depth images which are rendered by replacing real objects with a repository of CAD models of the same object category(1). Extensive experiments demonstrate the effectiveness of our algorithm compared to the state of the art.

引用

页码：1201 / 1210

页数：10

共 49 条

[1]

[Anonymous], 2007, ICCV

[2]

[Anonymous], 2015, SCENENET UNDERSTANDI

[3]

[Anonymous], IJCV

[4]

[Anonymous], 2010, CVPR

[5]

[Anonymous], 2015, P 28 INT C NEUR INF

[6]

[Anonymous], 2014, ECCV

[7]

[Anonymous], 2013, CVPR

[8]

[Anonymous], 2015, CVPR

[9]

[Anonymous], 2010, ECCV

[10]

[Anonymous], PATTERN RECOGNITION

← 1 2 3 4 5 →