Indoor Scene Understanding with Geometric and Semantic Contexts

Authors:

Wongun Choi

Yu-Wei Chao

Caroline Pantofaru

Silvio Savarese

Affiliations:

[1] NEC Laboratories America

[2] University of Michigan

[3] Google, Inc.

[4] Stanford University

Source:

International Journal of Computer Vision | 2015 / Vol. 112

Keywords:

Scene understanding; Scene parsing; Object recognition; 3D layout

DOI:

Not available

Abstract:

Truly understanding a scene involves integrating information at multiple levels as well as studying the interactions between scene elements. Individual object detectors, layout estimators and scene classifiers are powerful but ultimately confounded by complicated real-world scenes with high variability, different viewpoints and occlusions. We propose a method that can automatically learn the interactions among scene elements and apply them to the holistic understanding of indoor scenes from a single image. This interpretation is performed within a hierarchical interaction model which describes an image by a parse graph, thereby fusing together object detection, layout estimation and scene classification. At the root of the parse graph is the scene type and layout while the leaves are the individual detections of objects. In between is the core of the system, our 3D Geometric Phrases (3DGP). We conduct extensive experimental evaluations on single image 3D scene understanding using both 2D and 3D metrics. The results demonstrate that our model with 3DGPs can provide robust estimation of scene type, 3D space, and 3D objects by leveraging the contextual relationships among the visual elements.

Citation

Pages: 204-220

Page count: 16
