Indoor Scene Understanding with Geometric and Semantic Contexts

被引：31

作者：

Choi, Wongun ^{[1
]}

Chao, Yu-Wei ^{[2
]}

Pantofaru, Caroline ^{[3
]}

Savarese, Silvio ^{[4
]}

机构：

[1] NEC Labs Amer, Cupertino, CA 95014 USA

[2] Univ Michigan, Ann Arbor, MI 48109 USA

[3] Google Inc, Mountain View, CA USA

[4] Stanford Univ, Stanford, CA 94305 USA

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2015年 / 112卷 / 02期

关键词：

Scene understanding; Scene parsing; Object recognition; 3D layout;

D O I：

10.1007/s11263-014-0779-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Truly understanding a scene involves integrating information at multiple levels as well as studying the interactions between scene elements. Individual object detectors, layout estimators and scene classifiers are powerful but ultimately confounded by complicated real-world scenes with high variability, different viewpoints and occlusions. We propose a method that can automatically learn the interactions among scene elements and apply them to the holistic understanding of indoor scenes from a single image. This interpretation is performed within a hierarchical interaction model which describes an image by a parse graph, thereby fusing together object detection, layout estimation and scene classification. At the root of the parse graph is the scene type and layout while the leaves are the individual detections of objects. In between is the core of the system, our 3D Geometric Phrases (3DGP). We conduct extensive experimental evaluations on single image 3D scene understanding using both 2D and 3D metrics. The results demonstrate that our model with 3DGPs can provide robust estimation of scene type, 3D space, and 3D objects by leveraging the contextual relationships among the visual elements.

引用

页码：204 / 220

页数：17

共 50 条

[41] A Scene Image is Nonmutually Exclusive-A Fuzzy Qualitative Scene Understanding
Lim, Chern Hong
Risnumawan, Anhar
Chan, Chee Seng
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (06) : 1541 - 1556
[42] Structured Generative Models for Scene Understanding
Williams, Christopher K. I.
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 2845 - 2867
[43] Indoor Scene Recognition based on Weighted Voting Schemes
Hernandez, Alejandra C.
Gomez, Clara
Derner, Erik
Barber, Ramon
2019 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR), 2019,
[44] SUM: Sequential Scene Understanding and Manipulation
Sui, Zhiqiang
Zhou, Zheming
Zeng, Zhen
Jenkins, Odest Chadwicke
2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3281 - 3288
[45] Integrating Geometrical Context for Semantic Labeling of Indoor Scenes using RGBD Images
Khan, Salman H.
Bennamoun, Mohammed
Sohel, Ferdous
Togneri, Roberto
Naseem, Imran
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 117 (01) : 1 - 20
[46] NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding
Zhai, Hongjia
Huang, Gan
Hu, Qirui
Li, Guanglin
Bao, Hujun
Zhang, Guofeng
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (11) : 7129 - 7139
[47] Building semantic scene models from unconstrained video
Dee, Hannah M.
Cohn, Anthony G.
Hogg, David C.
COMPUTER VISION AND IMAGE UNDERSTANDING, 2012, 116 (03) : 446 - 456
[48] Semantic object processing is modulated by prior scene context
Krugliak, Alexandra
Draschkow, Dejan
Vo, Melissa L. -H.
Clarke, Alex
LANGUAGE COGNITION AND NEUROSCIENCE, 2024, 39 (08) : 962 - 971
[49] 3D Semantic Scene Completion: A Survey
Luis Roldão
Raoul de Charette
Anne Verroust-Blondet
International Journal of Computer Vision, 2022, 130 : 1978 - 2005
[50] Learnable scene prior for point cloud semantic segmentation
Chai, Yuanhao
Gong, Jingyu
Tan, Xin
Xu, Jiachen
Xie, Yuan
Ma, Lizhuang
VISUAL COMPUTER, 2025, 41 (01): : 517 - 534

← 1 2 3 4 5 →