Indoor Scene Understanding with Geometric and Semantic Contexts

被引:31
|
作者
Choi, Wongun [1 ]
Chao, Yu-Wei [2 ]
Pantofaru, Caroline [3 ]
Savarese, Silvio [4 ]
机构
[1] NEC Labs Amer, Cupertino, CA 95014 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
[3] Google Inc, Mountain View, CA USA
[4] Stanford Univ, Stanford, CA 94305 USA
关键词
Scene understanding; Scene parsing; Object recognition; 3D layout;
D O I
10.1007/s11263-014-0779-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Truly understanding a scene involves integrating information at multiple levels as well as studying the interactions between scene elements. Individual object detectors, layout estimators and scene classifiers are powerful but ultimately confounded by complicated real-world scenes with high variability, different viewpoints and occlusions. We propose a method that can automatically learn the interactions among scene elements and apply them to the holistic understanding of indoor scenes from a single image. This interpretation is performed within a hierarchical interaction model which describes an image by a parse graph, thereby fusing together object detection, layout estimation and scene classification. At the root of the parse graph is the scene type and layout while the leaves are the individual detections of objects. In between is the core of the system, our 3D Geometric Phrases (3DGP). We conduct extensive experimental evaluations on single image 3D scene understanding using both 2D and 3D metrics. The results demonstrate that our model with 3DGPs can provide robust estimation of scene type, 3D space, and 3D objects by leveraging the contextual relationships among the visual elements.
引用
收藏
页码:204 / 220
页数:17
相关论文
共 50 条
  • [41] A Scene Image is Nonmutually Exclusive-A Fuzzy Qualitative Scene Understanding
    Lim, Chern Hong
    Risnumawan, Anhar
    Chan, Chee Seng
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (06) : 1541 - 1556
  • [42] Structured Generative Models for Scene Understanding
    Williams, Christopher K. I.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 2845 - 2867
  • [43] Indoor Scene Recognition based on Weighted Voting Schemes
    Hernandez, Alejandra C.
    Gomez, Clara
    Derner, Erik
    Barber, Ramon
    2019 EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR), 2019,
  • [44] SUM: Sequential Scene Understanding and Manipulation
    Sui, Zhiqiang
    Zhou, Zheming
    Zeng, Zhen
    Jenkins, Odest Chadwicke
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3281 - 3288
  • [45] Integrating Geometrical Context for Semantic Labeling of Indoor Scenes using RGBD Images
    Khan, Salman H.
    Bennamoun, Mohammed
    Sohel, Ferdous
    Togneri, Roberto
    Naseem, Imran
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 117 (01) : 1 - 20
  • [46] NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding
    Zhai, Hongjia
    Huang, Gan
    Hu, Qirui
    Li, Guanglin
    Bao, Hujun
    Zhang, Guofeng
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (11) : 7129 - 7139
  • [47] Building semantic scene models from unconstrained video
    Dee, Hannah M.
    Cohn, Anthony G.
    Hogg, David C.
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2012, 116 (03) : 446 - 456
  • [48] Semantic object processing is modulated by prior scene context
    Krugliak, Alexandra
    Draschkow, Dejan
    Vo, Melissa L. -H.
    Clarke, Alex
    LANGUAGE COGNITION AND NEUROSCIENCE, 2024, 39 (08) : 962 - 971
  • [49] 3D Semantic Scene Completion: A Survey
    Luis Roldão
    Raoul de Charette
    Anne Verroust-Blondet
    International Journal of Computer Vision, 2022, 130 : 1978 - 2005
  • [50] Learnable scene prior for point cloud semantic segmentation
    Chai, Yuanhao
    Gong, Jingyu
    Tan, Xin
    Xu, Jiachen
    Xie, Yuan
    Ma, Lizhuang
    VISUAL COMPUTER, 2025, 41 (01): : 517 - 534