Indoor Scene Understanding with Geometric and Semantic Contexts

被引:31
|
作者
Choi, Wongun [1 ]
Chao, Yu-Wei [2 ]
Pantofaru, Caroline [3 ]
Savarese, Silvio [4 ]
机构
[1] NEC Labs Amer, Cupertino, CA 95014 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
[3] Google Inc, Mountain View, CA USA
[4] Stanford Univ, Stanford, CA 94305 USA
关键词
Scene understanding; Scene parsing; Object recognition; 3D layout;
D O I
10.1007/s11263-014-0779-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Truly understanding a scene involves integrating information at multiple levels as well as studying the interactions between scene elements. Individual object detectors, layout estimators and scene classifiers are powerful but ultimately confounded by complicated real-world scenes with high variability, different viewpoints and occlusions. We propose a method that can automatically learn the interactions among scene elements and apply them to the holistic understanding of indoor scenes from a single image. This interpretation is performed within a hierarchical interaction model which describes an image by a parse graph, thereby fusing together object detection, layout estimation and scene classification. At the root of the parse graph is the scene type and layout while the leaves are the individual detections of objects. In between is the core of the system, our 3D Geometric Phrases (3DGP). We conduct extensive experimental evaluations on single image 3D scene understanding using both 2D and 3D metrics. The results demonstrate that our model with 3DGPs can provide robust estimation of scene type, 3D space, and 3D objects by leveraging the contextual relationships among the visual elements.
引用
收藏
页码:204 / 220
页数:17
相关论文
共 50 条
  • [21] Learning Direct Optimization for scene understanding
    Romaszko, Lukasz
    Williams, Christopher K., I
    Winn, John
    PATTERN RECOGNITION, 2020, 105
  • [22] Indoor Scene Understanding by Fusing Multi-View RGB-D Image Frames
    Li X.
    Zhang B.
    Sun F.
    Liu J.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (06): : 1218 - 1226
  • [23] Scene Understanding - A Survey
    Aarthi, S.
    Chitrakala, S.
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND SIGNAL PROCESSING (ICCCSP), 2017, : 191 - 194
  • [24] Analysis and design framework for the development of indoor scene understanding assistive solutions for the person with visual impairment/blindness
    Valipoor, Moeen
    de Antonio, Angelica
    Cabrera, Julian
    MULTIMEDIA SYSTEMS, 2024, 30 (03)
  • [25] Scene Understanding and Semantic Mapping for Unmanned Ground Vehicles Using 3D Point Clouds
    Yan, Fei
    He, Guojian
    Zhuang, Yan
    Chang, Huan
    2018 8TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST 2018), 2018, : 341 - 347
  • [26] Scene understanding using natural language description based on 3D semantic graph map
    Jiyoun Moon
    Beomhee Lee
    Intelligent Service Robotics, 2018, 11 : 347 - 354
  • [27] Enhancing semantic segmentation for autonomous vehicle scene understanding in indian context using modified CANet model
    Khairnar, Smita
    Thepade, Sudeep D.
    Kolekar, Suresh
    Gite, Shilpa
    Pradhan, Biswajeet
    Alamri, Abdullah
    Patil, Bhagyesha
    Dahake, Shrutee
    Gaikwad, Radhika
    Chaudhari, Atharva
    METHODSX, 2025, 14
  • [28] Vision-Based Semantic Segmentation in Scene Understanding for Autonomous Driving: Recent Achievements, Challenges, and Outlooks
    Muhammad, Khan
    Hussain, Tanveer
    Ullah, Hayat
    Del Ser, Javier
    Rezaei, Mahdi
    Kumar, Neeraj
    Hijji, Mohammad
    Bellavista, Paolo
    de Albuquerque, Victor Hugo C.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 22694 - 22715
  • [29] Scene understanding using natural language description based on 3D semantic graph map
    Moon, Jiyoun
    Lee, Beomhee
    INTELLIGENT SERVICE ROBOTICS, 2018, 11 (04) : 347 - 354
  • [30] 3D Scene Reconstruction and Object Recognition for Indoor Scene
    Shen, Yangping
    Manabe, Yoshitsugu
    Yata, Noriko
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT) 2019, 2019, 11049