A Coarse-to-Fine Indoor Layout Estimation (CFILE) Method

Cited by: 23
Authors
Ren, Yuzhuo [1]
Li, Shangwen [1]
Chen, Chen [1]
Kuo, C.-C. Jay [1]
Affiliations
[1] Univ Southern Calif, Los Angeles, CA 90089 USA
Source
COMPUTER VISION - ACCV 2016, PT V | 2017 / Vol. 10115
Keywords
SCENES;
DOI
10.1007/978-3-319-54193-8_3
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This work addresses the task of estimating the spatial layout of cluttered indoor scenes from a single RGB image. Existing solutions to this problem rely largely on hand-crafted features and vanishing lines, and they often fail in highly cluttered indoor scenes. The proposed coarse-to-fine indoor layout estimation (CFILE) method consists of two stages: (1) coarse layout estimation and (2) fine layout localization. In the first stage, we adopt a fully convolutional neural network (FCN) to obtain a coarse-scale room layout estimate that is globally close to the ground truth. The proposed FCN combines the layout contour property and the surface property to provide a robust estimate in the presence of cluttered objects. In the second stage, we formulate an optimization framework that enforces several constraints, such as layout contour straightness, surface smoothness, and geometric constraints, to refine the layout details. The proposed system achieves state-of-the-art performance on two commonly used benchmark datasets.
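The coarse-to-fine idea in the abstract can be illustrated with a toy sketch. This is not the authors' FCN or their optimization framework: it assumes a hypothetical coarse boundary score map (a list of rows), picks the strongest response in each column as the coarse contour, and then applies a single straightness constraint by fitting a weighted least-squares line. All function names and parameters here are illustrative assumptions.

```python
import random


def column_peaks(coarse_map):
    """Coarse step: return (x, y, score) for the strongest row in each column."""
    height, width = len(coarse_map), len(coarse_map[0])
    peaks = []
    for x in range(width):
        y = max(range(height), key=lambda r: coarse_map[r][x])
        peaks.append((x, y, coarse_map[y][x]))
    return peaks


def fit_line(peaks):
    """Fine step: weighted least-squares fit of y = slope * x + intercept.

    Each peak is weighted by its score, so columns where clutter hides the
    boundary (low scores) have little influence on the fitted line.
    """
    sw = sum(w for _, _, w in peaks)
    mx = sum(w * x for x, _, w in peaks) / sw
    my = sum(w * y for _, y, w in peaks) / sw
    sxx = sum(w * (x - mx) ** 2 for x, _, w in peaks)
    sxy = sum(w * (x - mx) * (y - my) for x, y, w in peaks)
    slope = sxy / sxx
    return slope, my - slope * mx


def make_toy_map(height=60, width=80, slope=0.5, intercept=10.0, seed=0):
    """Build a synthetic score map: weak noise plus one bright straight boundary."""
    rng = random.Random(seed)
    heat = [[rng.uniform(0.0, 0.2) for _ in range(width)] for _ in range(height)]
    for x in range(width):
        if 30 <= x < 35:
            continue  # simulate clutter occluding the boundary in these columns
        heat[int(round(slope * x + intercept))][x] = 1.0
    return heat


if __name__ == "__main__":
    slope, intercept = fit_line(column_peaks(make_toy_map()))
    print(f"refined boundary: y = {slope:.3f} * x + {intercept:.3f}")
```

The occluded columns yield essentially random peaks, but their low scores keep the fitted line close to the true boundary. A real system would replace the synthetic map with network output and add the surface and geometric terms that the abstract mentions.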
Pages: 36-51
Page count: 16