DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes

Cited by: 91
Authors
Dasgupta, Saumitro [1]
Fang, Kuan [1]
Chen, Kevin [1]
Savarese, Silvio [1]
Affiliation
[1] Stanford Univ, Stanford, CA 94305 USA
Source
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016
Keywords
DOI
10.1109/CVPR.2016.73
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
We consider the problem of estimating the spatial layout of an indoor scene from a monocular RGB image, modeled as the projection of a 3D cuboid. Existing solutions to this problem often rely strongly on hand-engineered features and vanishing point detection, which are prone to failure in the presence of clutter. In this paper, we present a method that uses a fully convolutional neural network (FCNN) in conjunction with a novel optimization framework for generating layout estimates. We demonstrate that our method is robust in the presence of clutter and handles a wide range of highly challenging scenes. We evaluate our method on two standard benchmarks and show that it achieves state-of-the-art results, outperforming previous methods by a wide margin.
Pages: 616-624
Number of pages: 9
Related papers
23 in total
  • [1] Chao YW, 2013, LECT NOTES COMPUT SC, V8157, P489, DOI 10.1007/978-3-642-41184-7_50
  • [2] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [3] COUGHLAN JM, 2000, NIPS, P845, DOI 10.5555/3008751.3008869
  • [4] Understanding Bayesian rooms using composite 3D object models
    Del Pero, Luca
    Bowdish, Joshua
    Kermgard, Bonnie
    Hartley, Emily
    Barnard, Kobus
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 153 - 160
  • [5] Del Pero L, 2012, PROC CVPR IEEE, P2719, DOI 10.1109/CVPR.2012.6247994
  • [6] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [7] Gupta Abhinav, 2010, NIPS, P3
  • [8] Recovering the Spatial Layout of Cluttered Rooms
    Hedau, Varsha
    Hoiem, Derek
    Forsyth, David
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 1849 - 1856
  • [9] Hödlmoser M, 2013, LECT NOTES COMPUT SC, V7887, P41
  • [10] JIA Y, 2014, P 22 ACM INT C MULT, DOI 10.1145/2647868.2654889