DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes

被引:99
作者
Dasgupta, Saumitro [1 ]
Fang, Kuan [1 ]
Chen, Kevin [1 ]
Savarese, Silvio [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
关键词
D O I
10.1109/CVPR.2016.73
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of estimating the spatial layout of an indoor scene from a monocular RGB image, modeled as the projection of a 3D cuboid. Existing solutions to this problem often rely strongly on hand-engineered features and vanishing point detection, which are prone to failure in the presence of clutter. In this paper, we present a method that uses a fully convolutional neural network (FCNN) in conjunction with a novel optimization framework for generating layout estimates. We demonstrate that our method is robust in the presence of clutter and handles a wide range of highly challenging scenes. We evaluate our method on two standard benchmarks and show that it achieves state of the art results, outperforming previous methods by a wide margin.
引用
收藏
页码:616 / 624
页数:9
相关论文
共 23 条
[1]  
Chao YW, 2013, LECT NOTES COMPUT SC, V8157, P489, DOI 10.1007/978-3-642-41184-7_50
[2]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[3]  
COUGHLAN JM, 2000, NIPS, P845, DOI DOI 10.5555/3008751.3008869
[4]   Understanding Bayesian rooms using composite 3D object models [J].
Del Pero, Luca ;
Bowdish, Joshua ;
Kermgard, Bonnie ;
Hartley, Emily ;
Barnard, Kobus .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :153-160
[5]  
Del Pero L, 2012, PROC CVPR IEEE, P2719, DOI 10.1109/CVPR.2012.6247994
[6]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[7]  
Gupta Abhinav, 2010, NIPS, P3
[8]   Recovering the Spatial Layout of Cluttered Rooms [J].
Hedau, Varsha ;
Hoiem, Derek ;
Forsyth, David .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :1849-1856
[9]  
Hödlmoser M, 2013, LECT NOTES COMPUT SC, V7887, P41
[10]  
JIA Y, 2014, P 22 ACM INT C MULT, DOI [DOI 10.1145/2647868.2654889, 10.1145/2647868.2654889]