A Coarse-to-Fine Indoor Layout Estimation (CFILE) Method

Cited by: 23

Authors:

Ren, Yuzhuo ^{[1]}

Li, Shangwen ^{[1]}

Chen, Chen ^{[1]}

Kuo, C.-C. Jay ^{[1]}

Affiliations:

[1] Univ Southern Calif, Los Angeles, CA 90089 USA

Source:

COMPUTER VISION - ACCV 2016, PT V | 2017 / Vol. 10115

Keywords:

SCENES;

DOI:

10.1007/978-3-319-54193-8_3

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This work addresses the task of estimating the spatial layout of cluttered indoor scenes from a single RGB image. Existing solutions to this problem largely rely on hand-crafted features and vanishing lines, and they often fail in highly cluttered indoor scenes. The proposed coarse-to-fine indoor layout estimation (CFILE) method consists of two stages: (1) coarse layout estimation; and (2) fine layout localization. In the first stage, we adopt a fully convolutional neural network (FCN) to obtain a coarse-scale room layout estimate that is close to the ground truth globally. The proposed FCN combines the layout contour property and the surface property so as to provide a robust estimate in the presence of cluttered objects. In the second stage, we formulate an optimization framework that enforces several constraints, such as layout contour straightness, surface smoothness and geometric constraints, for layout detail refinement. The proposed system achieves state-of-the-art performance on two commonly used benchmark datasets.
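
The abstract does not spell out the optimization used in the second stage. As a rough illustration of what a "contour straightness" constraint could look like, the sketch below fits a single straight line to the confident pixels of a coarse boundary map by weighted total least squares. The function name `fit_straight_boundary`, the threshold, and the toy probability map are assumptions made for illustration; this is not the CFILE formulation.

```python
import math

def fit_straight_boundary(prob_map, threshold=0.5):
    """Fit one straight line to the confident pixels of a coarse boundary map.

    prob_map: list of rows of floats in [0, 1] (a hypothetical FCN output).
    Returns (cx, cy, angle): a point on the line and its direction in radians.
    """
    pts = []
    total = sx = sy = 0.0
    for y, row in enumerate(prob_map):
        for x, p in enumerate(row):
            if p >= threshold:
                pts.append((x, y, p))
                total += p
                sx += p * x
                sy += p * y
    if len(pts) < 2:
        raise ValueError("need at least two confident pixels to fit a line")

    # Probability-weighted centroid: the fitted line passes through it.
    cx, cy = sx / total, sy / total

    # Weighted second moments of the pixel cloud around the centroid.
    sxx = sum(p * (x - cx) ** 2 for x, y, p in pts)
    syy = sum(p * (y - cy) ** 2 for x, y, p in pts)
    sxy = sum(p * (x - cx) * (y - cy) for x, y, p in pts)

    # Orientation of the principal axis of the 2x2 scatter matrix.
    angle = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    return cx, cy, angle

# Toy 5x5 coarse map (assumed data) with a ridge along the main diagonal.
coarse = [
    [0.9, 0.2, 0.0, 0.0, 0.0],
    [0.1, 0.8, 0.1, 0.0, 0.0],
    [0.0, 0.1, 0.9, 0.2, 0.0],
    [0.0, 0.0, 0.2, 0.7, 0.1],
    [0.0, 0.0, 0.0, 0.1, 0.9],
]
cx, cy, angle = fit_straight_boundary(coarse)
```

Total least squares is used here instead of ordinary regression so that near-vertical boundaries, which are common for wall edges, are handled the same way as near-horizontal ones.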


Pages: 36-51

Page count: 16

