Recovering 3D Planes from a Single Image via Convolutional Neural Networks

被引:70
作者
Yang, Fengting [1 ]
Zhou, Zihan [1 ]
机构
[1] Penn State Univ, University Pk, PA 16802 USA
来源
COMPUTER VISION - ECCV 2018, PT X | 2018年 / 11214卷
关键词
3D reconstruction; Plane segmentation; Deep learning;
D O I
10.1007/978-3-030-01249-6_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study the problem of recovering 3D planar surfaces from a single image of man-made environment. We show that it is possible to directly train a deep neural network to achieve this goal. A novel plane structure-induced loss is proposed to train the network to simultaneously predict a plane segmentation map and the parameters of the 3D planes. Further, to avoid the tedious manual labeling process, we show how to leverage existing large-scale RGB-D dataset to train our network without explicit 3D plane annotations, and how to take advantage of the semantic labels come with the dataset for accurate planar and non-planar classification. Experiment results demonstrate that our method significantly outperforms existing methods, both qualitatively and quantitatively. The recovered planes could potentially benefit many important visual tasks such as vision-based navigation and human-robot interaction.
引用
收藏
页码:87 / 103
页数:17
相关论文
共 34 条
[1]  
[Anonymous], 2000, Multiple View Geometry in Computer Vision
[2]   Contour Detection and Hierarchical Image Segmentation [J].
Arbelaez, Pablo ;
Maire, Michael ;
Fowlkes, Charless ;
Malik, Jitendra .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) :898-916
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]  
Barinova O, 2008, LECT NOTES COMPUT SC, V5303, P100, DOI 10.1007/978-3-540-88688-4_8
[5]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[6]   DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes [J].
Dasgupta, Saumitro ;
Fang, Kuan ;
Chen, Kevin ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :616-624
[7]  
Delage E., 2005, ROBOTICS RES ISRR, V28, P305, DOI [10.1007/978-3-540-48113-3, DOI 10.1007/978-3-540-48113-328]
[8]  
Eigen D, 2014, ADV NEUR IN, V27
[9]   Data-Driven 3D Primitives for Single Image Understanding [J].
Fouhey, David F. ;
Gupta, Abhinav ;
Hebert, Martial .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :3392-3399
[10]  
Fouhey DF, 2014, LECT NOTES COMPUT SC, V8694, P687, DOI 10.1007/978-3-319-10599-4_44