Recovering 3D Planes from a Single Image via Convolutional Neural Networks

Cited by: 59
Authors
Yang, Fengting [1]
Zhou, Zihan [1]
Affiliations
[1] Penn State Univ, University Pk, PA 16802 USA
Source
COMPUTER VISION - ECCV 2018, PT X | 2018 / Vol. 11214
Keywords
3D reconstruction; Plane segmentation; Deep learning
DOI
10.1007/978-3-030-01249-6_6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we study the problem of recovering 3D planar surfaces from a single image of a man-made environment. We show that it is possible to directly train a deep neural network to achieve this goal. A novel plane structure-induced loss is proposed to train the network to simultaneously predict a plane segmentation map and the parameters of the 3D planes. Further, to avoid the tedious manual labeling process, we show how to leverage an existing large-scale RGB-D dataset to train our network without explicit 3D plane annotations, and how to take advantage of the semantic labels that come with the dataset for accurate planar and non-planar classification. Experimental results demonstrate that our method significantly outperforms existing methods, both qualitatively and quantitatively. The recovered planes could potentially benefit many important visual tasks such as vision-based navigation and human-robot interaction.
Pages: 87-103
Number of pages: 17
Related papers
34 in total
  • [1] [Anonymous], 2000, Multiple View Geometry in Computer Vision
  • [2] Contour Detection and Hierarchical Image Segmentation
    Arbelaez, Pablo
    Maire, Michael
    Fowlkes, Charless
    Malik, Jitendra
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) : 898 - 916
  • [3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [4] Barinova O, 2008, LECT NOTES COMPUT SC, V5303, P100, DOI 10.1007/978-3-540-88688-4_8
  • [5] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [6] DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes
    Dasgupta, Saumitro
    Fang, Kuan
    Chen, Kevin
    Savarese, Silvio
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 616 - 624
  • [7] Delage E., 2005, ROBOTICS RES ISRR, V28, P305, DOI 10.1007/978-3-540-48113-3_28
  • [8] Eigen D, 2014, ADV NEUR IN, V27
  • [9] Data-Driven 3D Primitives for Single Image Understanding
    Fouhey, David F.
    Gupta, Abhinav
    Hebert, Martial
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3392 - 3399
  • [10] Fouhey DF, 2014, LECT NOTES COMPUT SC, V8694, P687, DOI 10.1007/978-3-319-10599-4_44