Recovering 3D Planes from a Single Image via Convolutional Neural Networks

被引：70

作者：

Yang, Fengting ^{[1
]}

Zhou, Zihan ^{[1
]}

机构：

[1] Penn State Univ, University Pk, PA 16802 USA

来源：

COMPUTER VISION - ECCV 2018, PT X | 2018年 / 11214卷

关键词：

3D reconstruction; Plane segmentation; Deep learning;

D O I：

10.1007/978-3-030-01249-6_6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we study the problem of recovering 3D planar surfaces from a single image of man-made environment. We show that it is possible to directly train a deep neural network to achieve this goal. A novel plane structure-induced loss is proposed to train the network to simultaneously predict a plane segmentation map and the parameters of the 3D planes. Further, to avoid the tedious manual labeling process, we show how to leverage existing large-scale RGB-D dataset to train our network without explicit 3D plane annotations, and how to take advantage of the semantic labels come with the dataset for accurate planar and non-planar classification. Experiment results demonstrate that our method significantly outperforms existing methods, both qualitatively and quantitatively. The recovered planes could potentially benefit many important visual tasks such as vision-based navigation and human-robot interaction.

引用

页码：87 / 103

页数：17

共 34 条

[1]

[Anonymous], 2000, Multiple View Geometry in Computer Vision

[2] Contour Detection and Hierarchical Image Segmentation [J].

Arbelaez, Pablo ;

Maire, Michael ;

Fowlkes, Charless ;

Malik, Jitendra .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) :898-916

[3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[4]

Barinova O, 2008, LECT NOTES COMPUT SC, V5303, P100, DOI 10.1007/978-3-540-88688-4_8

[5] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[6] DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes [J].

Dasgupta, Saumitro ;

Fang, Kuan ;

Chen, Kevin ;

Savarese, Silvio .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :616-624

[7]

Delage E., 2005, ROBOTICS RES ISRR, V28, P305, DOI [10.1007/978-3-540-48113-3, DOI 10.1007/978-3-540-48113-328]

[8]

Eigen D, 2014, ADV NEUR IN, V27

[9] Data-Driven 3D Primitives for Single Image Understanding [J].

Fouhey, David F. ;

Gupta, Abhinav ;

Hebert, Martial .

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :3392-3399

[10]

Fouhey DF, 2014, LECT NOTES COMPUT SC, V8694, P687, DOI 10.1007/978-3-319-10599-4_44

← 1 2 3 4 →