High Level 3D Structure Extraction from a Single Image Using a CNN-Based Approach

被引:7
作者
de Jesus Osuna-Coutino, J. A. [1 ]
Martinez-Carranza, Jose [1 ,2 ]
机构
[1] Inst Nacl Astrofis Opt & Electr, Dept Comp Sci, Puebla 72840, Mexico
[2] Univ Bristol, Dept Comp Sci, Bristol BS8 1TH, Avon, England
关键词
high level 3D structure extraction; depth data analysis; CNN; single image; 3D vision; MOTION;
D O I
10.3390/s19030563
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
High-Level Structure (HLS) extraction in a set of images consists of recognizing 3D elements with useful information to the user or application. There are several approaches to HLS extraction. However, most of these approaches are based on processing two or more images captured from different camera views or on processing 3D data in the form of point clouds extracted from the camera images. In contrast and motivated by the extensive work developed for the problem of depth estimation in a single image, where parallax constraints are not required, in this work, we propose a novel methodology towards HLS extraction from a single image with promising results. For that, our method has four steps. First, we use a CNN to predict the depth for a single image. Second, we propose a region-wise analysis to refine depth estimates. Third, we introduce a graph analysis to segment the depth in semantic orientations aiming at identifying potential HLS. Finally, the depth sections are provided to a new CNN architecture that predicts HLS in the shape of cubes and rectangular parallelepipeds.
引用
收藏
页数:18
相关论文
共 39 条
[1]   Depth from a Motion Algorithm and a Hardware Architecture for Smart Cameras [J].
Aguilar-Gonzalez, Abiel ;
Arias-Estrada, Miguel ;
Berry, Francois .
SENSORS, 2019, 19 (01)
[2]  
[Anonymous], 1998, MARKOV CHAINS
[3]  
[Anonymous], 2015, P IEEE C COMPUTER VI, DOI 10.1109/CVPR.2015.7298801
[4]  
[Anonymous], 2001, International archives of photogrammetry remote sensing and spatial information sciences, DOI DOI 10.5194/ISPRSARCHIVES-XL-5-W2-207-2013
[5]   Constrained structure and motion from multiple uncalibrated views of a piecewise planar scene [J].
Bartoli, A ;
Sturm, P .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2003, 52 (01) :45-64
[6]  
Bazarra M., 1990, LINEAR PROGRAMMING N
[7]  
Cherian A, 2009, IEEE INT CONF ROBOT, P519
[8]  
Dani A, 2013, IEEE INT C INT ROBOT, P602, DOI 10.1109/IROS.2013.6696413
[9]  
Osuna-Coutino JAD, 2016, ADJUNCT PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR-ADJUNCT), P135, DOI [10.1109/ISMAR-Adjunct.2016.53, 10.1109/ISMAR-Adjunct.2016.0060]
[10]   Structured Prediction of Unobserved Voxels From a Single Depth Image [J].
Firman, Michael ;
Mac Aodha, Oisin ;
Julier, Simon ;
Brostow, Gabriel J. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5431-5440