Reconstructing Superquadrics from Intensity and Color Images

被引：1

作者：

Tomasevic, Darian ^{[1
]}

Peer, Peter ^{[1
]}

Solina, Franc ^{[1
]}

Jaklic, Ales ^{[1
]}

Struc, Vitomir ^{[2
]}

机构：

[1] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana 1000, Slovenia

[2] Univ Ljubljana, Fac Elect Engn, Ljubljana 1000, Slovenia

来源：

SENSORS | 2022年 / 22卷 / 14期

关键词：

superquadrics; reconstruction; color images; deep learning; convolutional neural networks; MODELS;

D O I：

10.3390/s22145332

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

The task of reconstructing 3D scenes based on visual data represents a longstanding problem in computer vision. Common reconstruction approaches rely on the use of multiple volumetric primitives to describe complex objects. Superquadrics (a class of volumetric primitives) have shown great promise due to their ability to describe various shapes with only a few parameters. Recent research has shown that deep learning methods can be used to accurately reconstruct random superquadrics from both 3D point cloud data and simple depth images. In this paper, we extended these reconstruction methods to intensity and color images. Specifically, we used a dedicated convolutional neural network (CNN) model to reconstruct a single superquadric from the given input image. We analyzed the results in a qualitative and quantitative manner, by visualizing reconstructed superquadrics as well as observing error and accuracy distributions of predictions. We showed that a CNN model designed around a simple ResNet backbone can be used to accurately reconstruct superquadrics from images containing one object, but only if one of the spatial parameters is fixed or if it can be determined from other image characteristics, e.g., shadows. Furthermore, we experimented with images of increasing complexity, for example, by adding textures, and observed that the results degraded only slightly. In addition, we show that our model outperforms the current state-of-the-art method on the studied task. Our final result is a highly accurate superquadric reconstruction model, which can also reconstruct superquadrics from real images of simple objects, without additional training.

引用

页数：26

共 50 条

[1] The Episolar Constraint: Monocular Shape from Shadow Correspondence
Abrams, Austin
Miskell, Kylia
Pless, Robert
[J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1407 - 1414
[2] Barr A. H., 1981, IEEE Computer Graphics and Applications, V1, P11, DOI 10.1109/MCG.1981.1673799
[3] Bengio Yoshua, 2012, Neural Networks: Tricks of the Trade. Second Edition: LNCS 7700, P437, DOI 10.1007/978-3-642-35289-8_26
[4] Boult T. E., 1988, Proceedings of the SPIE - The International Society for Optical Engineering, V848, P358, DOI 10.1117/12.942759
[5] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6] DARBOUX FRAMES, SNAKES, AND SUPER-QUADRICS - GEOMETRY FROM THE BOTTOM UP
FERRIE, FP
LAGARDE, J
WHAITE, P
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (08) : 771 - 784
[7] Gross A. D., 1988, Second International Conference on Computer Vision (IEEE Cat. No.88CH2664-1), P690, DOI 10.1109/CCV.1988.590052
[8] Volumetric Representation of Semantically Segmented Human Body Parts Using Superquadrics
Hachiuma, Ryo
Saito, Hideo
[J]. VIRTUAL REALITY AND AUGMENTED REALITY, EUROVR 2019, 2019, 11883 : 52 - 61
[9] HYPERQUADRICS - SMOOTHLY DEFORMABLE SHAPES WITH CONVEX POLYHEDRAL BOUNDS
HANSON, AJ
[J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1988, 44 (02): : 191 - 210
[10] Geometry-Based Grasping Pipeline for Bi-Modal Pick and Place
Haschke, Robert
Walck, Guillaume
Ritter, Helge
[J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4002 - 4008

← 1 2 3 4 5 →