Large-scale, real-time 3D scene reconstruction on a mobile device

Cited by: 12

Authors:

Dryanovski, Ivan [1]

Klingensmith, Matthew [2]

Srinivasa, Siddhartha S. [2]

Xiao, Jizhong [3]

Affiliations:

[1] CUNY, Grad Ctr, Dept Comp Sci, 365 Fifth Ave, New York, NY 10016 USA

[2] Carnegie Mellon Robot Inst, 5000 Forbes Ave, Pittsburgh, PA 15213 USA

[3] CUNY City Coll, Dept Elect Engn, 160 Convent Ave, New York, NY 10031 USA

Source:

AUTONOMOUS ROBOTS | 2017 / Vol. 41 / Issue 06

Keywords:

3D reconstruction; Mobile technology; SLAM; Computer vision; Mapping; Pose estimation;

DOI:

10.1007/s10514-017-9624-2

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Google's Project Tango has made integrated depth sensing and onboard visual-inertial odometry available to mobile devices such as phones and tablets. In this work, we explore the problem of large-scale, real-time 3D reconstruction on mobile devices of this type. Solving this problem is a necessary prerequisite for many indoor applications, including navigation, augmented reality and building scanning. The main challenges include dealing with noisy and low-frequency depth data and managing limited computational and memory resources. State-of-the-art approaches in large-scale dense reconstruction require large amounts of memory and high-performance GPU computing. Other existing 3D reconstruction approaches on mobile devices either only build a sparse reconstruction, offload their computation to other devices, or require long post-processing to extract the geometric mesh. In contrast, we can reconstruct and render a global mesh on the fly, using only the mobile device's CPU, in very large (300 m²) scenes, at a resolution of 2-3 cm. To achieve this, we divide the scene into spatial volumes indexed by a hash map. Each volume contains the truncated signed distance function for that area of space, as well as the mesh segment derived from the distance function. This approach allows us to focus computational and memory resources only in areas of the scene which are currently observed, as well as leverage parallelization techniques for multi-core processing. Furthermore, we describe an on-device post-processing method for fusing datasets from multiple, independent trials, in order to improve the quality and coverage of the reconstruction. We discuss how the particularities of the devices impact our algorithm and implementation decisions. Finally, we provide both qualitative and quantitative results on publicly available RGB-D datasets, and on datasets collected in real-time from two devices.
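As an illustration of the data structure the abstract describes (spatial volumes indexed by a hash map, each holding a truncated signed distance function), here is a minimal Python sketch. It is not the authors' implementation: the names `ChunkedTSDF`, `Chunk`, `locate`, and `integrate`, the constants `VOXEL_SIZE`, `CHUNK_DIM`, and `TRUNCATION`, and the weighted-running-average update are all assumptions chosen for illustration. The per-volume mesh segment mentioned in the abstract is omitted.

```python
import math
from dataclasses import dataclass, field

VOXEL_SIZE = 0.03   # metres; assumed, within the 2-3 cm range quoted in the abstract
CHUNK_DIM = 16      # voxels per chunk edge; assumed
TRUNCATION = 0.1    # truncation distance in metres; assumed


@dataclass
class Chunk:
    """One fixed-size volume: truncated signed distances plus fusion weights."""
    sdf: list = field(default_factory=lambda: [TRUNCATION] * CHUNK_DIM ** 3)
    weight: list = field(default_factory=lambda: [0.0] * CHUNK_DIM ** 3)


class ChunkedTSDF:
    """Hypothetical container: a dict acts as the hash map over chunk coordinates."""

    def __init__(self):
        self.chunks = {}  # integer chunk coordinate (x, y, z) -> Chunk

    @staticmethod
    def locate(point):
        """Return (chunk key, flat voxel index) for a point given in metres."""
        voxel = [math.floor(c / VOXEL_SIZE) for c in point]
        key = tuple(v // CHUNK_DIM for v in voxel)
        lx, ly, lz = (v % CHUNK_DIM for v in voxel)
        return key, (lz * CHUNK_DIM + ly) * CHUNK_DIM + lx

    def integrate(self, point, signed_distance, obs_weight=1.0):
        """Fuse one signed-distance observation using a weighted running average."""
        key, idx = self.locate(point)
        # Memory is allocated only for chunks that are actually observed.
        chunk = self.chunks.setdefault(key, Chunk())
        d = max(-TRUNCATION, min(TRUNCATION, signed_distance))
        w = chunk.weight[idx]
        chunk.sdf[idx] = (chunk.sdf[idx] * w + d * obs_weight) / (w + obs_weight)
        chunk.weight[idx] = w + obs_weight
        return chunk.sdf[idx]
```

Because chunks are created lazily on first observation, memory in this sketch grows with the observed part of the scene and not with its bounding volume, which is the property the abstract highlights.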


Pages: 1423-1445

Page count: 23
