Efficient Three-Dimensional Scene Modeling and Mosaicing

Cited by: 51

Authors:

Nicosevici, Tudor [1]

Gracias, Nuno [1]

Negahdaripour, Shahriar [2]

Garcia, Rafael [1]

Affiliations:

[1] Univ Girona, Underwater Vis Lab, Girona 17071, Spain

[2] Univ Miami, Dept Elect & Comp Engn, Coral Gables, FL 33124 USA

Source:

JOURNAL OF FIELD ROBOTICS | 2009 / Vol. 26 / No. 10

Keywords:

IMAGE; WATERSHEDS; ALGORITHM; VIDEO; VIEW;

DOI:

10.1002/rob.20305

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification code:

080202 ; 1405 ;

Abstract:

Scene modeling has a key role in applications ranging from visual mapping to augmented reality. This paper presents an end-to-end solution for creating accurate three-dimensional (3D) textured models using monocular video sequences. The methods are developed within the framework of sequential structure from motion, in which a 3D model of the environment is maintained and updated as new visual information becomes available. The proposed approach contains contributions at different levels. The camera pose is recovered by directly associating the 3D scene model with local image observations, using a dual-registration approach. Compared to the standard structure from motion techniques, this approach decreases the error accumulation while increasing the robustness to scene occlusions and feature association failures, and it allows 3D reconstructions for any type of scene. Motivated by the need to map large areas, a novel 3D vertex selection mechanism is proposed, which takes into account the geometry of the scene. Vertices are selected not only to have high reconstruction accuracy but also to be representative of the local shape of the scene. This results in a reduction in the complexity of the final 3D model, with minimal loss of precision. As a final step, a composite visual map of the scene (mosaic) is generated. We present a method for blending image textures using 3D geometric information and photometric differences between registered textures. The method allows high-quality mosaicing over 3D surfaces by reducing the effects of the distortions induced by camera viewpoint and illumination changes. The results are presented for four scene modeling scenarios, including a comparison with ground truth under a realistic scenario and a challenging underwater data set. Although developed primarily for underwater mapping applications, the methods are general and applicable to other domains, such as aerial and land-based mapping. © 2009 Wiley Periodicals, Inc.

Pages: 759-788

Page count: 30

