Visual Localization Through Virtual Views

Cited by: 0

Authors:

Song, Zhenbo [1,2]

Sun, Xi [2]

Xue, Zhou [2]

Xie, Dong [1]

We, Chao [2]

Affiliations:

[1] Nanjing Univ Sci & Technol, Nanjing, Peoples R China

[2] ByteDance, Beijing, Peoples R China

Source:

ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III | 2022 / Vol. 13606

Keywords:

Visual localization; Pose estimation; View synthesis; Structure from motion

DOI:

10.1007/978-3-031-20503-3_52

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper addresses the problem of camera localization, i.e., 6-DoF pose estimation, with respect to a given 3D reconstruction. Current methods often use a coarse-to-fine image registration framework that integrates image retrieval and visual keypoint matching. However, localization accuracy is restricted by the limited invariance of feature descriptors. For example, when the query image is acquired under illumination (day/night) that differs from that of the model images, or from a position not covered by the model images, retrieval and feature matching may fail, leading to incorrect pose estimates. In this paper, we propose to increase the diversity of model images, namely new viewpoints and new visual appearances, by synthesizing novel images with neural rendering methods. Specifically, we build the 3D model on Neural Radiance Fields (NeRF) and use appearance embeddings to encode illumination variation. We then propose an efficient strategy to interpolate appearance embeddings and place virtual cameras in the scene to generate virtual model images. To facilitate model image management, the appearance embeddings are associated with image acquisition conditions, such as time of day, season, and weather. The query image pose is estimated from virtual views with similar conditions using the conventional hierarchical localization framework. We demonstrate the approach by localizing single smartphone images in a large-scale 3D urban model, showing improved pose estimation accuracy.
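The abstract mentions interpolating appearance embeddings to obtain new visual appearances but does not say how the interpolation is done. As a minimal sketch only, the snippet below assumes plain linear interpolation between two embedding vectors. The function names, the 4-dimensional vectors, and the "day"/"night" labels are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def lerp_embedding(a: Sequence[float], b: Sequence[float], t: float) -> List[float]:
    """Linearly blend two appearance embeddings; t=0 gives a, t=1 gives b."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie in [0, 1]")
    return [(1.0 - t) * x + t * y for x, y in zip(a, b)]


def interpolate_appearances(
    a: Sequence[float], b: Sequence[float], steps: int
) -> List[List[float]]:
    """Return `steps` evenly spaced embeddings from a to b, endpoints included."""
    if steps < 2:
        raise ValueError("steps must be at least 2")
    return [lerp_embedding(a, b, i / (steps - 1)) for i in range(steps)]


# Hypothetical 4-D embeddings standing in for a "day" and a "night" condition.
# Linear interpolation is an assumption here, not the paper's stated strategy.
day = [1.0, 0.0, 0.5, 2.0]
night = [0.0, 1.0, 0.5, -2.0]
virtual = interpolate_appearances(day, night, steps=5)
```

In a NeRF-style renderer conditioned on an appearance embedding, each interpolated vector could in principle be paired with a virtual camera pose to render one virtual model image. That pairing is outside the scope of this sketch.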

Citation

Pages: 582-587

Number of pages: 6

17 references in total

[1] NetVLAD: CNN architecture for weakly supervised place recognition.

Arandjelovic, Relja; Gronat, Petr; Torii, Akihiko; Pajdla, Tomas; Sivic, Josef.

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016: 5297-5307

[2] Deng KL, 2024, Arxiv, DOI arXiv:2107.02791

[3] SuperPoint: Self-Supervised Interest Point Detection and Description.

DeTone, Daniel; Malisiewicz, Tomasz; Rabinovich, Andrew.

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018: 337-349

[4] Random Sample Consensus - A Paradigm for Model-Fitting with Applications to Image-Analysis and Automated Cartography.

Fischler, MA; Bolles, RC.

COMMUNICATIONS OF THE ACM, 1981, 24(6): 381-395

[5] Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition.

Hausler, Stephen; Garg, Sourav; Xu, Ming; Milford, Michael; Fischer, Tobias.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 14136-14147

[6] An Analytic Solution for the Perspective 4-Point Problem.

Horaud, R; Conio, B; Leboulleux, O; Lacolle, B.

COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1989, 47(1): 33-44

[7] Irschara A, 2009, PROC CVPR IEEE, P2591, DOI 10.1109/CVPRW.2009.5206587

[8] Efficient Global 2D-3D Matching for Camera Localization in a Large-Scale 3D Map.

Liu, Liu; Li, Hongdong; Dai, Yuchao.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 2391-2400

[9] ASLFeat: Learning Local Features of Accurate Shape and Localization.

Luo, Zixin; Zhou, Lei; Bai, Xuyang; Chen, Hongkai; Zhang, Jiahui; Yao, Yao; Li, Shiwei; Fang, Tian; Quan, Long.

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020: 6588-6597

[10] NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections.

Martin-Brualla, Ricardo; Radwan, Noha; Sajjadi, Mehdi S. M.; Barron, Jonathan T.; Dosovitskiy, Alexey; Duckworth, Daniel.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 7206-7215
