VirtualLoc: Large-scale Visual Localization Using Virtual Images

Cited by: 1

Authors:

Xiong, Yuan [1]

Wang, Jingru [1]

Zhou, Zhong [1, 2, 3]

Affiliations:

[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] State Key Lab Virtual Real Technol & Syst, 37 Xueyuan Rd, Beijing 100191, Peoples R China

[3] Zhongguancun Lab, 37 Xueyuan Rd, Beijing 100191, Peoples R China

Source:

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2024, Vol. 20, No. 3

Keywords:

Visual localization; virtual reality; image retrieval; rendering

DOI:

10.1145/3622788

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Robust and accurate camera pose estimation is fundamental in computer vision. Learning-based regression approaches acquire six-degree-of-freedom camera parameters accurately from visual cues of an input image. However, most are trained on street-view and landmark datasets, and they generalize poorly to overlooking use cases, such as the calibration of surveillance cameras and unmanned aerial vehicles. Besides, reference images captured from the real world are rare and expensive, and their diversity is not guaranteed. In this article, we address the problem of using alternative virtual images for visual localization training. This work makes the following principal contributions: First, we present a new challenging localization dataset containing six reconstructed large-scale three-dimensional scenes, 10,594 calibrated photographs with condition changes, and 300k virtual images with pixelwise labeled depth, relative surface normal, and semantic segmentation. Second, we present a flexible multi-feature fusion network trained on virtual image datasets for robust image retrieval. Third, we propose an end-to-end confidence map prediction network for feature filtering and pose estimation. We demonstrate that large-scale rendered virtual images are beneficial to visual localization. Using virtual images can solve the diversity problem of real images and leverage labeled multi-feature data for deep learning. Experimental results show that our method achieves remarkable performance, surpassing state-of-the-art approaches. To foster research on improving visual localization using synthetic images, we release our benchmark at https://github.com/YuanXiong/contributions.


Pages: 19
