VirtualLoc: Large-scale Visual Localization Using Virtual Images

Cited by: 1

Authors:

Xiong, Yuan [1]

Wang, Jingru [1]

Zhou, Zhong [1,2,3]

Affiliations:

[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] State Key Lab Virtual Real Technol & Syst, 37 Xueyuan Rd, Beijing 100191, Peoples R China

[3] Zhongguancun Lab, 37 Xueyuan Rd, Beijing 100191, Peoples R China

Source:

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2024, Vol. 20, No. 3

Keywords:

Visual localization; virtual reality; image retrieval; rendering;

DOI:

10.1145/3622788

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Robust and accurate camera pose estimation is fundamental in computer vision. Learning-based regression approaches acquire six-degree-of-freedom camera parameters accurately from visual cues of an input image. However, most are trained on street-view and landmark datasets. These approaches can hardly be generalized to overlooking use cases, such as the calibration of surveillance cameras and unmanned aerial vehicles. Besides, reference images captured from the real world are rare and expensive, and their diversity is not guaranteed. In this article, we address the problem of using alternative virtual images for visual localization training. This work has the following principal contributions: First, we present a new challenging localization dataset containing six reconstructed large-scale three-dimensional scenes, 10,594 calibrated photographs with condition changes, and 300k virtual images with pixelwise labeled depth, relative surface normal, and semantic segmentation. Second, we present a flexible multi-feature fusion network trained on virtual image datasets for robust image retrieval. Third, we propose an end-to-end confidence map prediction network for feature filtering and pose estimation. We demonstrate that large-scale rendered virtual images are beneficial to visual localization. Using virtual images can solve the diversity problem of real images and leverage labeled multi-feature data for deep learning. Experimental results show that our method achieves remarkable performance surpassing state-of-the-art approaches. To foster research on improvement for visual localization using synthetic images, we release our benchmark at https://github.com/YuanXiong/contributions.


Pages: 19

50 items in total

[21] Immersive human-computer interactive virtual environment using large-scale display system
Wang, Xiuhui
Yan, Ke
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 96 : 649 - 659
[22] VOIR: Virtual Reality Visualization Software for Large-Scale Simulations
Ohno, Nobuaki
Kageyama, Akira
PLASMA AND FUSION RESEARCH, 2024, 19 : 1401024 - 1401026
[23] Depth as attention to learn image representations for visual localization, using monocular images
Hettiarachchi, Dulmini
Tian, Ye
Yu, Han
Kamijo, Shunsuke
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
[24] Large-scale Visual Search and Similarity for E-Commerce
Anand, Gaurav
Wang, Siyun
Ni, Karl
APPLICATIONS OF MACHINE LEARNING 2021, 2021, 11843
[25] Large-scale machinery monitoring system based on the visual reality
Zhang, Yusi
Ruan, Jun
PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 863 - 867
[26] A Visual Positioning Method of UAV in a Large-Scale Outdoor Environment
Zhao, Chenhao
Wu, Dewei
He, Jing
Dai, Chuanjin
SENSORS, 2023, 23 (15)
[27] Distributed training of CosPlace for large-scale visual place recognition
Zaccone, Riccardo
Berton, Gabriele
Masone, Carlo
FRONTIERS IN ROBOTICS AND AI, 2024, 11
[28] Understanding virtual design behaviors: A large-scale analysis of the design process in Virtual Reality
Wang, Portia
Miller, Mark R.
Han, Eugy
Deveaux, Cyan
Bailenson, Jeremy N.
DESIGN STUDIES, 2024, 90
[29] EfiLoc: large-scale visual indoor localization with efficient correlation between sparse features and 3D points
Li, Ning
Ai, Haojun
VISUAL COMPUTER, 2022, 38 (06) : 2091 - 2106
