VirtualLoc: Large-scale Visual Localization Using Virtual Images

被引:1
|
作者
Xiong, Yuan [1 ]
Wang, Jingru [1 ]
Zhou, Zhong [1 ,2 ,3 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] State Key Lab Virtual Real Technol & Syst, 37 Xueyuan Rd, Beijing 100191, Peoples R China
[3] Zhongguancun Lab, 37 Xueyuan Rd, Beijing 100191, Peoples R China
关键词
Visual localization; virtual reality; image retrieval; rendering;
D O I
10.1145/3622788
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robust and accurate camera pose estimation is fundamental in computer vision. Learning-based regression approaches acquire six-degree-of-freedom camera parameters accurately from visual cues of an input image. However, most are trained on street-view and landmark datasets. These approaches can hardly be generalized to overlooking use cases, such as the calibration of the surveillance camera and unmanned aerial vehicle. Besides, reference images captured from the real world are rare and expensive, and their diversity is not guaranteed. In this article, we address the problem of using alternative virtual images for visual localization training. This work has the following principle contributions: First, we present a new challenging localization dataset containing six reconstructed large-scale three-dimensional scenes, 10,594 calibrated photographs with condition changes, and 300k virtual images with pixelwise labeled depth, relative surface normal, and semantic segmentation. Second, we present a flexible multi-feature fusion network trained on virtual image datasets for robust image retrieval. Third, we propose an end-to-end confidence map prediction network for feature filtering and pose estimation. We demonstrate that large-scale rendered virtual images are beneficial to visual localization. Using virtual images can solve the diversity problem of real images and leverage labeled multi-feature data for deep learning. Experimental results show that our method achieves remarkable performance surpassing state-of-the-art approaches. To foster research on improvement for visual localization using synthetic images, we release our benchmark at https://github.com/YuanXiong/contributions.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] The role of perception and action on the use of allocentric information in a large-scale virtual environment
    Karimpur, Harun
    Kurz, Johannes
    Fiehler, Katja
    EXPERIMENTAL BRAIN RESEARCH, 2020, 238 (09) : 1813 - 1826
  • [42] Deep image retrieval of large-scale vessels images based on BoW model
    Chi Tian
    Jinfeng Xia
    Ji Tang
    Hui Yin
    Multimedia Tools and Applications, 2020, 79 : 9387 - 9401
  • [43] Deep image retrieval of large-scale vessels images based on BoW model
    Tian, Chi
    Xia, Jinfeng
    Tang, Ji
    Yin, Hui
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (13-14) : 9387 - 9401
  • [44] Visual landmark recognition from Internet photo collections: A large-scale evaluation
    Weyand, Tobias
    Leibe, Bastian
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 135 : 1 - 15
  • [45] Temporal Aggregation of Visual Features for Large-Scale Image-to-Video Retrieval
    Garcia, Noa
    ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 489 - 492
  • [46] Development of MirrorShape: High Fidelity Large-Scale Shape Rendering Framework for Virtual Reality
    Fedoseev, Aleksey
    Chernyadev, Nikita
    Tsetserukou, Dzmitry
    25TH ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY (VRST 2019), 2019,
  • [47] A Novel Method of Multi-user Redirected Walking for Large-Scale Virtual Environments
    Dong, Tianyang
    Song, Yifan
    Shen, Yuqi
    Fan, Jing
    ADVANCES IN COMPUTER GRAPHICS, CGI 2019, 2019, 11542 : 143 - 154
  • [48] OPENWEBGLOBE - AN OPEN SOURCE SDK FOR CREATING LARGE-SCALE VIRTUAL GLOBES ON A WEBGL BASIS
    Loesch, B.
    Christen, M.
    Nebiker, S.
    XXII ISPRS CONGRESS, TECHNICAL COMMISSION IV, 2012, 39-B4 : 195 - 200
  • [49] Fully Connected Hashing Neural Networks for Indexing Large-Scale Remote Sensing Images
    Liu, Na
    Mou, Haiming
    Tang, Jun
    Wan, Lihong
    Li, Qingdu
    Yuan, Ye
    MATHEMATICS, 2022, 10 (24)
  • [50] Utilizing Software Architecture Recovery to Explore Large-Scale Software Systems in Virtual Reality
    Hoff, Adrian
    Gerling, Lea
    Seidl, Christoph
    2022 WORKING CONFERENCE ON SOFTWARE VISUALIZATION (IEEE VISSOFT), 2022, : 119 - 130