VirtualLoc: Large-scale Visual Localization Using Virtual Images

被引:1
|
作者
Xiong, Yuan [1 ]
Wang, Jingru [1 ]
Zhou, Zhong [1 ,2 ,3 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] State Key Lab Virtual Real Technol & Syst, 37 Xueyuan Rd, Beijing 100191, Peoples R China
[3] Zhongguancun Lab, 37 Xueyuan Rd, Beijing 100191, Peoples R China
关键词
Visual localization; virtual reality; image retrieval; rendering;
D O I
10.1145/3622788
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robust and accurate camera pose estimation is fundamental in computer vision. Learning-based regression approaches acquire six-degree-of-freedom camera parameters accurately from visual cues of an input image. However, most are trained on street-view and landmark datasets. These approaches can hardly be generalized to overlooking use cases, such as the calibration of the surveillance camera and unmanned aerial vehicle. Besides, reference images captured from the real world are rare and expensive, and their diversity is not guaranteed. In this article, we address the problem of using alternative virtual images for visual localization training. This work has the following principle contributions: First, we present a new challenging localization dataset containing six reconstructed large-scale three-dimensional scenes, 10,594 calibrated photographs with condition changes, and 300k virtual images with pixelwise labeled depth, relative surface normal, and semantic segmentation. Second, we present a flexible multi-feature fusion network trained on virtual image datasets for robust image retrieval. Third, we propose an end-to-end confidence map prediction network for feature filtering and pose estimation. We demonstrate that large-scale rendered virtual images are beneficial to visual localization. Using virtual images can solve the diversity problem of real images and leverage labeled multi-feature data for deep learning. Experimental results show that our method achieves remarkable performance surpassing state-of-the-art approaches. To foster research on improvement for visual localization using synthetic images, we release our benchmark at https://github.com/YuanXiong/contributions.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Immersive human-computer interactive virtual environment using large-scale display system
    Wang, Xiuhui
    Yan, Ke
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 96 : 649 - 659
  • [22] VOIR: Virtual Reality Visualization Software for Large-Scale Simulations
    Ohno, Nobuaki
    Kageyama, Akira
    PLASMA AND FUSION RESEARCH, 2024, 19 : 1401024 - 1401026
  • [23] Depth as attention to learn image representations for visual localization, using monocular images
    Hettiarachchi, Dulmini
    Tian, Ye
    Yu, Han
    Kamijo, Shunsuke
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [24] Large-scale Visual Search and Similarity for E-Commerce
    Anand, Gaurav
    Wang, Siyun
    Ni, Karl
    APPLICATIONS OF MACHINE LEARNING 2021, 2021, 11843
  • [25] Large-scale machinery monitoring system based on the visual reality
    Zhang, Yusi
    Ruan, Jun
    PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 863 - 867
  • [26] A Visual Positioning Method of UAV in a Large-Scale Outdoor Environment
    Zhao, Chenhao
    Wu, Dewei
    He, Jing
    Dai, Chuanjin
    SENSORS, 2023, 23 (15)
  • [27] Distributed training of CosPlace for large-scale visual place recognition
    Zaccone, Riccardo
    Berton, Gabriele
    Masone, Carlo
    FRONTIERS IN ROBOTICS AND AI, 2024, 11
  • [28] Understanding virtual design behaviors: A large-scale analysis of the design process in Virtual Reality
    Wang, Portia
    Miller, Mark R.
    Han, Eugy
    Deveaux, Cyan
    Bailenson, Jeremy N.
    DESIGN STUDIES, 2024, 90
  • [29] EfiLoc: large-scale visual indoor localization with efficient correlation between sparse features and 3D points
    Li, Ning
    Ai, Haojun
    VISUAL COMPUTER, 2022, 38 (06) : 2091 - 2106
  • [30] EfiLoc: large-scale visual indoor localization with efficient correlation between sparse features and 3D points
    Ning Li
    Haojun Ai
    The Visual Computer, 2022, 38 : 2091 - 2106