Photo-realistic 3D model based accurate visual positioning system for large-scale indoor spaces

被引:3
作者
Hyeon, Janghun [1 ]
Jang, Bumchul [2 ,3 ]
Choi, Hyunga [3 ]
Kim, Joohyung [2 ]
Kim, Dongwoo [4 ]
Doh, Nakju [3 ,5 ]
机构
[1] Korea Univ, Semicond Res Inst, Seoul 02841, South Korea
[2] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea
[3] TeeLabs, Seoul 02857, South Korea
[4] Hyundai Mobis, Seoul 16891, South Korea
[5] Korea Univ, Inst Convergence Sci, Seoul 02841, South Korea
基金
新加坡国家研究基金会;
关键词
Visual localization; Visual positioning systems; Camera pose estimation; Image retrieval; Place recognition; Indoor spaces;
D O I
10.1016/j.engappai.2023.106256
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study presents a novel and reliable visual positioning system (VPS), KR-Net, for kidnap recovery tasks, which predicts an accurate position when a robot is first initiated. KR-Net is based on a hierarchical visual localization method and demonstrates significant robustness in large-scale indoor environments. The proposed VPS utilizes a photo-realistic 3D model to generate a dense database of any camera pose and incorporates a novel global descriptor for indoor spaces, i-GeM, that outperforms existing methods in terms of robustness. Additionally, the proposed combinatorial pooling approach overcomes the limitations of previous single image-based predictions in large-scale indoor environments, allowing for accurate discrimination between similar locations. Extensive evaluations were performed on six large-scale indoor datasets to demonstrate the contributions of each component. To the best of our knowledge, KR-Net is the first system to estimate wake-up positions with a near 100% confidence level within a 1.0 m distance error threshold.
引用
收藏
页数:15
相关论文
共 42 条
  • [31] From Coarse to Fine: Robust Hierarchical Localization at Large Scale
    Sarlin, Paul-Edouard
    Cadena, Cesar
    Siegwart, Roland
    Dymczyk, Marcin
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12708 - 12717
  • [32] Understanding the Limitations of CNN-based Absolute Camera Pose Regression
    Sattler, Torsten
    Zhou, Qunjie
    Pollefeys, Marc
    Leal-Taixe, Laura
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3297 - 3307
  • [33] Structure-from-Motion Revisited
    Schonberger, Johannes L.
    Frahm, Jan -Michael
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4104 - 4113
  • [34] Pixelwise View Selection for Unstructured Multi-View Stereo
    Schonberger, Johannes L.
    Zheng, Enliang
    Frahm, Jan-Michael
    Pollefeys, Marc
    [J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 501 - 518
  • [35] Segal A. V., 2009, ROBOTICS SCI SYSTEMS, P435, DOI [10.7551/mitpress/8727.003.0022, DOI 10.15607/RSS.2009.V.021]
  • [36] Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
  • [37] InLoc: Indoor Visual Localization with Dense Matching and View Synthesis
    Taira, Hajime
    Okutomi, Masatoshi
    Sattler, Torsten
    Cimpoi, Mircea
    Pollefeys, Marc
    Sivic, Josef
    Pajdla, Tomas
    Torii, Akihiko
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7199 - 7209
  • [38] SOSNet: Second Order Similarity Regularization for Local Descriptor Learning
    Tian, Yurun
    Yu, Xin
    Fan, Bin
    Wu, Fuchao
    Heijnen, Huub
    Balntas, Vassileios
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11008 - 11017
  • [39] Tyszkiewicz M., 2020, ADV NEUR IN
  • [40] Valada A, 2018, IEEE INT CONF ROBOT, P6939, DOI 10.1109/ICRA.2018.8462979