Photo-realistic 3D model based accurate visual positioning system for large-scale indoor spaces

被引:3
作者
Hyeon, Janghun [1 ]
Jang, Bumchul [2 ,3 ]
Choi, Hyunga [3 ]
Kim, Joohyung [2 ]
Kim, Dongwoo [4 ]
Doh, Nakju [3 ,5 ]
机构
[1] Korea Univ, Semicond Res Inst, Seoul 02841, South Korea
[2] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea
[3] TeeLabs, Seoul 02857, South Korea
[4] Hyundai Mobis, Seoul 16891, South Korea
[5] Korea Univ, Inst Convergence Sci, Seoul 02841, South Korea
基金
新加坡国家研究基金会;
关键词
Visual localization; Visual positioning systems; Camera pose estimation; Image retrieval; Place recognition; Indoor spaces;
D O I
10.1016/j.engappai.2023.106256
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study presents a novel and reliable visual positioning system (VPS), KR-Net, for kidnap recovery tasks, which predicts an accurate position when a robot is first initiated. KR-Net is based on a hierarchical visual localization method and demonstrates significant robustness in large-scale indoor environments. The proposed VPS utilizes a photo-realistic 3D model to generate a dense database of any camera pose and incorporates a novel global descriptor for indoor spaces, i-GeM, that outperforms existing methods in terms of robustness. Additionally, the proposed combinatorial pooling approach overcomes the limitations of previous single image-based predictions in large-scale indoor environments, allowing for accurate discrimination between similar locations. Extensive evaluations were performed on six large-scale indoor datasets to demonstrate the contributions of each component. To the best of our knowledge, KR-Net is the first system to estimate wake-up positions with a near 100% confidence level within a 1.0 m distance error threshold.
引用
收藏
页数:15
相关论文
共 42 条
  • [1] Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/CVPR.2016.572, 10.1109/TPAMI.2017.2711011]
  • [2] Video-rate Localization in Multiple Maps for Wearable Augmented Reality
    Castle, Robert
    Klein, Georg
    Murray, David W.
    [J]. TWELFTH IEEE INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, PROCEEDINGS, 2008, : 15 - 22
  • [3] Cavalli Luca, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12364), P770, DOI 10.1007/978-3-030-58529-7_45
  • [4] Exploiting Spatio-Temporal Correlations with Multiple 3D Convolutional Neural Networks for Citywide Vehicle Flow Prediction
    Chen, Cen
    Li, Kenli
    Teo, Sin G.
    Chen, Guizi
    Zou, Xiaofeng
    Yang, Xulei
    Vijay, Ramaseshan C.
    Feng, Jiashi
    Zeng, Zeng
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 893 - 898
  • [5] Chen Y., 2021, INFORM SCIENCES
  • [6] SuperPoint: Self-Supervised Interest Point Detection and Description
    DeTone, Daniel
    Malisiewicz, Tomasz
    Rabinovich, Andrew
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 337 - 349
  • [7] TeeVR: Spatial Template-Based Acquisition, Modeling, and Rendering of Large-Scale Indoor Spaces
    Doh, Nathan
    Choi, Hyunga
    Jang, Bumchul
    Ahn, Sangmin
    Jung, Hyojin
    Lee, Sungkil
    [J]. ACM SIGGRAPH 2019 EMERGING TECHNOLOGIES (SIGGRAPH '19), 2019,
  • [8] Dusmanu M, 2019, Arxiv, DOI arXiv:1905.03561
  • [9] Ester M., 1996, P 2 INT C KNOWL DISC, P226, DOI DOI 10.5555/3001460.3001507
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778