Photo-realistic 3D model based accurate visual positioning system for large-scale indoor spaces

被引：3

作者：

Hyeon, Janghun ^{[1
]}

Jang, Bumchul ^{[2
,3
]}

Choi, Hyunga ^{[3
]}

Kim, Joohyung ^{[2
]}

Kim, Dongwoo ^{[4
]}

Doh, Nakju ^{[3
,5
]}

机构：

[1] Korea Univ, Semicond Res Inst, Seoul 02841, South Korea

[2] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea

[3] TeeLabs, Seoul 02857, South Korea

[4] Hyundai Mobis, Seoul 16891, South Korea

[5] Korea Univ, Inst Convergence Sci, Seoul 02841, South Korea

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023年 / 123卷

基金：

新加坡国家研究基金会;

关键词：

Visual localization; Visual positioning systems; Camera pose estimation; Image retrieval; Place recognition; Indoor spaces;

D O I：

10.1016/j.engappai.2023.106256

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This study presents a novel and reliable visual positioning system (VPS), KR-Net, for kidnap recovery tasks, which predicts an accurate position when a robot is first initiated. KR-Net is based on a hierarchical visual localization method and demonstrates significant robustness in large-scale indoor environments. The proposed VPS utilizes a photo-realistic 3D model to generate a dense database of any camera pose and incorporates a novel global descriptor for indoor spaces, i-GeM, that outperforms existing methods in terms of robustness. Additionally, the proposed combinatorial pooling approach overcomes the limitations of previous single image-based predictions in large-scale indoor environments, allowing for accurate discrimination between similar locations. Extensive evaluations were performed on six large-scale indoor datasets to demonstrate the contributions of each component. To the best of our knowledge, KR-Net is the first system to estimate wake-up positions with a near 100% confidence level within a 1.0 m distance error threshold.

引用

页数：15

共 42 条

[31] From Coarse to Fine: Robust Hierarchical Localization at Large Scale
Sarlin, Paul-Edouard
Cadena, Cesar
Siegwart, Roland
Dymczyk, Marcin
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12708 - 12717
[32] Understanding the Limitations of CNN-based Absolute Camera Pose Regression
Sattler, Torsten
Zhou, Qunjie
Pollefeys, Marc
Leal-Taixe, Laura
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3297 - 3307
[33] Structure-from-Motion Revisited
Schonberger, Johannes L.
Frahm, Jan -Michael
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4104 - 4113
[34] Pixelwise View Selection for Unstructured Multi-View Stereo
Schonberger, Johannes L.
Zheng, Enliang
Frahm, Jan-Michael
Pollefeys, Marc
[J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 501 - 518
[35] Segal A. V., 2009, ROBOTICS SCI SYSTEMS, P435, DOI [10.7551/mitpress/8727.003.0022, DOI 10.15607/RSS.2009.V.021]
[36] Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
[37] InLoc: Indoor Visual Localization with Dense Matching and View Synthesis
Taira, Hajime
Okutomi, Masatoshi
Sattler, Torsten
Cimpoi, Mircea
Pollefeys, Marc
Sivic, Josef
Pajdla, Tomas
Torii, Akihiko
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7199 - 7209
[38] SOSNet: Second Order Similarity Regularization for Local Descriptor Learning
Tian, Yurun
Yu, Xin
Fan, Bin
Wu, Fuchao
Heijnen, Huub
Balntas, Vassileios
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11008 - 11017
[39] Tyszkiewicz M., 2020, ADV NEUR IN
[40] Valada A, 2018, IEEE INT CONF ROBOT, P6939, DOI 10.1109/ICRA.2018.8462979

← 1 2 3 4 5 →