Photo-realistic 3D model based accurate visual positioning system for large-scale indoor spaces

被引：3

作者：

Hyeon, Janghun ^{[1
]}

Jang, Bumchul ^{[2
,3
]}

Choi, Hyunga ^{[3
]}

Kim, Joohyung ^{[2
]}

Kim, Dongwoo ^{[4
]}

Doh, Nakju ^{[3
,5
]}

机构：

[1] Korea Univ, Semicond Res Inst, Seoul 02841, South Korea

[2] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea

[3] TeeLabs, Seoul 02857, South Korea

[4] Hyundai Mobis, Seoul 16891, South Korea

[5] Korea Univ, Inst Convergence Sci, Seoul 02841, South Korea

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023年 / 123卷

基金：

新加坡国家研究基金会;

关键词：

Visual localization; Visual positioning systems; Camera pose estimation; Image retrieval; Place recognition; Indoor spaces;

D O I：

10.1016/j.engappai.2023.106256

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This study presents a novel and reliable visual positioning system (VPS), KR-Net, for kidnap recovery tasks, which predicts an accurate position when a robot is first initiated. KR-Net is based on a hierarchical visual localization method and demonstrates significant robustness in large-scale indoor environments. The proposed VPS utilizes a photo-realistic 3D model to generate a dense database of any camera pose and incorporates a novel global descriptor for indoor spaces, i-GeM, that outperforms existing methods in terms of robustness. Additionally, the proposed combinatorial pooling approach overcomes the limitations of previous single image-based predictions in large-scale indoor environments, allowing for accurate discrimination between similar locations. Extensive evaluations were performed on six large-scale indoor datasets to demonstrate the contributions of each component. To the best of our knowledge, KR-Net is the first system to estimate wake-up positions with a near 100% confidence level within a 1.0 m distance error threshold.

引用

页数：15

共 42 条

[1] Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/CVPR.2016.572, 10.1109/TPAMI.2017.2711011]
[2] Video-rate Localization in Multiple Maps for Wearable Augmented Reality
Castle, Robert
Klein, Georg
Murray, David W.
[J]. TWELFTH IEEE INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, PROCEEDINGS, 2008, : 15 - 22
[3] Cavalli Luca, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12364), P770, DOI 10.1007/978-3-030-58529-7_45
[4] Exploiting Spatio-Temporal Correlations with Multiple 3D Convolutional Neural Networks for Citywide Vehicle Flow Prediction
Chen, Cen
Li, Kenli
Teo, Sin G.
Chen, Guizi
Zou, Xiaofeng
Yang, Xulei
Vijay, Ramaseshan C.
Feng, Jiashi
Zeng, Zeng
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 893 - 898
[5] Chen Y., 2021, INFORM SCIENCES
[6] SuperPoint: Self-Supervised Interest Point Detection and Description
DeTone, Daniel
Malisiewicz, Tomasz
Rabinovich, Andrew
[J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 337 - 349
[7] TeeVR: Spatial Template-Based Acquisition, Modeling, and Rendering of Large-Scale Indoor Spaces
Doh, Nathan
Choi, Hyunga
Jang, Bumchul
Ahn, Sangmin
Jung, Hyojin
Lee, Sungkil
[J]. ACM SIGGRAPH 2019 EMERGING TECHNOLOGIES (SIGGRAPH '19), 2019,
[8] Dusmanu M, 2019, Arxiv, DOI arXiv:1905.03561
[9] Ester M., 1996, P 2 INT C KNOWL DISC, P226, DOI DOI 10.5555/3001460.3001507
[10] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778

← 1 2 3 4 5 →