POSE ESTIMATION THROUGH MASK-R CNN AND vSLAM IN LARGE-SCALE OUTDOORS AUGMENTED REALITY

Cited by: 3
Authors
Boutsi, A-M [1]
Bakalos, N. [1]
Ioannidis, C. [1]
Affiliations
[1] Natl Tech Univ Athens, Sch Rural & Surveying Engn, Lab Photogrammetry, Athens, Greece
Source
XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION IV | 2022 / Vol. 5-4
Keywords
Deep Learning; Augmented Reality; CNN; image recognition; pose estimation; 3D rendering
DOI
10.5194/isprs-annals-V-4-2022-197-2022
Chinese Library Classification number
P9 [Physical Geography];
Discipline classification code
0705 ; 070501 ;
Abstract
Deep Learning (DL) integrated into Mobile Augmented Reality (MAR) enables a new information-delivery paradigm. In the context of 6 DoF pose estimation, powerful DL networks could provide a direct solution for AR systems. However, running them concurrently with tracking requires a significant number of computations per frame and leads to both misclassifications and localization errors. In this paper, a hybrid and lightweight solution for 3D tracking of arbitrary geometry in outdoor MAR scenarios is presented. The camera pose information obtained by the ARCore SDK and a vSLAM algorithm is combined with the semantic and geometric output of a CNN object detector to validate and improve tracking performance in large-scale and uncontrolled outdoor environments. The methodology involves three main steps: i) training of the Mask R-CNN model to extract the class, bounding box and mask predictions, ii) real-time detection, segmentation and localization of the region of interest (ROI) in camera frames, and iii) computation of 2D-3D correspondences to enhance pose estimation of a 3D overlay. The dataset holds 30 images of the rock of St. Modestos - Modi in Meteora, Greece, in which the ROI is an area with characteristic geological features. The comparative evaluation between the prototype system and the original one, as well as against R-CNN and Fast R-CNN detectors, demonstrates higher precision and stable visualization at a distance of half a kilometre, while tracking time decreased by 42% during far-field AR sessions.
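The idea of validating a tracked pose against the detector output can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a pinhole camera with the computer-vision convention (camera looking along +Z, pose given as `x_cam = R @ p_world + t`), and it treats the detector output as a plain bounding box. All names (`project_point`, `fraction_inside`), the intrinsics, the overlay points and the box are hypothetical values chosen for illustration. The sketch projects 3D overlay points with the current pose and reports what fraction of them land inside the detected ROI; a low fraction would suggest that the pose has drifted.

```python
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]
Mat3 = Sequence[Sequence[float]]
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max) in pixels


def project_point(p_world: Vec3, R: Mat3, t: Vec3,
                  fx: float, fy: float, cx: float, cy: float) -> Tuple[float, float]:
    """Project a world point to pixel coordinates with a pinhole model."""
    # Transform into the camera frame: x_cam = R @ p_world + t.
    cam = [sum(R[r][c] * p_world[c] for c in range(3)) + t[r] for r in range(3)]
    if cam[2] <= 0:
        raise ValueError("point is not in front of the camera")
    # Perspective division followed by the intrinsic parameters.
    return (fx * cam[0] / cam[2] + cx, fy * cam[1] / cam[2] + cy)


def fraction_inside(points_px: Sequence[Tuple[float, float]], box: Box) -> float:
    """Return the share of projected points that fall inside the ROI box."""
    if not points_px:
        return 0.0
    x_min, y_min, x_max, y_max = box
    inside = sum(1 for (u, v) in points_px
                 if x_min <= u <= x_max and y_min <= v <= y_max)
    return inside / len(points_px)


# Hypothetical set-up: camera at the origin, overlay points about 500 m away.
IDENTITY = ((1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0))
overlay_points = [
    (0.0, 0.0, 500.0),
    (50.0, 0.0, 500.0),
    (-50.0, 25.0, 500.0),
    (200.0, 0.0, 500.0),  # deliberately placed outside the assumed ROI
]
pixels = [
    project_point(p, IDENTITY, (0.0, 0.0, 0.0), 1000.0, 1000.0, 640.0, 360.0)
    for p in overlay_points
]
roi_box: Box = (500.0, 300.0, 800.0, 450.0)  # assumed detector bounding box
score = fraction_inside(pixels, roi_box)
print(pixels)
print(score)
```

In a real session the pose would come from the tracking SDK and the box or mask from the detector. Note that ARCore reports poses in an OpenGL-style frame (camera looking along -Z), so an axis conversion would be needed before applying this convention.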
Pages: 197-204
Page count: 8