Efficient vision-based multi-target augmented reality in the browser

被引:7
作者
Al-Zoube, Mohammed A. [1 ]
机构
[1] Princess Sumaya Univ Technol PSUT, Dept Comp Graph, POB 1438, Amman 11941, Jordan
关键词
Augmented reality; Web AR; Pose estimation; MobileNets; WebAssembly; Deep learning; Cross-platform; POSE ESTIMATION;
D O I
10.1007/s11042-022-12206-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Augmented Reality (AR) has gained rising attention from both industry and academia as it enhances the way we interact with the physical world. Compared with native AR apps, implementing AR with web technologies (Web AR) can provide lightweight and universal cross-platform deployment that does not involve extra downloading and installation in advance. However, there are some challenges when developing Web AR apps, such as computational efficiency and networking. The limited capabilities of the browser, especially on mobile devices, make it more challenging to develop efficient web apps. Fortunately, several technical advances have emerged that could change the status of Web AR. This paper presents an efficient implementation of a vision-based and multi-target Web AR app that runs at real-time frame rates on standard web browsers on mobile devices and PCs. A method based on natural features tracking (NFT) is used, and several new web technologies are optimized to achieve specific tasks. The proposed implementation takes advantage of an efficient and lightweight class of convolutional neural networks (CNN) to classify image targets. It uses an image registration method that eliminates the need for a database of the feature points' descriptors, which is usually used in natural feature tracking methods. Computation-intensive tasks, such as target extraction and pose estimation, were computed with separate threads. Thus, the main thread which handles the HTML rendering runs smoothly and is not blocked by these computation-intensive tasks. To evaluate the performance of the proposed architecture and validate its performance, a prototype app was developed. The findings demonstrate that the app can track multiple image targets with real-time frame rates and stable interaction.
引用
收藏
页码:14303 / 14320
页数:18
相关论文
共 37 条
[1]  
Abadi Martin, 2016, arXiv
[2]  
Abriata Luciano A., 2018, ARXIV PREPRINT ARXIV
[3]  
Acuna R, 2018, ARXIV PREPRINT ARXIV
[4]   Applying Deep Learning in Augmented Reality Tracking [J].
Akgul, Omer ;
Penekli, H. Ibrahim ;
Genc, Yakup .
2016 12TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2016, :47-54
[5]   Web-Based Augmented Reality with Natural Feature Tracking and Advanced Rendering [J].
Al-Zoube, Mohammed A. .
2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, :320-326
[6]  
[Anonymous], 2017, AR JS PROJECT HOMEPA
[7]  
[Anonymous], 2011, P ACM INT C COMPANIO, DOI DOI 10.1145/2048147.2048224
[8]  
Belghit H, 2018, ARXIV PREPRINT ARXIV
[9]   Universal Web-Based Tracking for Augmented Reality Applications [J].
Bonenberger, Yannic ;
Rambach, Jason ;
Pagani, Alain ;
Stricker, Didier .
VIRTUAL REALITY AND AUGMENTED REALITY, EUROVR 2018, 2018, 11162 :18-27
[10]  
Bouguet J.-Y., 2001, Intel corporation, V5, DOI DOI 10.1109/ICETET.2009.154