Real-Time Onboard Object Detection for Augmented Reality: Enhancing Head-Mounted Display with YOLOv8

被引:3
|
作者
Lysakowski, Mikolaj [1 ]
Zywanowski, Kamil [1 ]
Banaszczyk, Adam [1 ]
Nowicki, Michal R. [1 ,2 ]
Skrzypczynski, Piotr [1 ,2 ]
Tadeja, Slawomir K. [3 ]
机构
[1] Poznan Univ Tech, Ctr Artificial Intelligence & Cybersecur, Poznan, Poland
[2] Poznan Univ Tech, Inst Robot & Machine Intelligence, Poznan, Poland
[3] Univ Cambridge, Inst Mfg, Dept Engn, Cambridge, England
关键词
augmented reality; machine learning; real-time object detection; edge computing;
D O I
10.1109/EDGE60047.2023.00059
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a software architecture for real-time object detection using machine learning (ML) in an augmented reality (AR) environment. Our approach uses the recent state-of-the-art YOLOv8 network that runs onboard on the Microsoft HoloLens 2 head-mounted display (HMD). The primary motivation behind this research is to enable the application of advanced ML models for enhanced perception and situational awareness with a wearable, hands-free AR platform. We show the image processing pipeline for the YOLOv8 model and the techniques used to make it real-time on the resource-limited edge computing platform of the headset. The experimental results demonstrate that our solution achieves real-time processing without needing offloading tasks to the cloud or any other external servers while retaining satisfactory accuracy regarding the usual mAP metric and measured qualitative performance.
引用
收藏
页码:364 / 371
页数:8
相关论文
共 50 条
  • [1] Integrating a Head-mounted Display with a Mobile Device for Real-time Augmented Reality Purposes
    Madeira, Bruno
    Alves, Pedro
    Marto, Anabela
    Rodrigues, Nuno
    Goncalves, Alexandrino
    GRAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 1: GRAPP, 2021, : 313 - 319
  • [2] Augmented Reality for Presenting Real-Time Data During Students' Laboratory Work: Comparing a Head-Mounted Display With a Separate Display
    Thees, Michael
    Altmeyer, Kristin
    Kapp, Sebastian
    Rexigel, Eva
    Beil, Fabian
    Klein, Pascal
    Malone, Sarah
    Bruenken, Roland
    Kuhn, Jochen
    FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [3] Real-Time Step Detection Using the Integrated Sensors of a Head-Mounted Display
    Caserman, Polona
    Krabbe, Patrick
    Wojtusch, Janis
    von Stryk, Oskar
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 3510 - 3515
  • [4] Towards a Low-Cost Augmented Reality Head-Mounted Display with Real-Time Eye Center Location Capability
    D'Angelo, Thiago
    Delabrida, Saul Emanuel
    Rabelo Oliveira, Ricardo A.
    Loureiro, Antonio A. F.
    2016 VI BRAZILIAN SYMPOSIUM ON COMPUTING SYSTEMS ENGINEERING (SBESC 2016), 2016, : 24 - 31
  • [5] Real-Time Vehicles Detection with YOLOv8
    Lin, Chih-Jer
    Lee, Chi-Mo
    2024 11TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN, ICCE-TAIWAN 2024, 2024, : 805 - 806
  • [6] Calibration of a head-mounted projective display for augmented reality systems
    Hua, H
    Gao, CY
    Ahuja, N
    INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY, PROCEEDINGS, 2002, : 176 - 185
  • [7] A lightweight YOLOv8 integrating FasterNet for real-time underwater object detection
    An Guo
    Kaiqiong Sun
    Ziyi Zhang
    Journal of Real-Time Image Processing, 2024, 21
  • [8] Head-mounted display augmented reality in manufacturing: A systematic review
    Fang, Wei
    Chen, Lixi
    Zhang, Tienong
    Chen, Chengjun
    Teng, Zhan
    Wang, Lihui
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2023, 83
  • [9] A lightweight YOLOv8 integrating FasterNet for real-time underwater object detection
    Guo, An
    Sun, Kaiqiong
    Zhang, Ziyi
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (02)
  • [10] η-repyolo: real-time object detection method based on η-RepConv and YOLOv8
    Feng, Shuai
    Qian, Huaming
    Wang, Huilin
    Wang, Wenna
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (03)