Deep Learning Approach for Enhanced Object Recognition and Assembly Guidance with Augmented Reality

被引:0
作者
Lee, Boon Giin [1 ]
Wang, Xiaoying [1 ]
Han, Renzhi [1 ]
Sun, Linjing [1 ]
Pike, Matthew [1 ]
Chung, Wan-Young [2 ]
机构
[1] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315100, Peoples R China
[2] Pukyong Natl Univ, Dept Artificial Convergence, Busan 48513, South Korea
来源
INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2023, PT II | 2024年 / 14532卷
关键词
Augmented Reality; Assembly Tasks; Object Detection; Object Recognition;
D O I
10.1007/978-3-031-53830-8_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In an effort to enhance the efficiency and precision of manual part assembly in industrial settings, the development of software for assembly guidance becomes imperative. Augmented reality (AR) technology offers a means to provide visual instructions for assembly tasks, rendering the guidance more comprehensible. Nevertheless, a significant challenge lies in the technology's limited object detection capabilities, especially when distinguishing between similar assembled parts. This project proposes the utilization of deep learning neural networks to enhance the accuracy of object recognition within the AR guided assembly application. To achieve this objective, a dataset of assembly parts, known as the Visual Object Classes (VOC) dataset, was created. Data augmentation techniques were employed to expand this dataset, incorporating scale HSV (hue saturation value) transformations. Subsequently, deep learning models for the recognition of assembly parts were developed which were based on the Single Shot Multibox Detector (SSD) and the YOLOv7 detector. The models were trained and fine-tuned, targeting on the variations of the positions of detected parts. The effectiveness of this approach was evaluated using a case study involving an educational electronic blocks circuit science kit. The results demonstrated a high assembly part recognition accuracy of over 99% in mean average precision (MAP), along with favorable user testing outcomes. Consequently, the AR application was capable of offering high-quality guidance to users which holds promise for application in diverse scenarios and the resolution of real-world challenges.
引用
收藏
页码:105 / 114
页数:10
相关论文
共 10 条
[1]   Recent advances in augmented reality [J].
Azuma, R ;
Baillot, Y ;
Behringer, R ;
Feiner, S ;
Julier, S ;
MacIntyre, B .
IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2001, 21 (06) :34-47
[2]  
Brook J, 1996, Usability Evaluation in Industry, P189, DOI [DOI 10.1201/9781498710411, DOI 10.1201/9781498710411-35]
[3]   Use of projector based augmented reality to improve manual spot-welding precision and accuracy for automotive manufacturing [J].
Doshi, Ashish ;
Smith, Ross T. ;
Thomas, Bruce H. ;
Bouras, Con .
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2017, 89 (5-8) :1279-1293
[4]   An AR-based hybrid approach for facility layout planning and evaluation for existing shop floors [J].
Jiang, S. ;
Ong, S. K. ;
Nee, A. Y. C. .
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2014, 72 (1-4) :457-473
[5]  
Katiyar A., 2015, Advances in Computer Science and Information Technology, V2, P441
[6]   Augmented Reality applications as digital experiments for education - An example in the Earth-Moon System [J].
Lindner, Claudia ;
Rienow, Andreas ;
Juergens, Carsten .
ACTA ASTRONAUTICA, 2019, 161 :66-74
[7]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[8]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[9]   A comprehensive and systematic look up into deep learning based object detection techniques: A review [J].
Sharma, Vipal Kumar ;
Mir, Roohie Naaz .
COMPUTER SCIENCE REVIEW, 2020, 38
[10]   YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors [J].
Wang, Chien-Yao ;
Bochkovskiy, Alexey ;
Liao, Hong-Yuan Mark .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :7464-7475