A gaze-driven manufacturing assembly assistant system with integrated step recognition, repetition analysis, and real-time feedback

Cited by: 5
Authors
Chen, Haodong [1]
Zendehdel, Niloofar [2]
Leu, Ming C. [2]
Yin, Zhaozheng [3, 4]
Affiliations
[1] Univ Maryland, Dept Mech Engn, College Pk, MD 20742 USA
[2] Missouri Univ Sci & Technol, Dept Mech & Aerosp Engn, Rolla, MO USA
[3] SUNY Stony Brook, Dept Biomed Informat, Stony Brook, NY USA
[4] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY USA
Funding
National Science Foundation (USA);
Keywords
Assembly assistance; Eye gaze estimation; Repetitive action counting; Transformer; Implemented artificial intelligence; Application of artificial intelligence; EYE-MOVEMENTS;
DOI
10.1016/j.engappai.2025.110076
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Modern manufacturing faces significant challenges, including efficiency bottlenecks and high error rates in manual assembly operations. To address these challenges, we implement artificial intelligence (AI) and propose a gaze-driven assembly assistant system that leverages artificial intelligence for human-centered smart manufacturing. Our system processes video inputs of assembly activities using a Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) network for assembly step recognition, a Transformer network for repetitive action counting, and a gaze tracker for eye gaze estimation. The application of AI integrates the outputs of these tasks to deliver real-time visual assistance through a software interface that displays relevant tools, parts, and procedural instructions based on recognized steps and gaze data. Experimental results demonstrate the system's high performance, achieving 98.36% accuracy in assembly step recognition and, in action counting, a mean absolute error (MAE) of 4.37% and an off-by-one accuracy (OBOA) of 95.88%. Compared to existing solutions, our gaze-driven assistant offers superior precision and efficiency, providing a scalable and adaptable framework suitable for complex and large-scale manufacturing environments.
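As a reading aid for the two counting metrics reported in the abstract, the following is a minimal sketch of how MAE and OBOA are commonly computed in the repetition-counting literature. This record does not state the paper's exact definitions, so the normalized form of MAE (absolute error divided by the true count) is an assumption, and the function name `counting_metrics` and the sample counts are hypothetical.

```python
from typing import Sequence


def counting_metrics(
    predicted: Sequence[float], actual: Sequence[float]
) -> tuple[float, float]:
    """Return (normalized MAE, off-by-one accuracy) over a set of videos.

    Assumed definitions (not confirmed by the source record):
      MAE  = mean(|predicted - actual| / actual)
      OBOA = fraction of videos with |predicted - actual| <= 1
    Every true count is assumed to be positive.
    """
    if len(predicted) != len(actual) or len(actual) == 0:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(actual)
    # Normalized MAE: per-video absolute error relative to the true count.
    mae = sum(abs(p - a) / a for p, a in zip(predicted, actual)) / n
    # OBOA: a prediction counts as correct when it is within one repetition.
    oboa = sum(abs(p - a) <= 1 for p, a in zip(predicted, actual)) / n
    return mae, oboa


# Hypothetical counts for four videos, for illustration only.
mae, oboa = counting_metrics([4, 6, 10, 3], [4, 5, 8, 3])
print(f"MAE = {mae:.2%}, OBOA = {oboa:.2%}")
```

Under these assumed definitions, a lower MAE and a higher OBOA both indicate better counting, which matches how the abstract presents the two figures.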
Pages: 16