A gaze-driven manufacturing assembly assistant system with integrated step recognition, repetition analysis, and real-time feedback

Cited by: 5

Authors:

Chen, Haodong [1]

Zendehdel, Niloofar [2]

Leu, Ming C. [2]

Yin, Zhaozheng [3,4]

Affiliations:

[1] Univ Maryland, Dept Mech Engn, College Pk, MD 20742 USA

[2] Missouri Univ Sci & Technol, Dept Mech & Aerosp Engn, Rolla, MO USA

[3] SUNY Stony Brook, Dept Biomed Informat, Stony Brook, NY USA

[4] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY USA

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025 / Vol. 144

Funding:

U.S. National Science Foundation;

Keywords:

Assembly assistance; Eye gaze estimation; Repetitive action counting; Transformer; Implemented artificial intelligence; Application of artificial intelligence; EYE-MOVEMENTS;

DOI:

10.1016/j.engappai.2025.110076

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Modern manufacturing faces significant challenges, including efficiency bottlenecks and high error rates in manual assembly operations. To address these challenges, we implement artificial intelligence (AI) and propose a gaze-driven assembly assistant system that leverages artificial intelligence for human-centered smart manufacturing. Our system processes video inputs of assembly activities using a Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) network for assembly step recognition, a Transformer network for repetitive action counting, and a gaze tracker for eye gaze estimation. The application of AI integrates the outputs of these tasks to deliver real-time visual assistance through a software interface that displays relevant tools, parts, and procedural instructions based on recognized steps and gaze data. Experimental results demonstrate the system's high performance, achieving 98.36% accuracy in assembly step recognition, a mean absolute error (MAE) of 4.37%, and an off-by-one accuracy (OBOA) of 95.88% in action counting. Compared to existing solutions, our gaze-driven assistant offers superior precision and efficiency, providing a scalable and adaptable framework suitable for complex and large-scale manufacturing environments.
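Note on the counting metrics: the abstract reports MAE and OBOA without defining them. The sketch below assumes the definitions commonly used in the repetition-counting literature: MAE as the absolute count error normalized by the true count and averaged over videos, and OBOA as the fraction of videos whose predicted count is within one of the true count. The function names and the sample counts are hypothetical, chosen only for illustration; they are not taken from the paper.

```python
from typing import Sequence


def normalized_mae(pred: Sequence[float], true: Sequence[float]) -> float:
    """Mean of |pred - true| / true over all videos (assumed definition)."""
    if len(pred) != len(true) or len(true) == 0:
        raise ValueError("pred and true must be non-empty and of equal length")
    if any(t <= 0 for t in true):
        raise ValueError("true counts must be positive")
    return sum(abs(p - t) / t for p, t in zip(pred, true)) / len(true)


def off_by_one_accuracy(pred: Sequence[float], true: Sequence[float]) -> float:
    """Fraction of videos with |pred - true| <= 1 (assumed definition)."""
    if len(pred) != len(true) or len(true) == 0:
        raise ValueError("pred and true must be non-empty and of equal length")
    return sum(abs(p - t) <= 1 for p, t in zip(pred, true)) / len(true)


# Hypothetical counts for four videos (illustrative only).
predicted = [10, 7, 4, 12]
actual = [10, 8, 4, 15]

print(f"MAE:  {normalized_mae(predicted, actual):.2%}")
print(f"OBOA: {off_by_one_accuracy(predicted, actual):.2%}")
```

Under these assumed definitions, a lower MAE and a higher OBOA both indicate better counting, which matches how the abstract presents the two figures.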

Pages: 16

References (63 in total):

[11] Training and Preparing Tomorrow's Workforce for the Fourth Industrial Revolution [J].

Buehler, Michael Max ;

Jelinek, Thorsten ;

Nubel, Konrad .

EDUCATION SCIENCES, 2022, 12 (11)

[12] ARGUS: Visualization of AI-Assisted Task Guidance in AR [J].

Castelo, Sonia ;

Rulff, Joao ;

McGowan, Erin ;

Steers, Bea ;

Wu, Guande ;

Chen, Shaoyu ;

Roman, Iran ;

Lopez, Roque ;

Brewer, Ethan ;

Zhao, Chen ;

Qian, Jing ;

Cho, Kyunghyun ;

He, He ;

Sun, Qi ;

Vo, Huy ;

Bello, Juan ;

Krone, Michael ;

Silva, Claudio .

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (01) :1313-1323

[13] Monitoring of Assembly Process Using Deep Learning Technology [J].

Chen, Chengjun ;

Zhang, Chunlin ;

Wang, Tiannuo ;

Li, Dongnian ;

Guo, Yang ;

Zhao, Zhengxu ;

Hong, Jun .

SENSORS, 2020, 20 (15) :1-18

[14] Repetitive assembly action recognition based on object detection and pose estimation [J].

Chen, Chengjun ;

Wang, Tiannuo ;

Li, Dongnian ;

Hong, Jun .

JOURNAL OF MANUFACTURING SYSTEMS, 2020, 55 :325-333

[15] Design of a robotic rehabilitation system for mild cognitive impairment based on computer vision [J].

Chen, Hao-Dong ;

Zhu, Hongbo ;

Teng, Zhiqiang ;

Zhao, Ping .

Journal of Engineering and Science in Medical Diagnostics and Therapy, 2020, 3 (02)

[16]

Chen HD, 2024, Arxiv, DOI arXiv:2308.08632

[17] Real-Time Human-Computer Interaction Using Eye Gazes [J].

Chen, Haodong ;

Zendehdel, Niloofar ;

Leu, Ming C. ;

Yin, Zhaozheng .

MANUFACTURING LETTERS, 2023, 35 :883-894

[18] Fine-grained activity classification in assembly based on multi-visual modalities [J].

Chen, Haodong ;

Zendehdel, Niloofar ;

Leu, Ming C. ;

Yin, Zhaozheng .

JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (05) :2215-2233

[19] Real-Time Multi-Modal Human-Robot Collaboration Using Gestures and Speech [J].

Chen, Haodong ;

Leu, Ming C. ;

Yin, Zhaozheng .

JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2022, 144 (10)

[20]

Chen HD, 2020, PROCEEDINGS OF THE 2020 INTERNATIONAL SYMPOSIUM ON FLEXIBLE AUTOMATION (ISFA2020)
