Predictive intention recognition using deep learning for collaborative assembly

被引:0
作者
Rekik, Khansa [1 ]
Gajjar, Nishant [1 ]
Silva, Grimaldo [2 ,3 ]
Milner, Rainer [1 ]
机构
[1] ZeMA gGmbh, Saarbrucken, Germany
[2] SENAI CIMATEC, Salvador, BA, Brazil
[3] Univ Estado Bahia, Salvador, BA, Brazil
来源
2024 10TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES, CODIT 2024 | 2024年
关键词
hand segmentation; intention recognition; deep learning; long short term memory; HUMAN-ROBOT COLLABORATION; HAND DETECTION;
D O I
10.1109/CoDIT62066.2024.10708459
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Achieving seamless human-robot collaboration in tasks with uncertainty requires anticipating human intentions. In order to overcome the limitations of classical techniques for inferring human intentions, which often struggle due to delayed articulation and lack of knowledge about spatio-temporal dependencies, this study proposes an approach centered around real-time prediction of human intention using Long-Short Term Memories (LSTMs). To achieve this goal, the human hands and assembly product components were detected in each frame, using a novel dataset, the former is then used as the primary input to our LSTM model. Finally, we demonstrate and validate the effectiveness of this framework in a real industrial assembly scenario, where a robotic agent utilizes the predicted intention to efficiently assist the human in successfully completing the assembly of a product. The results indicate, with high confidence, a response time about three times faster using our approach, on average, compared to waiting for the object detection module to recognize that a component was removed from a work bin.
引用
收藏
页码:1153 / 1158
页数:6
相关论文
共 22 条
[1]   Human-robot collaboration: A survey [J].
Bauer, Andrea ;
Wollherr, Dirk ;
Buss, Martin .
INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2008, 5 (01) :47-66
[2]  
Bochkovskiy A., 2020, CORR, DOI [DOI 10.48550/ARXIV.2004.10934, 10.48550/ARXIV.2004.10934]
[3]  
Chandrasekaran B., 2015, SoutheastCon 2015, P1, DOI DOI 10.1109/SECON.2015.7132964
[4]   Human intention and workspace recognition for collaborative assembly [J].
Gajjar, Nishant Ketan ;
Rekik, Khansa ;
Kanso, Ali ;
Mueller, Rainer .
IFAC PAPERSONLINE, 2022, 55 (10) :365-370
[5]   Learning Rich Features from RGB-D Images for Object Detection and Segmentation [J].
Gupta, Saurabh ;
Girshick, Ross ;
Arbelaez, Pablo ;
Malik, Jitendra .
COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 :345-360
[6]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[7]   Real-time Hand Gesture Detection and Classification Using Convolutional Neural Networks [J].
Koepueklue, Okan ;
Gunduz, Ahmet ;
Kose, Neslihan ;
Rigoll, Gerhard .
2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, :407-414
[8]   Object detection in real time based on improved single shot multi-box detector algorithm [J].
Kumar, Ashwani ;
Zhang, Zuopeng Justin ;
Lyu, Hongbo .
EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2020, 2020 (01)
[9]  
Lee J., 2021, ARXIV
[10]   Pixel-level Hand Detection in Ego-Centric Videos [J].
Li, Cheng ;
Kitani, Kris M. .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :3570-3577