Fusing Hand and Body Skeletons for Human Action Recognition in Assembly

被引:4
作者
Aganian, Dustin [1 ]
Koehler, Mona [1 ]
Stephan, Benedict [1 ]
Eisenbach, Markus [1 ]
Gross, Horst-Michael [1 ]
机构
[1] Ilmenau Univ Technol, Neuroinformat & Cognit Robot Lab, D-98693 Ilmenau, Germany
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT I | 2023年 / 14254卷
关键词
Action Recognition; Skeleton-based; Fusion; Body Skeletons; Hand Skeletons; 3D/2D Skeletons; Assembly; Deep Learning;
D O I
10.1007/978-3-031-44207-0_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As collaborative robots (cobots) continue to gain popularity in industrial manufacturing, effective human-robot collaboration becomes crucial. Cobots should be able to recognize human actions to assist with assembly tasks and act autonomously. To achieve this, skeleton-based approaches are often used due to their ability to generalize across various people and environments. Although body skeleton approaches are widely used for action recognition, they may not be accurate enough for assembly actions where the worker's fingers and hands play a significant role. To address this limitation, we propose a method in which less detailed body skeletons are combined with highly detailed hand skeletons. We investigate CNNs and transformers, the latter of which are particularly adept at extracting and combining important information from both skeleton types using attention. This paper demonstrates the effectiveness of our proposed approach in enhancing action recognition in assembly scenarios.
引用
收藏
页码:207 / 219
页数:13
相关论文
共 20 条
[1]  
Aganian D, 2023, IEEE INT C ROB AUT I
[2]   How Object Information Improves Skeleton-based Human Action Recognition in Assembly Tasks [J].
Aganian, Dustin ;
Koehler, Mona ;
Baake, Sebastian ;
Eisenbach, Markus ;
Gross, Horst-Michael .
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[3]   The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose [J].
Ben-Shabat, Yizhak ;
Yu, Xin ;
Saleh, Fatemeh ;
Campbell, Dylan ;
Rodriguez-Opazo, Cristian ;
Li, Hongdong ;
Gould, Stephen .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :846-858
[4]  
Du Y, 2015, PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, P579, DOI 10.1109/ACPR.2015.7486569
[5]   Revisiting Skeleton-based Action Recognition [J].
Duan, Haodong ;
Zhao, Yue ;
Chen, Kai ;
Lin, Dahua ;
Dai, Bo .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :2959-2968
[6]  
Eisenbach M., 2021, IEEE INT C AUT SCI E
[7]   Efficient Multi-Task Scene Analysis with RGB-D Transformers [J].
Fischedick, Soehnke Benedikt ;
Seichter, Daniel ;
Schmidt, Robin ;
Rabes, Leonard ;
Gross, Horst-Michael .
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[8]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[9]   Challenges and opportunities in human robot collaboration context of Industry 4.0-a state of the art review [J].
Inkulu, Anil Kumar ;
Bahubalendruni, M. V. A. Raju ;
Dara, Ashok ;
K., SankaranarayanaSamy .
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2022, 49 (02) :226-239
[10]   Swin Transformer V2: Scaling Up Capacity and Resolution [J].
Liu, Ze ;
Hu, Han ;
Lin, Yutong ;
Yao, Zhuliang ;
Xie, Zhenda ;
Wei, Yixuan ;
Ning, Jia ;
Cao, Yue ;
Zhang, Zheng ;
Dong, Li ;
Wei, Furu ;
Guo, Baining .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :11999-12009