Human-Robot Collaboration Through a Multi-Scale Graph Convolution Neural Network With Temporal Attention

被引:9
作者
Liu, Zhaowei [1 ]
Lu, Xilang [1 ]
Liu, Wenzhe [1 ]
Qi, Wen [2 ]
Su, Hang [3 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China
[2] South China Univ Technol, Sch Future Technol, Guangzhou 510641, Peoples R China
[3] Paris Saclay Univ, F-91190 Paris, France
关键词
Robots; Collaboration; Skeleton; Task analysis; Human activity recognition; Feature extraction; Convolutional neural networks; Human-robot collaboration; intention recognition; skeleton; graph convolutional neural network; RECOGNITION;
D O I
10.1109/LRA.2024.3355752
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Collaborative robots sensing and understanding the movements and intentions of their human partners are crucial for realizing human-robot collaboration. Human skeleton sequences are widely recognized as a kind of data with great application potential in human action recognition. In this letter, a multi-scale skeleton-based human action recognition network is proposed, which leverages a spatio-temporal attention mechanism. The network achieves high-accuracy human action prediction by aggregating multi-level key point features of the skeleton and applying the spatio-temporal attention mechanism to extract key temporal information features. In addition, a human action skeleton dataset containing eight different categories is collected for a human-robot collaboration task, where the human activity recognition network predicts skeleton sequences from a camera and the collaborating robot makes collaborative actions based on the predicted actions. In this study, the performance of the proposed method is compared with state-of-the-art human action recognition methods and ablation experiments are performed. The results show that the multi-scale spatio-temporal graph convolutional neural network has an action recognition accuracy of 94.16%. The effectiveness of the method is also verified by performing human-robot collaboration experiments on a real robot platform in a laboratory environment.
引用
收藏
页码:2248 / 2255
页数:8
相关论文
共 30 条
[1]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[2]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[3]  
Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714
[4]   Focal and Global Spatial-Temporal Transformer for Skeleton-Based Action Recognition [J].
Gao, Zhimin ;
Wang, Peitao ;
Lv, Pei ;
Jiang, Xiaoheng ;
Liu, Qidong ;
Wang, Pichao ;
Xu, Mingliang ;
Li, Wanqing .
COMPUTER VISION - ACCV 2022, PT IV, 2023, 13844 :155-171
[5]   Human-Centered Collaborative Robots With Deep Reinforcement Learning [J].
Ghadirzadeh, Ali ;
Chen, Xi ;
Yin, Wenjie ;
Yi, Zhengrong ;
Bjorkman, Marten ;
Kragic, Danica .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) :566-571
[6]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[7]   Context and Intention aware 3D Human Body Motion Prediction using an Attention Deep Learning model in Handover Tasks [J].
Laplaza, Javier ;
Moreno-Noguer, Francesc ;
Sanfeliu, Alberto .
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, :4743-4748
[8]   Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition [J].
Liu, Ziyu ;
Zhang, Hongwen ;
Chen, Zhenghao ;
Wang, Zhiyong ;
Ouyang, Wanli .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :140-149
[9]   Human-Robot Collaboration: Optimizing Stress and Productivity Based on Game Theory [J].
Messeri, Costanza ;
Masotti, Gabriele ;
Zanchettin, Andrea Maria ;
Rocco, Paolo .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) :8061-8068
[10]  
Niepert M, 2016, PR MACH LEARN RES, V48