Human-Robot Collaboration Through a Multi-Scale Graph Convolution Neural Network With Temporal Attention

被引：9

作者：

Liu, Zhaowei ^{[1
]}

Lu, Xilang ^{[1
]}

Liu, Wenzhe ^{[1
]}

Qi, Wen ^{[2
]}

Su, Hang ^{[3
]}

机构：

[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China

[2] South China Univ Technol, Sch Future Technol, Guangzhou 510641, Peoples R China

[3] Paris Saclay Univ, F-91190 Paris, France

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 03期

关键词：

Robots; Collaboration; Skeleton; Task analysis; Human activity recognition; Feature extraction; Convolutional neural networks; Human-robot collaboration; intention recognition; skeleton; graph convolutional neural network; RECOGNITION;

D O I：

10.1109/LRA.2024.3355752

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Collaborative robots sensing and understanding the movements and intentions of their human partners are crucial for realizing human-robot collaboration. Human skeleton sequences are widely recognized as a kind of data with great application potential in human action recognition. In this letter, a multi-scale skeleton-based human action recognition network is proposed, which leverages a spatio-temporal attention mechanism. The network achieves high-accuracy human action prediction by aggregating multi-level key point features of the skeleton and applying the spatio-temporal attention mechanism to extract key temporal information features. In addition, a human action skeleton dataset containing eight different categories is collected for a human-robot collaboration task, where the human activity recognition network predicts skeleton sequences from a camera and the collaborating robot makes collaborative actions based on the predicted actions. In this study, the performance of the proposed method is compared with state-of-the-art human action recognition methods and ablation experiments are performed. The results show that the multi-scale spatio-temporal graph convolutional neural network has an action recognition accuracy of 94.16%. The effectiveness of the method is also verified by performing human-robot collaboration experiments on a real robot platform in a laboratory environment.

引用

页码：2248 / 2255

页数：8

共 30 条

[1] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[2] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].

Carreira, Joao ;

Zisserman, Andrew .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733

[3]

Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714

[4] Focal and Global Spatial-Temporal Transformer for Skeleton-Based Action Recognition [J].

Gao, Zhimin ;

Wang, Peitao ;

Lv, Pei ;

Jiang, Xiaoheng ;

Liu, Qidong ;

Wang, Pichao ;

Xu, Mingliang ;

Li, Wanqing .

COMPUTER VISION - ACCV 2022, PT IV, 2023, 13844 :155-171

[5] Human-Centered Collaborative Robots With Deep Reinforcement Learning [J].

Ghadirzadeh, Ali ;

Chen, Xi ;

Yin, Wenjie ;

Yi, Zhengrong ;

Bjorkman, Marten ;

Kragic, Danica .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) :566-571

[6] Rich feature hierarchies for accurate object detection and semantic segmentation [J].

Girshick, Ross ;

Donahue, Jeff ;

Darrell, Trevor ;

Malik, Jitendra .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587

[7] Context and Intention aware 3D Human Body Motion Prediction using an Attention Deep Learning model in Handover Tasks [J].

Laplaza, Javier ;

Moreno-Noguer, Francesc ;

Sanfeliu, Alberto .

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, :4743-4748

[8] Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition [J].

Liu, Ziyu ;

Zhang, Hongwen ;

Chen, Zhenghao ;

Wang, Zhiyong ;

Ouyang, Wanli .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :140-149

[9] Human-Robot Collaboration: Optimizing Stress and Productivity Based on Game Theory [J].

Messeri, Costanza ;

Masotti, Gabriele ;

Zanchettin, Andrea Maria ;

Rocco, Paolo .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) :8061-8068

[10]

Niepert M, 2016, PR MACH LEARN RES, V48

← 1 2 3 →