Human Motion Prediction Based on Space-Time-Separable Graph Convolutional Network

被引:0
作者
Li, Rui [1 ,2 ]
He, Duo [1 ]
Yang, Shiqiang [1 ]
Yan, An [1 ]
Zeng, Xin [1 ]
Li, Dexin [1 ]
机构
[1] Xian Univ Technol, Sch Mech & Precis Instrument Engn, Xian 710048, Peoples R China
[2] Xian Peoples Hosp, Xian 710100, Peoples R China
关键词
Decoding; Predictive models; Feature extraction; Joints; Convolution; Solid modeling; Convolutional neural networks; Motion control; Human activity recognition; Human motion prediction; decoder; STS-GCN; GRU;
D O I
10.1109/ACCESS.2023.3325291
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human motion prediction is a popular method to predict future motion sequences based on past sequences, which is widely used in human-computer interaction. Space-time-separable graph Convolutional Network (STS-GCN) is a conventional mathematical model for human motion prediction. However, the uncertainty of human movements often leads to the problem of significant prediction error in the prediction results. This paper first proposed a Multi-scale STS-GCN (MSTS-GCN) model based on the conventional STS-GCN method to find the relevant factors that affect the prediction results. In our study, the constructed Multi-scale Temporal Convolutional Network (MTCN) decoder effectively reduced the human motion prediction error at specific time nodes. To expand the transmission and utilization performance in a larger receptive field, a Gated Recurrent Unit-TCN decoder was also designed. Finally, a new STS-GCN (NSTS-GCN) human motion prediction model was proposed, which realized the transmission and utilization of motion sequence features under a larger temporal perceptual field. To verify the effectiveness of NSTS-GCN, the Human3.6M dataset, AMASS, and 3DPW dataset were tested. The experimental results show that the MPJPE error of the proposed model for human joint prediction at each time node is reduced compared with the conventional STS-GCN model, and the mean reduction was achieved by 3.0mm. All the experimental results validated the effectiveness of the proposed NSTS-GCN model, which further improved the performance of human motion prediction.
引用
收藏
页码:115126 / 115139
页数:14
相关论文
共 40 条
[1]   American Sign Language Words Recognition Using Spatio-Temporal Prosodic and Angle Features: A Sequential Learning Approach [J].
Abdullahi, Sunusi Bala ;
Chamnongthai, Kosin .
IEEE ACCESS, 2022, 10 :15911-15923
[2]   American Sign Language Words Recognition of Skeletal Videos Using Processed Video Driven Multi-Stacked Deep LSTM [J].
Abdullahi, Sunusi Bala ;
Chamnongthai, Kosin .
SENSORS, 2022, 22 (04)
[3]  
Admoni H., 2016, P AAAI FALL S SER, P298
[4]   A Spatio-temporal Transformer for 3D Human Motion Prediction [J].
Aksan, Emre ;
Kaufmann, Manuel ;
Cao, Peng ;
Hilliges, Otmar .
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, :565-574
[5]  
Ballakur AA, 2020, PROCEEDINGS OF THE 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS-2020)
[6]   Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition [J].
Chen, Tailin ;
Zhou, Desen ;
Wang, Jian ;
Wang, Shidong ;
Guan, Yu ;
He, Xuming ;
Ding, Errui .
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :4334-4342
[7]   Towards Accurate 3D Human Motion Prediction from Incomplete Observations [J].
Cui, Qiongjie ;
Sun, Huaijiang .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4799-4808
[8]   Learning Dynamic Relationships for 3D Human Motion Prediction [J].
Cui, Qiongjie ;
Sun, Huaijiang ;
Yang, Fei .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6518-6526
[9]   MSR-GCN: Multi-Scale Residual Graph Convolution Networks for Human Motion Prediction [J].
Dang, Lingwei ;
Nie, Yongwei ;
Long, Chengjiang ;
Zhang, Qing ;
Li, Guiqing .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :11447-11456
[10]   A Neural Temporal Model for Human Motion Prediction [J].
Gopalakrishnan, Anand ;
Mali, Ankur ;
Kifer, Dan ;
Giles, C. Lee ;
Ororbia, Alexander G. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12108-12117