3D Skeleton-Based Human Motion Prediction Using Dynamic Multi-Scale Spatiotemporal Graph Recurrent Neural Networks

被引:3
作者
Lovanshi, Mayank [1 ]
Tiwari, Vivek [2 ]
Jain, Swati [3 ]
机构
[1] Int Inst Informat Technol, Dept Comp Sci & Engn, Naya Raipur 493661, India
[2] ABV Indian Inst Informat Technol & Management ABV, Dept Comp Sci & Engn, Gwalior 474015, India
[3] Govt J Yoganandam Chhattisgarh Coll, Raipur 492001, India
来源
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024年 / 8卷 / 01期
关键词
CMU Mocap; DMST-GRNN; graph GRU lite; human3.6M; human activity; multi-scale; skeleton; spatiotemporal;
D O I
10.1109/TETCI.2023.3318985
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A dynamic multi-scale spatiotemporal graph recurrent neural network (DMST-GRNN) model has been introduced, which is leveraged to use human motion prediction on a 3D skeleton-based human activity dataset. It offers a multi-scale approach to spatial & temporal graphs using multi-scale graph convolution units (MGCUs) to describe the human body's semantic interconnection. The proposed DMST-GRNN is an encoder-decoder framework where a series of MGCUs are used as encoders to learn spatiotemporal features and a novel graph-gated recurrent unit lite (GGRU-L) for the decoder to predict human pose. Extensive experiments have been carried out with two datasets, Human3.6M and CMU Mocap, where both short and long videos were considered to validate the performance of the proposed model. The DMST-GRNN model outperforms the existing baseline on the Human3.6M datasets by 11.95% and 7.74% of average mean angle errors (avg MAE) for short and long-term motion prediction, respectively. Similarly, CMU Mocap datasets, the DMST-GRNN model predicts future posture more accurately than the previous best approaches by 2.77% and 5.51% of average mean angle errors (avg MAE) for short and long-term motion prediction, respectively. A comparison analysis was also presented with other measures like mean angle error, prediction loss and standard deviation. A separate discussion has been included to analyze the effect of different multiscale on spatial and temporal graphs, along with the impact of MGCU unit counts.
引用
收藏
页码:164 / 174
页数:11
相关论文
共 26 条
  • [1] Graph Convolutional Neural Network for Human Action Recognition: A Comprehensive Survey
    Ahmad T.
    Jin L.
    Zhang X.
    Lai S.
    Tang G.
    Lin L.
    [J]. IEEE Transactions on Artificial Intelligence, 2021, 2 (02): : 128 - 145
  • [2] Using recurrent neural network models for early detection of heart failure onset
    Choi, Edward
    Schuetz, Andy
    Stewart, Walter F.
    Sun, Jimeng
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (02) : 361 - 370
  • [3] Dey R, 2017, MIDWEST SYMP CIRCUIT, P1597, DOI 10.1109/MWSCAS.2017.8053243
  • [4] Du Y, 2015, PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, P579, DOI 10.1109/ACPR.2015.7486569
  • [5] Adaptation supports short-term memory in a visual change detection task
    Hu, Brian
    Garrett, Marina E.
    Groblewski, Peter A.
    Ollerenshaw, Douglas R.
    Shang, Jiaqi
    Roll, Kate
    Manavi, Sahar
    Koch, Christof
    Olsen, Shawn R.
    Mihalas, Stefan
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (09)
  • [6] Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments
    Ionescu, Catalin
    Papava, Dragos
    Olaru, Vlad
    Sminchisescu, Cristian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (07) : 1325 - 1339
  • [7] Ionescu C, 2011, IEEE I CONF COMP VIS, P2220, DOI 10.1109/ICCV.2011.6126500
  • [8] Automatic Human Mocap Data Classification
    Kadu, Harshad
    Kuo, C. -C. Jay
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (08) : 2191 - 2202
  • [9] Action-Attending Graphic Neural Network
    Li, Chaolong
    Cui, Zhen
    Zheng, Wenming
    Xu, Chunyan
    Ji, Rongrong
    Yang, Jian
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) : 3657 - 3670
  • [10] Convolutional Sequence to Sequence Model for Human Dynamics
    Li, Chen
    Zhang, Zhen
    Lee, Wee Sun
    Lee, Gim Hee
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5226 - 5234