Enhanced spatial-temporal dynamics in pose forecasting through multi-graph convolution networks

被引:0
|
作者
Ren, Hongwei [1 ]
Zhang, Xiangran [1 ]
Shi, Yuhong [1 ]
Liang, Kewei [2 ]
机构
[1] Zhejiang Univ, Polytech Inst, Shixiang Rd, Hangzhou 310058, Zhejiang, Peoples R China
[2] Zhejiang Univ, Sch Math Sci, Yuhangtang Rd, Hangzhou 310015, Zhejiang, Peoples R China
关键词
Graph convolutional network; Pose prediction; Attention mechanism; MOTION;
D O I
10.1007/s13042-024-02254-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, there has been a growing interest in predicting human motion, which involves forecasting future body poses based on observed pose sequences. This task is complex due to modeling spatial and temporal relationships. Autoregressive models, including recurrent neural networks (RNNs) and their variants, as well as transformer networks, are commonly used for addressing this challenge. However, autoregressive models have several serious drawbacks, such as vanishing or exploding gradients. Other researchers have attempted to solve the communication problem in the spatial dimension by integrating graph convolutional networks (GCNs) and long short-term memory (LSTM) or convolutional neural network (CNN) models. These approaches process temporal and spatial information separately and fuse them to extract features, whereas this sequential processing hampers the model's ability to capture spatiotemporal information and perform feature extraction simultaneously. To address this in human pose forecasting, we propose a novel approach called the multi-graph convolution network (MGCN). By introducing an augmented graph for pose sequences, our model captures spatial and temporal information in one step only using GCN. Multiple frames provide multiple parts, which are joined together in a unified graph instance. Furthermore, our model investigates the impact of natural structure and sequence-aware attention. In the experimental evaluation of the large-scale benchmark datasets (Human3.6M, AMSS, and 3DPW), MGCN outperforms the state-of-the-art methods in human pose prediction.
引用
收藏
页码:5453 / 5467
页数:15
相关论文
共 50 条
  • [1] Multi-component Spatial-temporal Graph Convolution Networks for Traffic Flow Forecasting
    Feng N.
    Guo S.-N.
    Song C.
    Zhu Q.-C.
    Wan H.-Y.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (03): : 759 - 769
  • [2] Attention-based spatial-temporal synchronous graph convolution networks for traffic flow forecasting
    Xiaoduo Wei
    Dawen Xia
    Yunsong Li
    Yuce Ao
    Yan Chen
    Yang Hu
    Yantao Li
    Huaqing Li
    Applied Intelligence, 2025, 55 (7)
  • [3] STAGCN: Spatial-Temporal Attention Graph Convolution Network for Traffic Forecasting
    Gu, Yafeng
    Deng, Li
    MATHEMATICS, 2022, 10 (09)
  • [4] Predicting Traffic Flow Using Dynamic Spatial-Temporal Graph Convolution Networks
    Liu, Yunchang
    Wan, Fei
    Liang, Chengwu
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (03): : 4343 - 4361
  • [5] Graph enhanced spatial-temporal transformer for traffic flow forecasting
    Kong, Weishan
    Ju, Yanni
    Zhang, Shiyuan
    Wang, Jun
    Huang, Liwei
    Qu, Hong
    APPLIED SOFT COMPUTING, 2025, 170
  • [6] Attention-based spatial-temporal multi-graph convolutional networks for casualty prediction of terrorist attacks
    Hou, Zhiwen
    Zhou, Yuchen
    Wu, Xiaowei
    Bu, Fanliang
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (06) : 6307 - 6328
  • [7] Graph Convolution Based Spatial-Temporal Attention LSTM Model for Flood Forecasting
    Feng, Jun
    Sha, Haichao
    Ding, Yukai
    Yan, Le
    Yu, Zhangheng
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [8] STSGAN: Spatial-Temporal Global Semantic Graph Attention Convolution Networks for Urban Flow Prediction
    Zhou, Junwei
    Qin, Xizhong
    Yu, Kun
    Jia, Zhenhong
    Du, Yan
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2022, 11 (07)
  • [9] A Novel Attention-Based Dynamic Multi-Graph Spatial-Temporal Graph Neural Network Model for Traffic Prediction
    Diao, Chunyan
    Zhang, Dafang
    Liang, Wei
    Jiang, Man
    Li, Kuanching
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, : 1910 - 1923
  • [10] Multi-stage attention spatial-temporal graph networks for traffic prediction
    Yin, Xueyan
    Wu, Genze
    Wei, Jinze
    Shen, Yanming
    Qi, Heng
    Yin, Baocai
    NEUROCOMPUTING, 2021, 428 : 42 - 53