Enhanced spatial-temporal dynamics in pose forecasting through multi-graph convolution networks

Cited by: 0
Authors
Ren, Hongwei [1]
Zhang, Xiangran [1]
Shi, Yuhong [1]
Liang, Kewei [2]
Affiliations
[1] Zhejiang Univ, Polytech Inst, Shixiang Rd, Hangzhou 310058, Zhejiang, Peoples R China
[2] Zhejiang Univ, Sch Math Sci, Yuhangtang Rd, Hangzhou 310015, Zhejiang, Peoples R China
Keywords
Graph convolutional network; Pose prediction; Attention mechanism; Motion
DOI
10.1007/s13042-024-02254-9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, there has been growing interest in predicting human motion, which involves forecasting future body poses from observed pose sequences. The task is difficult because it requires modeling both spatial and temporal relationships. Autoregressive models, including recurrent neural networks (RNNs) and their variants, as well as transformer networks, are commonly used to address this challenge. However, autoregressive models have several serious drawbacks, such as vanishing or exploding gradients. Other researchers have attempted to solve the communication problem in the spatial dimension by integrating graph convolutional networks (GCNs) with long short-term memory (LSTM) or convolutional neural network (CNN) models. These approaches process temporal and spatial information separately and then fuse them to extract features, but this sequential processing hampers the model's ability to capture spatiotemporal information and extract features simultaneously. To address this in human pose forecasting, we propose a novel approach called the multi-graph convolution network (MGCN). By introducing an augmented graph for pose sequences, our model captures spatial and temporal information in a single step using only a GCN. Multiple frames provide multiple parts, which are joined together in a unified graph instance. Furthermore, we investigate the impact of natural structure and sequence-aware attention. In experiments on large-scale benchmark datasets (Human3.6M, AMASS, and 3DPW), MGCN outperforms state-of-the-art methods in human pose prediction.
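The idea of joining several frames into one unified graph can be illustrated with a small sketch. This is not the authors' MGCN implementation, which the abstract does not specify. It assumes one plausible construction: each (frame, joint) pair becomes a node, skeleton bones give the spatial edges within a frame, and each joint is linked to itself in the next frame. The function names, the toy 4-joint skeleton, and the choice of a standard symmetric-normalized GCN layer with ReLU are all illustrative assumptions.

```python
import math
import random


def build_spatiotemporal_adjacency(skeleton_edges, num_joints, num_frames):
    """Adjacency over num_frames * num_joints nodes (hypothetical construction).

    Node index is t * num_joints + j for joint j in frame t.
    """
    n = num_frames * num_joints
    adj = [[0.0] * n for _ in range(n)]
    for t in range(num_frames):
        offset = t * num_joints
        # Spatial edges: bones of the skeleton inside frame t.
        for a, b in skeleton_edges:
            adj[offset + a][offset + b] = 1.0
            adj[offset + b][offset + a] = 1.0
        # Temporal edges: the same joint in frame t and frame t + 1.
        if t + 1 < num_frames:
            for j in range(num_joints):
                u, v = offset + j, offset + num_joints + j
                adj[u][v] = 1.0
                adj[v][u] = 1.0
    return adj


def normalize_adjacency(adj):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2}."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    inv_sqrt_deg = [1.0 / math.sqrt(sum(row)) for row in a_hat]
    return [[a_hat[i][j] * inv_sqrt_deg[i] * inv_sqrt_deg[j] for j in range(n)]
            for i in range(n)]


def matmul(x, y):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)]
            for row in x]


def gcn_layer(features, adj_norm, weight):
    """One graph convolution: ReLU(A_norm @ X @ W)."""
    pre_activation = matmul(matmul(adj_norm, features), weight)
    return [[max(value, 0.0) for value in row] for row in pre_activation]


# Toy example: a 4-joint chain observed over 3 frames, 3D coordinates per joint.
skeleton_edges = [(0, 1), (1, 2), (2, 3)]
num_joints, num_frames, in_dim, out_dim = 4, 3, 3, 8

rng = random.Random(0)
features = [[rng.gauss(0.0, 1.0) for _ in range(in_dim)]
            for _ in range(num_frames * num_joints)]
weight = [[rng.gauss(0.0, 1.0) for _ in range(out_dim)]
          for _ in range(in_dim)]

adj = build_spatiotemporal_adjacency(skeleton_edges, num_joints, num_frames)
adj_norm = normalize_adjacency(adj)
out = gcn_layer(features, adj_norm, weight)
print(len(out), len(out[0]))
```

Because spatial and temporal neighbors sit in the same adjacency matrix, a single layer mixes information across joints and across frames at once, which is the property the abstract contrasts with pipelines that handle the two dimensions in separate stages.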
Pages: 5453-5467
Page count: 15