Enhanced spatial-temporal dynamics in pose forecasting through multi-graph convolution networks

被引:0
|
作者
Ren, Hongwei [1 ]
Zhang, Xiangran [1 ]
Shi, Yuhong [1 ]
Liang, Kewei [2 ]
机构
[1] Zhejiang Univ, Polytech Inst, Shixiang Rd, Hangzhou 310058, Zhejiang, Peoples R China
[2] Zhejiang Univ, Sch Math Sci, Yuhangtang Rd, Hangzhou 310015, Zhejiang, Peoples R China
关键词
Graph convolutional network; Pose prediction; Attention mechanism; MOTION;
D O I
10.1007/s13042-024-02254-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, there has been a growing interest in predicting human motion, which involves forecasting future body poses based on observed pose sequences. This task is complex due to modeling spatial and temporal relationships. Autoregressive models, including recurrent neural networks (RNNs) and their variants, as well as transformer networks, are commonly used for addressing this challenge. However, autoregressive models have several serious drawbacks, such as vanishing or exploding gradients. Other researchers have attempted to solve the communication problem in the spatial dimension by integrating graph convolutional networks (GCNs) and long short-term memory (LSTM) or convolutional neural network (CNN) models. These approaches process temporal and spatial information separately and fuse them to extract features, whereas this sequential processing hampers the model's ability to capture spatiotemporal information and perform feature extraction simultaneously. To address this in human pose forecasting, we propose a novel approach called the multi-graph convolution network (MGCN). By introducing an augmented graph for pose sequences, our model captures spatial and temporal information in one step only using GCN. Multiple frames provide multiple parts, which are joined together in a unified graph instance. Furthermore, our model investigates the impact of natural structure and sequence-aware attention. In the experimental evaluation of the large-scale benchmark datasets (Human3.6M, AMSS, and 3DPW), MGCN outperforms the state-of-the-art methods in human pose prediction.
引用
收藏
页码:5453 / 5467
页数:15
相关论文
共 50 条
  • [41] Spatial-temporal graph neural network based on gated convolution and topological attention for traffic flow prediction
    Dewei Bai
    Dawen Xia
    Dan Huang
    Yang Hu
    Yantao Li
    Huaqing Li
    Applied Intelligence, 2023, 53 : 30843 - 30864
  • [42] Spatial-temporal graph neural network based on gated convolution and topological attention for traffic flow prediction
    Bai, Dewei
    Xia, Dawen
    Huang, Dan
    Hu, Yang
    Li, Yantao
    Li, Huaqing
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30843 - 30864
  • [43] Probing Traffic Trend Forecasting via Spatial-Temporal Aware Learning-Graph Attention
    Huang, Xinyuan
    Ren, Qianqian
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [44] Spatial-temporal load forecasting of electric vehicle charging stations based on graph neural network
    Zhang, Yanyu
    Liu, Chunyang
    Rao, Xinpeng
    Zhang, Xibeng
    Zhou, Yi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (01) : 821 - 836
  • [45] Hybrid Spatial-Temporal Graph Convolutional Network for Long-Term Traffic Flow Forecasting
    Wu, Zihao
    Lou, Ping
    2023 IEEE 8TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS, ICBDA, 2023, : 224 - 229
  • [46] An Attention Enhanced Spatial-Temporal Graph Convolutional LSTM Network for Action Recognition in Karate
    Guo, Jianping
    Liu, Hong
    Li, Xi
    Xu, Dahong
    Zhang, Yihan
    APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [47] Channel attention-based spatial-temporal graph neural networks for traffic prediction
    Wang, Bin
    Gao, Fanghong
    Tong, Le
    Zhang, Qian
    Zhu, Sulei
    DATA TECHNOLOGIES AND APPLICATIONS, 2023, 58 (01) : 81 - 94
  • [48] Model-enhanced spatial-temporal attention networks for traffic density prediction
    Guo, Qi
    Tan, Qi
    Peng, Yue
    Xiao, Long
    Liu, Miao
    Shi, Benyun
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [49] A Multi-Scale Spatial-Temporal Graph Neural Network-Based Method of Multienergy Load Forecasting in Integrated Energy System
    Zhuang, Wei
    Fan, Jili
    Xia, Min
    Zhu, Kedong
    IEEE TRANSACTIONS ON SMART GRID, 2024, 15 (03) : 2652 - 2666
  • [50] MFSTGN: a multi-scale spatial-temporal fusion graph network for traffic prediction
    Ran Tian
    Chu Wang
    Jia Hu
    Zhongyu Ma
    Applied Intelligence, 2023, 53 : 22582 - 22601