Optimizing human motion prediction through decoupled motion spatio-temporal trends

被引:0
|
作者
Pan, Huan [1 ]
Ji, Ruiya [2 ]
Cao, Wenming [1 ]
Huang, Zhao [3 ]
Zhong, Jianqi [1 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
[2] Queen Mary Univ London, Dept Comp Sci, London, England
[3] Univ Northumbria, Dept Comp & Informat Sci, Newcastle, England
基金
中国国家自然科学基金;
关键词
3D human motion forecasting; Deep learning; Time series;
D O I
10.1007/s00530-025-01691-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advancements in deep learning and artificial intelligence have underscored the importance of human motion prediction in fields such as intelligent robotics, autonomous driving, and human-computer interaction. Current human motion prediction methods primarily focus on network structure and feature extraction innovations, often overlooking the underlying logic of spatio-temporal changes in motion data. This oversight can result in potential conflicts within the coupled modeling of spatial and temporal dependencies, potentially obscuring the spatio-temporal logic of human motion. In this paper, we address this issue by decoupling the spatio-temporal features, employing time series modeling for preliminary prediction, and introducing velocity data as a learning branch to capture joint dependencies. This velocity-based information more clearly represents quantitative indices related to human movement, enhancing the model's pattern recognition capability. We map the trajectory change rules to the joint change trends for future moments, thereby refining the prediction results. Additionally, we enhance local semantic information through a patching method and ensure the independence of multi-scale representations of spatial and temporal dimensions using a two-branch framework. We propose a multi-layer perceptron (MLP)-based network structure, DCMixer, designed to learn multi-scale dynamic information and perform internal feature extraction. Our approach achieves spatio-temporal fusion with greater kinematic logic, significantly improving model performance. We evaluated our method on three public datasets, demonstrating superior prediction performance compared to state-of-the-art methods. The code is publicly available at https://github.com/Dabanshou/STTSN.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Spatio-temporal aggregation of skeletal motion features for human motion prediction
    Ueda, Itsuki
    Shishido, Hidehiko
    Kitahara, Itaru
    ARRAY, 2022, 15
  • [2] Human Motion Prediction via Spatio-Temporal Inpainting
    Ruiz, A. Hernandez
    Gall, J.
    Moreno-Noguer, F.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7133 - 7142
  • [3] Spatio-temporal structure of human motion primitives and its application to motion prediction
    Takano, Wataru
    Imagawa, Hirotaka
    Nakamura, Yoshihiko
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 : 288 - 296
  • [4] Spatio-Temporal Branching for Motion Prediction using Motion Increments
    Wang, Jiexin
    Zhou, Yujie
    Qiang, Wenwen
    Ba, Ying
    Su, Bing
    Wen, Ji-Rong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4290 - 4299
  • [5] Spatio-Temporal Gating-Adjacency GCN for Human Motion Prediction
    Zhong, Chongyang
    Hu, Lei
    Zhang, Zihao
    Ye, Yongjing
    Xia, Shihong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6437 - 6446
  • [6] Spatio-Temporal Gating-Adjacency GCN for Human Motion Prediction
    Zhong, Chongyang
    Hu, Lei
    Zhang, Zihao
    Ye, Yongjing
    Xia, Shihong
    arXiv, 2022,
  • [7] Spatio-Temporal Gating-Adjacency GCN for Human Motion Prediction
    Zhong, Chongyang
    Hu, Lei
    Zhang, Zihao
    Ye, Yongjing
    Xia, Shihong
    Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022, 2022-June : 6437 - 6446
  • [8] A Spatio-temporal Transformer for 3D Human Motion Prediction
    Aksan, Emre
    Kaufmann, Manuel
    Cao, Peng
    Hilliges, Otmar
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 565 - 574
  • [9] TrajectoryCNN: A New Spatio-Temporal Feature Learning Network for Human Motion Prediction
    Liu, Xiaoli
    Yin, Jianqin
    Liu, Jin
    Ding, Pengxiang
    Liu, Jun
    Liu, Huaping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2133 - 2146
  • [10] KSOF: Leveraging kinematics and spatio-temporal optimal fusion for human motion prediction
    Ding, Rui
    Qu, Kehua
    Tang, Jin
    PATTERN RECOGNITION, 2025, 161