Optimizing human motion prediction through decoupled motion spatio-temporal trends

被引:0
|
作者
Pan, Huan [1 ]
Ji, Ruiya [2 ]
Cao, Wenming [1 ]
Huang, Zhao [3 ]
Zhong, Jianqi [1 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
[2] Queen Mary Univ London, Dept Comp Sci, London, England
[3] Univ Northumbria, Dept Comp & Informat Sci, Newcastle, England
基金
中国国家自然科学基金;
关键词
3D human motion forecasting; Deep learning; Time series;
D O I
10.1007/s00530-025-01691-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advancements in deep learning and artificial intelligence have underscored the importance of human motion prediction in fields such as intelligent robotics, autonomous driving, and human-computer interaction. Current human motion prediction methods primarily focus on network structure and feature extraction innovations, often overlooking the underlying logic of spatio-temporal changes in motion data. This oversight can result in potential conflicts within the coupled modeling of spatial and temporal dependencies, potentially obscuring the spatio-temporal logic of human motion. In this paper, we address this issue by decoupling the spatio-temporal features, employing time series modeling for preliminary prediction, and introducing velocity data as a learning branch to capture joint dependencies. This velocity-based information more clearly represents quantitative indices related to human movement, enhancing the model's pattern recognition capability. We map the trajectory change rules to the joint change trends for future moments, thereby refining the prediction results. Additionally, we enhance local semantic information through a patching method and ensure the independence of multi-scale representations of spatial and temporal dimensions using a two-branch framework. We propose a multi-layer perceptron (MLP)-based network structure, DCMixer, designed to learn multi-scale dynamic information and perform internal feature extraction. Our approach achieves spatio-temporal fusion with greater kinematic logic, significantly improving model performance. We evaluated our method on three public datasets, demonstrating superior prediction performance compared to state-of-the-art methods. The code is publicly available at https://github.com/Dabanshou/STTSN.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] SPATIO-TEMPORAL PREDICTION IN VIDEO CODING BY SPATIALLY REFINED MOTION COMPENSATION
    Seiler, Juergen
    Kaup, Andre
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 2788 - 2791
  • [22] Flow-Based Spatio-Temporal Structured Prediction of Motion Dynamics
    Zand, Mohsen
    Etemad, Ali
    Greenspan, Michael
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13523 - 13535
  • [23] Spatio-Temporal Articulation & Coordination Co-attention Graph Network for human motion prediction
    Zhu, Shuang
    Chen, Jin
    Su, Yong
    SIGNAL PROCESSING, 2024, 223
  • [24] Robot Motion Planning as Video Prediction: A Spatio-Temporal Neural Network-based Motion Planner
    Zang, Xiao
    Yin, Miao
    Huang, Lingyi
    Yu, Jingjin
    Zonouz, Saman
    Yuan, Bo
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 12492 - 12499
  • [25] In vivo validation of spatio-temporal liver motion prediction from motion tracked on MR thermometry images
    C. Tanner
    Y. Zur
    K. French
    G. Samei
    J. Strehlow
    G. Sat
    H. McLeod
    G. Houston
    S. Kozerke
    G. Székely
    A. Melzer
    T. Preusser
    International Journal of Computer Assisted Radiology and Surgery, 2016, 11 : 1143 - 1152
  • [26] In vivo validation of spatio-temporal liver motion prediction from motion tracked on MR thermometry images
    Tanner, C.
    Zur, Y.
    French, K.
    Samei, G.
    Strehlow, J.
    Sat, G.
    McLeod, H.
    Houston, G.
    Kozerke, S.
    Szekely, G.
    Melzer, A.
    Preusser, T.
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2016, 11 (06) : 1143 - 1152
  • [27] Spatio-temporal motion estimation for transparency and occlusions
    Barth, E
    Stuke, I
    Aach, T
    Mota, C
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 65 - 68
  • [28] Motion estimation based on spatio-temporal correlations
    Yoon, HS
    Lee, GS
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 2, PROCEEDINGS, 2003, : 359 - 362
  • [29] ON THE SPATIO-TEMPORAL DETERMINANTS OF SOME MOTION EFFECTS
    CAELLI, T
    ACTA PSYCHOLOGICA, 1981, 48 (1-3) : 175 - 185
  • [30] A spatio-temporal filtering approach to motion segmentation
    Chamorro-Martínez, J
    Fdez-Valdivia, J
    Martinez-Baena, J
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2003, 2652 : 193 - 203