KSOF: Leveraging kinematics and spatio-temporal optimal fusion for human motion prediction

被引:0
|
作者
Ding, Rui [1 ]
Qu, Kehua [1 ]
Tang, Jin [2 ]
机构
[1] Capital Normal Univ, Informat Engn Coll, Beijing 100048, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Intelligent Engn & Automat, Beijing 100876, Peoples R China
关键词
Human motion prediction; Kinematic constraints; Spatio-temporal optimal fusion;
D O I
10.1016/j.patcog.2024.111206
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ignoring the meaningful kinematics law, which generates improbable or impractical predictions, is one of the obstacles to human motion prediction. Current methods attempt to tackle this problem by taking simple kinematics information as auxiliary features to improve predictions. However, it remains challenging to utilize human prior knowledge deeply, such as the trajectory formed by the same joint should be smooth and continuous in this task. In this paper, we advocate explicitly describing kinematics information via velocity and acceleration by proposing a novel loss called joint point smoothness (JPS) loss, which calculates the acceleration of joints to smooth the sudden change in joint velocity. In addition, capturing spatio-temporal dependencies to make feature representations more informative is also one of the obstacles in this task. Therefore, we propose a dual-path network (KSOF) that models the temporal and spatial dependencies from kinematic temporal convolutional network (K-TCN) and spatial graph convolutional networks (S-GCN), respectively. Moreover, we propose a novel multi-scale fusion module named spatio-temporal optimal fusion (SOF) to enhance extraction of the essential correlation and important features at different scales from spatiotemporal coupling features. We evaluate our approach on three standard benchmark datasets, including Human3.6M, CMU-Mocap, and 3DPW datasets. For both short-term and long-term predictions, our method achieves outstanding performance on all these datasets. The code is available at https://github.com/qukehua/ KSOF.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Optimal prediction of user mobility based on spatio-temporal matching
    Ajinu, A.
    Maheswaran, C. P.
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2022, 13 (06)
  • [22] Spatio-Temporal Articulation & Coordination Co-attention Graph Network for human motion prediction
    Zhu, Shuang
    Chen, Jin
    Su, Yong
    SIGNAL PROCESSING, 2024, 223
  • [23] Spatio-Temporal Information Fusion and Filtration for Human Action Recognition
    Zhang, Man
    Li, Xing
    Wu, Qianhan
    SYMMETRY-BASEL, 2023, 15 (12):
  • [24] Multi-Agent Trajectory Prediction With Spatio-Temporal Sequence Fusion
    Wang, Yu
    Chen, Shiwei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 13 - 23
  • [25] Vehicle trajectory prediction based on spatio-temporal Transformer feature fusion
    Zhao, Wenhong
    Wang, Wei
    Wan, Zilu
    Tongxin Xuebao/Journal on Communications, 2024, 45 (11): : 267 - 276
  • [26] Querying complex spatio-temporal sequences in human motion databases
    Chen, Yueguo
    Jiang, Shouxu
    Ooi, Beng Chin
    Tung, Anthony K. H.
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 90 - 99
  • [27] Recognizing Human Actions by Using Spatio-temporal Motion Descriptors
    Utasi, Akos
    Kovacs, Andrea
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PT II, 2010, 6475 : 366 - 375
  • [28] SPATIO-TEMPORAL PREDICTION IN VIDEO CODING BY SPATIALLY REFINED MOTION COMPENSATION
    Seiler, Juergen
    Kaup, Andre
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 2788 - 2791
  • [29] Flow-Based Spatio-Temporal Structured Prediction of Motion Dynamics
    Zand, Mohsen
    Etemad, Ali
    Greenspan, Michael
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13523 - 13535
  • [30] On the Effect of Misregistration on Spatio-temporal Fusion
    Tang, Yijie
    Wang, Qunming
    2019 10TH INTERNATIONAL WORKSHOP ON THE ANALYSIS OF MULTITEMPORAL REMOTE SENSING IMAGES (MULTITEMP), 2019,