Human Motion Prediction via Spatio-Temporal Inpainting

被引:97
|
作者
Ruiz, A. Hernandez [1 ]
Gall, J. [2 ]
Moreno-Noguer, F. [1 ]
机构
[1] CSIC UPC, Inst Robot & Informat Ind, Barcelona, Spain
[2] Univ Bonn, Comp Vis Grp, Bonn, Germany
关键词
D O I
10.1109/ICCV.2019.00723
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a Generative Adversarial Network (GAN) to forecast 3D human motion given a sequence of past 3D skeleton poses. While recent GANs have shown promising results, they can only forecast plausible motion over relatively short periods of time (few hundred milliseconds) and typically ignore the absolute position of the skeleton w.r.t. the camera. Our scheme provides long term predictions (two seconds or more) for both the body pose and its absolute position. Our approach builds upon three main contributions. First, we represent the data using a spatiotemporal tensor of 3D skeleton coordinates which allows formulating the prediction problem as an inpainting one, for which GANs work particularly well. Secondly, we design an architecture to learn the joint distribution of body poses and global motion, capable to hypothesize large chunks of the input 3D tensor with missing data. And finally, we argue that the L2 metric, considered so far by most approaches, fails to capture the actual distribution of long-term human motion. We propose two alternative metrics, based on the distribution of frequencies, that are able to capture more realistic motion patterns. Extensive experiments demonstrate our approach to significantly improve the state of the art, while also handling situations in which past observations are corrupted by occlusions, noise and missing frames.
引用
收藏
页码:7133 / 7142
页数:10
相关论文
共 50 条
  • [1] Spatio-temporal aggregation of skeletal motion features for human motion prediction
    Ueda, Itsuki
    Shishido, Hidehiko
    Kitahara, Itaru
    ARRAY, 2022, 15
  • [2] Optimizing human motion prediction through decoupled motion spatio-temporal trends
    Pan, Huan
    Ji, Ruiya
    Cao, Wenming
    Huang, Zhao
    Zhong, Jianqi
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [3] Spatio-temporal structure of human motion primitives and its application to motion prediction
    Takano, Wataru
    Imagawa, Hirotaka
    Nakamura, Yoshihiko
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 : 288 - 296
  • [4] SPATIO-TEMPORAL BINARY VIDEO INPAINTING VIA THRESHOLD DYNAMICS
    Oliver, M.
    Palomares, R. P.
    Ballester, C.
    Haro, G.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1822 - 1826
  • [5] Spatio-Temporal Gating-Adjacency GCN for Human Motion Prediction
    Zhong, Chongyang
    Hu, Lei
    Zhang, Zihao
    Ye, Yongjing
    Xia, Shihong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6437 - 6446
  • [6] Spatio-Temporal Gating-Adjacency GCN for Human Motion Prediction
    Zhong, Chongyang
    Hu, Lei
    Zhang, Zihao
    Ye, Yongjing
    Xia, Shihong
    arXiv, 2022,
  • [7] Spatio-Temporal Gating-Adjacency GCN for Human Motion Prediction
    Zhong, Chongyang
    Hu, Lei
    Zhang, Zihao
    Ye, Yongjing
    Xia, Shihong
    Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022, 2022-June : 6437 - 6446
  • [8] A Spatio-temporal Transformer for 3D Human Motion Prediction
    Aksan, Emre
    Kaufmann, Manuel
    Cao, Peng
    Hilliges, Otmar
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 565 - 574
  • [9] Spatio-Temporal Branching for Motion Prediction using Motion Increments
    Wang, Jiexin
    Zhou, Yujie
    Qiang, Wenwen
    Ba, Ying
    Su, Bing
    Wen, Ji-Rong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4290 - 4299
  • [10] TrajectoryCNN: A New Spatio-Temporal Feature Learning Network for Human Motion Prediction
    Liu, Xiaoli
    Yin, Jianqin
    Liu, Jin
    Ding, Pengxiang
    Liu, Jun
    Liu, Huaping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2133 - 2146