HP-GAN: Probabilistic 3D human motion prediction via GAN

被引:186
作者
Barsoum, Emad [1 ]
Kender, John [1 ]
Liu, Zicheng [2 ]
机构
[1] Columbia Univ, 116th St & Broadway, New York, NY 10027 USA
[2] Microsoft, One Microsoft Way, Redmond, WA 98052 USA
来源
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) | 2018年
关键词
D O I
10.1109/CVPRW.2018.00191
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting and understanding human motion dynamics has many applications, such as motion synthesis, augmented reality, security, and autonomous vehicles. Due to the recent success of generative adversarial networks (GAN), there has been much interest in probabilistic estimation and synthetic data generation using deep neural network architectures and learning algorithms. We propose a novel sequence-to-sequence model for probabilistic human motion prediction, trained with a modified version of improved Wasserstein generative adversarial networks (WGAN-GP), in which we use a custom loss function designed for human motion prediction. Our model, which we call HP-GAN, learns a probability density function of future human poses conditioned on previous poses. It predicts multiple sequences of possible future human poses, each from the same input sequence but a different vector z drawn from a random distribution. Furthermore, to quantify the quality of the non-deterministic predictions, we simultaneously train a motion-quality-assessment model that learns the probability that a given skeleton sequence is a real human motion. We test our algorithm on two of the largest skeleton datasets: NTURGB-D and Human3.6M. We train our model on both single and multiple action types. Its predictive power for long-term motion estimation is demonstrated by generating multiple plausible futures of more than 30 frames from just 10 frames of input. We show that most sequences generated from the same input have more than 50% probabilities of being judged as a real human sequence. We published all the code used in this paper to https://github.com/ebarsoum/hpgan.
引用
收藏
页码:1499 / 1508
页数:10
相关论文
共 42 条
  • [1] [Anonymous], ABS170106264 CORR
  • [2] [Anonymous], 2017, PROC INT C LEARN REP
  • [3] [Anonymous], 2013, ICML
  • [4] Arjovsky M, 2017, PR MACH LEARN RES, V70
  • [5] Baccouche Moez, 2011, Human Behavior Unterstanding. Proceedings Second International Workshop, HBU 2011, P29, DOI 10.1007/978-3-642-25446-8_4
  • [6] Butepage J., 2017, ABS170207486 CORR
  • [7] Chen B., 2017, ABS170604124 CORR
  • [8] Chung J., 2014, ARXIV
  • [9] Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
  • [10] Recurrent Network Models for Human Dynamics
    Fragkiadaki, Katerina
    Levine, Sergey
    Felsen, Panna
    Malik, Jitendra
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4346 - 4354