Learning disentangled representations for controllable human motion prediction

被引:4
作者
Gu, Chunzhi [1 ]
Yu, Jun [2 ]
Zhang, Chao [1 ,3 ]
机构
[1] Univ Fukui, Fukui, Fukui 9108507, Japan
[2] Niigata Univ, Niigata, Niigata 9502181, Japan
[3] Univ Fukui, 3-9-1,Bunkyo, Fukui, Fukui 9108507, Japan
关键词
Stochastic motion prediction; Deep generative model; Disentanglement learning;
D O I
10.1016/j.patcog.2023.109998
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative model-based motion prediction techniques have recently realized predicting controlled human motions, such as predicting multiple upper human body motions with similar lower-body motions. However, to achieve this, the state-of-the-art methods require either subsequently learning mapping functions to seek similar motions or training the model repetitively to enable control over the desired portion of body. In this paper, we propose a novel framework to learn disentangled representations for controllable human motion prediction. Our task is to predict multiple future human motions based on the past observed sequence, with the control of partial-body movements. Our network involves a conditional variational auto-encoder (CVAE) architecture to model full-body human motion, and an extra CVAE path to learn only the corresponding partial -body (e.g., lower-body) motion. Specifically, the inductive bias imposed by the extra CVAE path encourages two latent variables in two paths to respectively govern separate representations for each partial-body motion. With a single training, our model is able to provide two types of controls for the generated human motions: (i) strictly controlling one portion of human body and (ii) adaptively controlling the other portion, by sampling from a pair of latent spaces. Additionally, we extend and adapt a sampling strategy to our trained model to diversify the controllable predictions. Our framework also potentially allows new forms of control by flexibly customizing the input for the extra CVAE path. Extensive experimental results and ablation studies demonstrate that our approach is capable of predicting state-of-the-art controllable human motions both qualitatively and quantitatively.
引用
收藏
页数:12
相关论文
共 53 条
  • [1] A Spatio-temporal Transformer for 3D Human Motion Prediction
    Aksan, Emre
    Kaufmann, Manuel
    Cao, Peng
    Hilliges, Otmar
    [J]. 2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 565 - 574
  • [2] Contextually Plausible and Diverse 3D Human Motion Prediction
    Aliakbarian, Sadegh
    Saleh, Fatemeh
    Petersson, Lars
    Gould, Stephen
    Salzmann, Mathieu
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11313 - 11322
  • [3] HP-GAN: Probabilistic 3D human motion prediction via GAN
    Barsoum, Emad
    Kender, John
    Liu, Zicheng
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1499 - 1508
  • [4] Accurate and Diverse Sampling of Sequences based on a "Best of Many" Sample Objective
    Bhattacharyya, Apratim
    Schiele, Bernt
    Fritz, Mario
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8485 - 8493
  • [5] Behavior-Driven Synthesis of Human Dynamics
    Blattmann, Andreas
    Milbich, Timo
    Dorkenwald, Michael
    Ommer, Bjoern
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12231 - 12241
  • [6] Charakorn R, 2020, Arxiv, DOI arXiv:2001.08957
  • [7] Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, DOI 10.48550/ARXIV.1406.1078]
  • [8] Cui A., 2021, P IEEE CVF ICCV, P16107
  • [9] Diverse Human Motion Prediction via Gumbel-Softmax Sampling from an Auxiliary Space
    Dang, Lingwei
    Nie, Yongwei
    Long, Chengjiang
    Zhang, Qing
    Li, Guiqing
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5162 - 5171
  • [10] MSR-GCN: Multi-Scale Residual Graph Convolution Networks for Human Motion Prediction
    Dang, Lingwei
    Nie, Yongwei
    Long, Chengjiang
    Zhang, Qing
    Li, Guiqing
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11447 - 11456