Diverse Human Motion Prediction Guided by Multi-level Spatial-Temporal Anchors

被引:21
|
作者
Xu, Sirui [1 ]
Wang, Yu-Xiong [1 ]
Gui, Liang-Yan [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
来源
COMPUTER VISION, ECCV 2022, PT XXII | 2022年 / 13682卷
基金
美国国家科学基金会;
关键词
Stochastic human motion prediction; Generative models; Graph neural networks;
D O I
10.1007/978-3-031-20047-2_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting diverse human motions given a sequence of historical poses has received increasing attention. Despite rapid progress, existing work captures the multi-modal nature of human motions primarily through likelihood-based sampling, where the mode collapse has been widely observed. In this paper, we propose a simple yet effective approach that disentangles randomly sampled codes with a deterministic learnable component named anchors to promote sample precision and diversity. Anchors are further factorized into spatial anchors and temporal anchors, which provide attractively interpretable control over spatial-temporal disparity. In principle, our spatial-temporal anchor-based sampling (STARS) can be applied to different motion predictors. Here we propose an interaction-enhanced spatial-temporal graph convolutional network (IE-STGCN) that encodes prior knowledge of human motions (e.g., spatial locality), and incorporate the anchors into it. Extensive experiments demonstrate that our approach outperforms state of the art in both stochastic and deterministic prediction, suggesting it as a unified framework for modeling human motions. Our code and pretrained models are available at https://github.com/Sirui-Xu/STARS.
引用
收藏
页码:251 / 269
页数:19
相关论文
共 33 条
  • [31] PH-GCN: Boosting Human Action Recognition Through Multi-Level Granularity With Pair-Wise Hyper GCN
    Alsarhan, Tamam
    Ali, Syed Sadaf
    Ganapathi, Iyyakutti Iyappan
    Ali, Ahmad
    Werghi, Naoufel
    IEEE ACCESS, 2024, 12 : 162608 - 162621
  • [32] Multivariate multi-step time series prediction of induction motor situation based on fused temporal-spatial features
    Chen, Caifeng
    Yuan, Yiping
    Sun, Wenlei
    Zhao, Feiyang
    INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2024, 50 : 1386 - 1394
  • [33] QMEDNet: A quaternion-based multi-order differential encoder-decoder model for 3D human motion prediction
    Cao, Wenming
    Li, Shuangshuang
    Zhong, Jianqi
    NEURAL NETWORKS, 2022, 154 : 141 - 151