Diverse Human Motion Prediction Guided by Multi-level Spatial-Temporal Anchors

被引：21

作者：

Xu, Sirui ^{[1
]}

Wang, Yu-Xiong ^{[1
]}

Gui, Liang-Yan ^{[1
]}

机构：

[1] Univ Illinois, Champaign, IL 61820 USA

来源：

COMPUTER VISION, ECCV 2022, PT XXII | 2022年 / 13682卷

基金：

美国国家科学基金会;

关键词：

Stochastic human motion prediction; Generative models; Graph neural networks;

D O I：

10.1007/978-3-031-20047-2_15

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Predicting diverse human motions given a sequence of historical poses has received increasing attention. Despite rapid progress, existing work captures the multi-modal nature of human motions primarily through likelihood-based sampling, where the mode collapse has been widely observed. In this paper, we propose a simple yet effective approach that disentangles randomly sampled codes with a deterministic learnable component named anchors to promote sample precision and diversity. Anchors are further factorized into spatial anchors and temporal anchors, which provide attractively interpretable control over spatial-temporal disparity. In principle, our spatial-temporal anchor-based sampling (STARS) can be applied to different motion predictors. Here we propose an interaction-enhanced spatial-temporal graph convolutional network (IE-STGCN) that encodes prior knowledge of human motions (e.g., spatial locality), and incorporate the anchors into it. Extensive experiments demonstrate that our approach outperforms state of the art in both stochastic and deterministic prediction, suggesting it as a unified framework for modeling human motions. Our code and pretrained models are available at https://github.com/Sirui-Xu/STARS.

引用

页码：251 / 269

页数：19

共 33 条

[31] PH-GCN: Boosting Human Action Recognition Through Multi-Level Granularity With Pair-Wise Hyper GCN
Alsarhan, Tamam
Ali, Syed Sadaf
Ganapathi, Iyyakutti Iyappan
Ali, Ahmad
Werghi, Naoufel
IEEE ACCESS, 2024, 12 : 162608 - 162621
[32] Multivariate multi-step time series prediction of induction motor situation based on fused temporal-spatial features
Chen, Caifeng
Yuan, Yiping
Sun, Wenlei
Zhao, Feiyang
INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2024, 50 : 1386 - 1394
[33] QMEDNet: A quaternion-based multi-order differential encoder-decoder model for 3D human motion prediction
Cao, Wenming
Li, Shuangshuang
Zhong, Jianqi
NEURAL NETWORKS, 2022, 154 : 141 - 151

← 1 2 3 4 →