Back to MLP: A Simple Baseline for Human Motion Prediction

被引:83
作者
Guo, Wen [1 ]
Du, Yuming [3 ]
Shen, Xi [4 ]
Lepetit, Vincent [3 ]
Alameda-Pineda, Xavier [1 ]
Moreno-Noguer, Francesc [2 ]
机构
[1] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP,LJK, F-38000 Grenoble, France
[2] CSIC UPC, Inst Robot & Informat Ind, Barcelona, Spain
[3] Univ Gustave Eiffel, CNRS, Ecole Ponts, LIGM, Paris, France
[4] Tencent AI Lab, Bellevue, WA 98004 USA
来源
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023年
基金
欧盟地平线“2020”;
关键词
D O I
10.1109/WACV56688.2023.00479
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper tackles the problem of human motion prediction, consisting in forecasting future body poses from historically observed sequences. State-of-the-art approaches provide good results, however, they rely on deep learning architectures of arbitrary complexity, such as Recurrent Neural Networks (RNN), Transformers or Graph Convolutional Networks (GCN), typically requiring multiple training stages and more than 2 million parameters. In this paper, we show that, after combining with a series of standard practices, such as applying Discrete Cosine Transform (DCT), predicting residual displacement of joints and optimizing velocity as an auxiliary loss, a light-weight network based on multi-layer perceptrons (MLPs) with only 0.14 million parameters can surpass the state-of-the-art performance. An exhaustive evaluation on the Human3.6M, AMASS, and 3DPW datasets shows that our method, named SIMLPE, consistently outperforms all other approaches. We hope that our simple method could serve as a strong baseline for the community and allow re-thinking of the human motion prediction problem. The code is publicly available at https://github.com/dulucas/siMLPe.
引用
收藏
页码:4798 / 4808
页数:11
相关论文
共 61 条
[1]  
Advanced Computing Center for the Arts and Design, ACCAD MOCAP DAT
[2]  
Akhter Ijaz, 2015, C COMP VIS PATT REC
[3]  
Aksan Emre, 2021, 3DV
[4]  
[Anonymous], 2007, Advances in Neural Information Processing Systems
[5]   Digital Dance Ethnography: Organizing Large Dance Collections [J].
Aristidou, Andreas ;
Shamir, Ariel ;
Chrysanthou, Yiorgos .
ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2019, 12 (04)
[6]  
Ba J.L., 2016, Layer Normalization
[7]  
Bio Motion Lab, BMLHANDBALL MOT CAPT
[8]  
Bogo F., 2017, C COMP VIS PATT REC
[9]  
Bouazizi Arij, 2022, ARXIV220700499
[10]  
Brand Matthew, 2000, COMPUTER GRAPHICS IN