On human motion prediction using recurrent neural networks

被引:663
作者
Martinez, Julieta [1 ,4 ]
Black, Michael J. [2 ]
Romero, Javier [3 ]
机构
[1] Univ British Columbia, Vancouver, BC, Canada
[2] MPI Intelligent Syst, Tubingen, Germany
[3] Body Labs Inc, New York, NY USA
[4] MPI, Tubingen, Germany
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/CVPR.2017.497
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human motion modelling is a classical problem at the intersection of graphics and computer vision, with applications spanning human-computer interaction, motion synthesis, and motion prediction for virtual and augmented reality. Following the success of deep learning methods in several computer vision tasks, recent work has focused on using deep recurrent neural networks (RNNs) to model human motion, with the goal of learning time-dependent representations that perform tasks such as short-term motion prediction and long-term human motion synthesis. We examine recent work, with a focus on the evaluation methodologies commonly used in the literature, and show that, surprisingly, state-of-the-art performance can be achieved by a simple baseline that does not attempt to model motion at all. We investigate this result, and analyze recent RNN methods by looking at the architectures, loss functions, and training procedures used in state-of-the-art approaches. We propose three changes to the standard RNN models typically used for human motion, which result in a simple and scalable RNN architecture that obtains state-of-the-art performance on human motion prediction.
引用
收藏
页码:4674 / 4683
页数:10
相关论文
共 53 条
[1]  
[Anonymous], CVIU
[2]  
[Anonymous], 2014, Generating sequences with recurrent neural networks
[3]  
[Anonymous], 2015, INT C COMP VIS ICCCV
[4]  
[Anonymous], ECCV
[5]  
[Anonymous], ICML
[6]  
[Anonymous], 2000, SIGGRAPH
[7]  
[Anonymous], 2014, P SSST EMNLP 2014 8
[8]  
[Anonymous], 2011, INT C ARTIFICIAL INT
[9]  
[Anonymous], 2014, CVPR
[10]  
[Anonymous], 2016, COMPUTER VISION PATT