Structured Prediction Helps 3D Human Motion Modelling

Cited by: 145
Authors
Aksan, Emre [1]
Kaufmann, Manuel [1]
Hilliges, Otmar [1]
Affiliations
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
Source
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019
DOI
10.1109/ICCV.2019.00724
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human motion prediction is a challenging and important task in many computer vision application domains. Existing work only implicitly models the spatial structure of the human skeleton. In this paper, we propose a novel approach that decomposes the prediction into individual joints by means of a structured prediction layer that explicitly models the joint dependencies. This is implemented via a hierarchy of small-sized neural networks connected analogously to the kinematic chains in the human body as well as a joint-wise decomposition in the loss function. The proposed layer is agnostic to the underlying network and can be used with existing architectures for motion modelling. Prior work typically leverages the H3.6M dataset. We show that some state-of-the-art techniques do not perform well when trained and tested on AMASS, a recently released dataset 14 times the size of H3.6M. Our experiments indicate that the proposed layer increases the performance of motion forecasting irrespective of the base network, joint-angle representation, and prediction horizon. We furthermore show that the layer also improves motion predictions qualitatively. We make code and models publicly available at https://ait.ethz.ch/projects/2019/spl.
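The structured prediction layer described in the abstract can be illustrated with a minimal sketch. This is not the authors' released implementation: the five-joint skeleton, the layer sizes, the choice to feed each joint only its immediate parent's output, and the function names (`init_params`, `structured_predict`, `per_joint_loss`) are all assumptions made for illustration. It shows one small network per joint, evaluated in parent-before-child order, together with a loss that is computed joint by joint.

```python
import math
import random

# Hypothetical toy skeleton: parent index per joint, -1 marks the root.
# Parents must appear before their children in this list.
PARENTS = [-1, 0, 1, 0, 3]
JOINT_DIM = 3      # assumed per-joint output size (e.g. an axis-angle vector)
CONTEXT_DIM = 16   # assumed size of the base network's output
HIDDEN_DIM = 8     # assumed width of each small per-joint network


def init_linear(rng, n_in, n_out):
    """Return (weights, bias) for a dense layer with small random weights."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def linear(layer, x):
    """Apply a dense layer to the input vector x."""
    weights, bias = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def init_params(rng):
    """Create one small two-layer network per joint."""
    params = []
    for parent in PARENTS:
        # Non-root joints also receive their parent's prediction as input.
        n_in = CONTEXT_DIM + (JOINT_DIM if parent >= 0 else 0)
        params.append((init_linear(rng, n_in, HIDDEN_DIM),
                       init_linear(rng, HIDDEN_DIM, JOINT_DIM)))
    return params


def structured_predict(context, params):
    """Predict every joint, walking the kinematic tree from root to leaves."""
    joints = []
    for (layer1, layer2), parent in zip(params, PARENTS):
        # List concatenation: context features followed by the parent's output.
        x = context if parent < 0 else context + joints[parent]
        hidden = [max(0.0, v) for v in linear(layer1, x)]  # ReLU
        joints.append(linear(layer2, hidden))
    return joints


def per_joint_loss(pred, target):
    """Joint-wise loss: Euclidean error of each joint, summed over joints."""
    return sum(math.dist(p, t) for p, t in zip(pred, target))


rng = random.Random(0)
params = init_params(rng)
context = [rng.gauss(0.0, 1.0) for _ in range(CONTEXT_DIM)]
pred = structured_predict(context, params)
target = [[0.0] * JOINT_DIM for _ in PARENTS]
loss = per_joint_loss(pred, target)
```

Because `structured_predict` only consumes a context vector, any base network that produces such a vector could in principle sit in front of it, which is the sense in which the abstract calls the layer agnostic to the underlying architecture.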
Pages: 7143-7152
Page count: 10