Adversarial Geometry-Aware Human Motion Prediction

被引：174

作者：

Gui, Liang-Yan ^{[1
]}

Wang, Yu-Xiong ^{[1
]}

Liang, Xiaodan ^{[1
]}

Moura, Jose M. F. ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

来源：

COMPUTER VISION - ECCV 2018, PT IV | 2018年 / 11208卷

关键词：

Human motion prediction; Adversarial learning; Geodesic loss; MODELS;

D O I：

10.1007/978-3-030-01225-0_48

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We explore an approach to forecasting human motion in a few milliseconds given an input 3D skeleton sequence based on a recurrent encoder-decoder framework. Current approaches suffer from the problem of prediction discontinuities and may fail to predict human-like motion in longer time horizons due to error accumulation. We address these critical issues by incorporating local geometric structure constraints and regularizing predictions with plausible temporal smoothness and continuity from a global perspective. Specifically, rather than using the conventional Euclidean loss, we propose a novel frame-wise geodesic loss as a geometrically meaningful, more precise distance measurement. Moreover, inspired by the adversarial training mechanism, we present a new learning procedure to simultaneously validate the sequence-level plausibility of the prediction and its coherence with the input sequence by introducing two global recurrent discriminators. An unconditional, fidelity discriminator and a conditional, continuity discriminator are jointly trained along with the predictor in an adversarial manner. Our resulting adversarial geometry-aware encoder-decoder (AGED) model significantly outperforms state-of-the-art deep learning based approaches on the heavily benchmarked H3.6M dataset in both short-term and long-term predictions.

引用

页码：823 / 842

页数：20

共 60 条

[31] Anticipating Human Activities Using Object Affordances for Reactive Robotic Response [J].

Koppula, Hema S. ;

Saxena, Ashutosh .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (01) :14-29

[32]

Kovar L, 2002, ACM T GRAPHIC, V21, P473, DOI 10.1145/566570.566605

[33] Dual Motion GAN for Future-Flow Embedded Video Prediction [J].

Liang, Xiaodan ;

Lee, Lisa ;

Dai, Wei ;

Xing, Eric P. .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1762-1770

[34] On human motion prediction using recurrent neural networks [J].

Martinez, Julieta ;

Black, Michael J. ;

Romero, Javier .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4674-4683

[35]

MURRAY R. M., 1993, A Mathematical Introduction to Robotic Manipulation

[36] A Survey of Motion Planning and Control Techniques for Self-Driving Urban Vehicles [J].

Paden, Brian ;

Cap, Michal ;

Yong, Sze Zheng ;

Yershov, Dmitry ;

Frazzoli, Emilio .

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2016, 1 (01) :33-55

[37]

Paszke Adam, 2017, AUTOMATIC DIFFERENTI

[38] Context Encoders: Feature Learning by Inpainting [J].

Pathak, Deepak ;

Krahenbuhl, Philipp ;

Donahue, Jeff ;

Darrell, Trevor ;

Efros, Alexei A. .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2536-2544

[39]

Pavlovic V, 2001, ADV NEUR IN, V13, P981

[40] A framework for uncertainty and validation of 3-D registration methods based on points and frames [J].

Pennec, X ;

Thirion, JP .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 1997, 25 (03) :203-229

← 1 2 3 4 5 6 →