Bidirectional recurrent autoencoder for 3D skeleton motion data refinement

被引：22

作者：

Li, Shujie ^{[1
]}

Zhou, Yang ^{[1
]}

Zhu, Haisheng ^{[1
]}

Xie, Wenjun ^{[1
]}

Zhao, Yang ^{[1
]}

Liu, Xiaoping ^{[1
]}

机构：

[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei, Anhui, Peoples R China

来源：

COMPUTERS & GRAPHICS-UK | 2019年 / 81卷

基金：

中国国家自然科学基金;

关键词：

Motion data refinement; B-LSTM-RNN; Motion autoencoder; 3D skeleton motion data; Joint position; MISSING MARKERS; NEURAL-NETWORKS; CAPTURE;

D O I：

10.1016/j.cag.2019.03.010

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this paper, we propose a novel 3D skeleton human motion data refinement method that is based on a bidirectional recurrent autoencoder (BRA). The BRA has two main characteristics: (1) the motion manifold is extracted by a bidirectional long short-term memory recurrent neural network (B-LSTM-RNN) and (2) apart from statistical information of motion data, kinematic information including smoothness and bone length constrain, are also simultaneously exploited with noisy-clean motion pairs. Using a bidirectional LSTM unit, which is more suitable for time series and can infer information from the data in both time directions, our autoencoder extracts a manifold that can exploit the spatial and temporal relationships between previous and subsequent motion data. As a result, the refined data that are projected by the decoder from the motion manifold have much lower reproduction error. Furthermore, owing to the consideration of kinematic information, our reproduced motion data are of higher visual quality, while preserving positional precision. The proposed method is not action-specific and can handle a wide variety of noise types. The proposed method does not require the noise amplitude, which may be unknown in many scenarios, as a priori knowledge. Extensive experimental results demonstrate that our method outperforms several state-of-the-art methods. (C) 2019 Elsevier Ltd. All rights reserved.

引用

页码：92 / 103

页数：12

共 58 条

[1] Bilinear Spatiotemporal Basis Models
Akhter, Ijaz
Simon, Tomas
Khan, Sohaib
Matthews, Iain
Sheikh, Yaser
[J]. ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (02): : 1 - 12
[2] [Anonymous], 2011, EUROGRAPHICS, DOI 10.2312/EG2011/short/045-048
[3] Real-time marker prediction and CoR estimation in optical motion capture
Aristidou, Andreas
Lasenby, Joan
[J]. VISUAL COMPUTER, 2013, 29 (01) : 7 - 26
[4] SnapNet: 3D point cloud semantic labeling with 2D deep segmentation networks
Boulch, Alexandre
Guerry, Yids
Le Saux, Bertrand
Audebert, Nicolas
[J]. COMPUTERS & GRAPHICS-UK, 2018, 71 : 189 - 198
[5] Bruderlin A., 1995, Computer Graphics Proceedings. SIGGRAPH 95, P97, DOI 10.1145/218380.218421
[6] Estimating missing marker positions using low dimensional Kalman smoothing
Burke, M.
Lasenby, J.
[J]. JOURNAL OF BIOMECHANICS, 2016, 49 (09) : 1854 - 1858
[7] Deep representation learning for human motion prediction and classification
Butepage, Judith
Black, Michael J.
Kragic, Danica
Kjellstrom, Hedvig
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1591 - 1599
[8] CMU, 2018, CARN MELL MOC DAT
[9] RECURRENT NEURAL NETWORKS AND ROBUST TIME-SERIES PREDICTION
CONNOR, JT
MARTIN, RD
ATLAS, LE
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 240 - 254
[10] A Novel Approach to Solve the "Missing Marker Problem'' in Marker-Based Motion Analysis That Exploits the Segment Coordination Patterns in Multi-Limb Motion Data
Federolf, Peter Andreas
[J]. PLOS ONE, 2013, 8 (10):

← 1 2 3 4 5 6 →