Bidirectional recurrent autoencoder for 3D skeleton motion data refinement

被引:22
作者
Li, Shujie [1 ]
Zhou, Yang [1 ]
Zhu, Haisheng [1 ]
Xie, Wenjun [1 ]
Zhao, Yang [1 ]
Liu, Xiaoping [1 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei, Anhui, Peoples R China
来源
COMPUTERS & GRAPHICS-UK | 2019年 / 81卷
基金
中国国家自然科学基金;
关键词
Motion data refinement; B-LSTM-RNN; Motion autoencoder; 3D skeleton motion data; Joint position; MISSING MARKERS; NEURAL-NETWORKS; CAPTURE;
D O I
10.1016/j.cag.2019.03.010
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose a novel 3D skeleton human motion data refinement method that is based on a bidirectional recurrent autoencoder (BRA). The BRA has two main characteristics: (1) the motion manifold is extracted by a bidirectional long short-term memory recurrent neural network (B-LSTM-RNN) and (2) apart from statistical information of motion data, kinematic information including smoothness and bone length constrain, are also simultaneously exploited with noisy-clean motion pairs. Using a bidirectional LSTM unit, which is more suitable for time series and can infer information from the data in both time directions, our autoencoder extracts a manifold that can exploit the spatial and temporal relationships between previous and subsequent motion data. As a result, the refined data that are projected by the decoder from the motion manifold have much lower reproduction error. Furthermore, owing to the consideration of kinematic information, our reproduced motion data are of higher visual quality, while preserving positional precision. The proposed method is not action-specific and can handle a wide variety of noise types. The proposed method does not require the noise amplitude, which may be unknown in many scenarios, as a priori knowledge. Extensive experimental results demonstrate that our method outperforms several state-of-the-art methods. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:92 / 103
页数:12
相关论文
共 58 条
  • [1] Bilinear Spatiotemporal Basis Models
    Akhter, Ijaz
    Simon, Tomas
    Khan, Sohaib
    Matthews, Iain
    Sheikh, Yaser
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (02): : 1 - 12
  • [2] [Anonymous], 2011, EUROGRAPHICS, DOI 10.2312/EG2011/short/045-048
  • [3] Real-time marker prediction and CoR estimation in optical motion capture
    Aristidou, Andreas
    Lasenby, Joan
    [J]. VISUAL COMPUTER, 2013, 29 (01) : 7 - 26
  • [4] SnapNet: 3D point cloud semantic labeling with 2D deep segmentation networks
    Boulch, Alexandre
    Guerry, Yids
    Le Saux, Bertrand
    Audebert, Nicolas
    [J]. COMPUTERS & GRAPHICS-UK, 2018, 71 : 189 - 198
  • [5] Bruderlin A., 1995, Computer Graphics Proceedings. SIGGRAPH 95, P97, DOI 10.1145/218380.218421
  • [6] Estimating missing marker positions using low dimensional Kalman smoothing
    Burke, M.
    Lasenby, J.
    [J]. JOURNAL OF BIOMECHANICS, 2016, 49 (09) : 1854 - 1858
  • [7] Deep representation learning for human motion prediction and classification
    Butepage, Judith
    Black, Michael J.
    Kragic, Danica
    Kjellstrom, Hedvig
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1591 - 1599
  • [8] CMU, 2018, CARN MELL MOC DAT
  • [9] RECURRENT NEURAL NETWORKS AND ROBUST TIME-SERIES PREDICTION
    CONNOR, JT
    MARTIN, RD
    ATLAS, LE
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 240 - 254
  • [10] A Novel Approach to Solve the "Missing Marker Problem'' in Marker-Based Motion Analysis That Exploits the Segment Coordination Patterns in Multi-Limb Motion Data
    Federolf, Peter Andreas
    [J]. PLOS ONE, 2013, 8 (10):