MoVi: A large multi-purpose human motion and video dataset

被引:37
作者
Ghorbani, Saeed [1 ,2 ]
Mahdaviani, Kimia [3 ]
Thaler, Anne [2 ,4 ]
Kording, Konrad [5 ,6 ]
Cook, Douglas James [7 ,8 ]
Blohm, Gunnar [7 ]
Troje, Nikolaus F. [2 ,4 ]
机构
[1] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada
[2] York Univ, Ctr Vis Res, Toronto, ON, Canada
[3] Queens Univ, Dept Psychol, Kingston, ON, Canada
[4] York Univ, Dept Biol, Toronto, ON, Canada
[5] Univ Penn, Dept Neurosci, Philadelphia, PA 19104 USA
[6] Univ Penn, Dept Bioengn, Philadelphia, PA 19104 USA
[7] Queens Univ, Ctr Neurosci Studies, Kingston, ON, Canada
[8] Queens Univ, Dept Surg, Kingston, ON, Canada
来源
PLOS ONE | 2021年 / 16卷 / 06期
基金
加拿大自然科学与工程研究理事会;
关键词
CENTER LOCATION; PREDICTION;
D O I
10.1371/journal.pone.0253157
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Large high-quality datasets of human body shape and kinematics lay the foundation for modelling and simulation approaches in computer vision, computer graphics, and biomechanics. Creating datasets that combine naturalistic recordings with high-accuracy data about ground truth body shape and pose is challenging because different motion recording systems are either optimized for one or the other. We address this issue in our dataset by using different hardware systems to record partially overlapping information and synchronized data that lend themselves to transfer learning. This multimodal dataset contains 9 hours of optical motion capture data, 17 hours of video data from 4 different points of view recorded by stationary and hand-held cameras, and 6.6 hours of inertial measurement units data recorded from 60 female and 30 male actors performing a collection of 21 everyday actions and sports movements. The processed motion capture data is also available as realistic 3D human meshes. We anticipate use of this dataset for research on human pose estimation, action recognition, motion modelling, gait analysis, and body shape reconstruction.
引用
收藏
页数:15
相关论文
共 26 条
[1]   SCAPE: Shape Completion and Animation of People [J].
Anguelov, D ;
Srinivasan, P ;
Koller, D ;
Thrun, S ;
Rodgers, J ;
Davis, J .
ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03) :408-416
[2]  
[Anonymous], 2009, Tech. Rep. CMU-RI-TR-08-22
[3]  
[Anonymous], Camera calibration toolbox for matlab
[4]   PREDICTION OF HIP-JOINT CENTER LOCATION FROM EXTERNAL LANDMARKS [J].
BELL, AL ;
BRAND, RA ;
PEDERSEN, DR .
HUMAN MOVEMENT SCIENCE, 1989, 8 (01) :3-16
[5]   A COMPARISON OF THE ACCURACY OF SEVERAL HIP CENTER LOCATION PREDICTION METHODS [J].
BELL, AL ;
PEDERSEN, DR ;
BRAND, RA .
JOURNAL OF BIOMECHANICS, 1990, 23 (06) :617-621
[6]  
Dasari S., 2019, ARXIV PREPRINT ARXIV
[7]   Complete solution classification for the Perspective-Three-Point problem [J].
Gao, XS ;
Hou, XR ;
Tang, JL ;
Cheng, HF .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (08) :930-943
[8]   Probabilistic Character Motion Synthesis using a Hierarchical Deep Latent Variable Model [J].
Ghorbani, S. ;
Wloka, C. ;
Etemad, A. ;
Brubaker, M. A. ;
Troje, N. F. .
COMPUTER GRAPHICS FORUM, 2020, 39 (08) :225-239
[9]  
Ghorbani S, 2020, **DATA OBJECT**, DOI [10.5683/SP2/JRHDRN, DOI 10.5683/SP2/JRHDRN]
[10]   Auto-labelling of Markers in Optical Motion Capture by Permutation Learning [J].
Ghorbani, Saeed ;
Etemad, Ali ;
Troje, Nikolaus F. .
ADVANCES IN COMPUTER GRAPHICS, CGI 2019, 2019, 11542 :167-178