ASPset: An outdoor sports pose video dataset with 3D keypoint annotations

被引:13
作者
Nibali, Aiden [1 ]
Millward, Joshua [1 ,2 ]
He, Zhen [1 ]
Morgan, Stuart [2 ]
机构
[1] La Trobe Univ, Dept Comp Sci & IT, Bundoora, Vic 3086, Australia
[2] Australian Inst Sport, Leverrier St, Bruce, ACT 2617, Australia
关键词
Markerless motion capture; Human pose estimation; Triangulation; Camera calibration;
D O I
10.1016/j.imavis.2021.104196
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in deep learning approaches to computer vision problems have led to renewed interest in the task of predicting 3D human joint locations from raw image data, with application areas including sports analysis, human-computer interaction, and physical rehabilitation. Although supervised learning of deep neural networks has proven to be effective for pose estimation, it requires a wealth of varied data to generalise well to previously unseen examples. Consequently, progress in 3D human pose estimation has been slowed by the fact that 3D keypoint annotations are notoriously difficult to obtain, traditionally requiring a large array of cameras and/or the use of wearable markers/sensors. In this paper we describe a methodology for obtaining 3D human pose annotations using only three video cameras and without any wearables. We apply this methodology to construct ASPset-510 (Australian Sports Pose Dataset), a large collection of natural sports-related video with 3D pose annotations. Using ASPset-510 as an additional source of training examples, we found that we could improve pose model generalisation on the established MPI-INF-3DHP benchmark. We make ASPset-510 publicly available, and provide strong baseline results for future work to compare against. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 49 条
[1]  
Andriluka Mykhaylo, 2014, P CVPR
[2]  
[Anonymous], 2000, DR DOBBS J SOFTWARE
[3]  
[Anonymous], 2017, ARXIV171109250
[4]  
[Anonymous], 2017, P ICLR
[5]  
[Anonymous], 2017, IEEE INT C COMPUT VI, DOI [10.1109/iccv.201, DOI 10.1109/ICCV.2017.322]
[6]  
[Anonymous], 2017, ARXIV171002322
[7]  
Azevedo C.L., 2014, TRANSP RES PROCEDIA, V1, P3, DOI 10.1016/j.trpro.2014.07.002
[8]  
Baptista R., ARXIV200409989
[9]   A Kinect-based system for physical rehabilitation: A pilot study for young adults with motor disabilities [J].
Chang, Yao-Jen ;
Chen, Shu-Fang ;
Huang, Jun-Da .
RESEARCH IN DEVELOPMENTAL DISABILITIES, 2011, 32 (06) :2566-2570
[10]   Synthesizing Training Images for Boosting Human 3D Pose Estimation [J].
Chen, Wenzheng ;
Wang, Huan ;
Li, Yangyan ;
Su, Hao ;
Wang, Zhenhua ;
Tu, Changhe ;
Lischinski, Dani ;
Cohen-Or, Daniel ;
Chen, Baoquan .
PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :479-488