Effective and efficient similarity searching in motion capture data

Cited by: 43
Authors
Sedmidubsky, Jan [1]
Elias, Petr [2]
Zezula, Pavel [1]
Affiliations
[1] Masaryk Univ, Comp Sci, Brno, Czech Republic
[2] Masaryk Univ, Brno, Czech Republic
Keywords
Motion capture data retrieval; Effective similarity measure; Efficient indexing; k-NN query; Motion image; Convolutional neural network; Fixed-size motion feature; Action recognition; Classification; Retrieval
DOI
10.1007/s11042-017-4859-7
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Motion capture data describe human movements in the form of spatio-temporal trajectories of skeleton joints. Intelligent management of such complex data is a challenging task for computers which requires an effective concept of motion similarity. However, evaluating the pair-wise similarity is a difficult problem as a single action can be performed by various actors in different ways, speeds or starting positions. Recent methods usually model the motion similarity by comparing customized features using distance-based functions or specialized machine-learning classifiers. By combining both these approaches, we transform the problem of comparing motions of variable sizes into the problem of comparing fixed-size vectors. Specifically, each rather-short motion is encoded into a compact visual representation from which a highly descriptive 4,096-dimensional feature vector is extracted using a fine-tuned deep convolutional neural network. The advantage is that the fixed-size features are compared by the Euclidean distance which enables efficient motion indexing by any metric-based index structure. Another advantage of the proposed approach is its tolerance towards an imprecise action segmentation, the variance in movement speed, and a lower data quality. All these properties together bring new possibilities for effective and efficient large-scale retrieval.
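The retrieval step the abstract describes, a k-NN query over fixed-size feature vectors compared by Euclidean distance, can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `knn_query`, the random stand-in data, and the linear scan are assumptions made for illustration. In the described approach the vectors would come from the fine-tuned convolutional network, and a metric-based index would replace the scan.

```python
import heapq
import math
import random

FEATURE_DIM = 4096  # feature size stated in the abstract


def knn_query(query, database, k):
    """Return the k nearest vectors as (distance, index) pairs, closest first.

    Hypothetical helper: a plain linear scan using Euclidean distance.
    """
    scored = ((math.dist(query, vec), i) for i, vec in enumerate(database))
    return heapq.nsmallest(k, scored)


# Stand-in data (assumption): random vectors instead of real CNN features.
random.seed(0)
database = [[random.gauss(0, 1) for _ in range(FEATURE_DIM)] for _ in range(200)]

# A query that is a slightly perturbed copy of database entry 42.
query = [x + 0.01 * random.gauss(0, 1) for x in database[42]]

neighbours = knn_query(query, database, k=5)
for distance, index in neighbours:
    print(f"motion {index}: distance {distance:.3f}")
```

Because every motion maps to a vector of the same length, the distance computation does not depend on the duration of the original motion. Because Euclidean distance is a metric, the scan can in principle be swapped for a metric index without changing the query results.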
Pages: 12073-12094
Page count: 22