Embedded Features for 1D CNN-based Action Recognition on Depth Maps

被引:5
作者
Trelinski, Jacek [1 ]
Kwolek, Bogdan [1 ]
机构
[1] AGH Univ Sci & Technol, 30 Mickiewicza, PL-30059 Krakow, Poland
来源
VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP | 2021年
关键词
Action Recognition on Depth Maps; Convolutional Neural Networks; Feature Embedding;
D O I
10.5220/0010340105360543
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present an algorithm for human action recognition using only depth maps. A convolutional autoencoder and Siamese neural network are trained to learn embedded features, encapsulating the content of single depth maps. Afterwards, statistical features and multichannel 1D CNN features are extracted on multivariate time-series of such embedded features to represent actions on depth map sequences. The action recognition is achieved by voting in an ensemble of one-vs-all weak classifiers. We demonstrate experimentally that the proposed algorithm achieves competitive results on UTD-MHAD dataset and outperforms by a large margin the best algorithms on 3D Human-Object Interaction Set (SYSU 3DHOI).
引用
收藏
页码:536 / 543
页数:8
相关论文
共 24 条
[1]  
[Anonymous], 2006, PROC IEEE COMPUT SOC, DOI 10.1109/CVPR.2006.100
[2]  
Bulbul M, 2019, MULTIM TOOLS APPL, V78
[3]  
Chen C, 2015, IEEE IMAGE PROC, P168, DOI 10.1109/ICIP.2015.7350781
[4]   Learning a similarity metric discriminatively, with application to face verification [J].
Chopra, S ;
Hadsell, R ;
LeCun, Y .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546
[5]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507
[6]   Skeleton Optical Spectra-Based Action Recognition Using Convolutional Neural Networks [J].
Hou, Yonghong ;
Li, Zhaoyang ;
Wang, Pichao ;
Li, Wanqing .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (03) :807-811
[7]   Early Action Prediction by Soft Regression [J].
Hu, Jian-Fang ;
Zheng, Wei-Shi ;
Ma, Lianyang ;
Wang, Gang ;
Lai, Jianhuang ;
Zhang, Jianguo .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (11) :2568-2583
[8]  
Hu JF, 2015, PROC CVPR IEEE, P5344, DOI 10.1109/CVPR.2015.7299172
[9]  
Koch G., 2015, ICML DEEP LEARN WORK, P1
[10]   Application on Integration Technology of Visualized Hierarchical Information [J].
Li, Weibo ;
He, Yang .
2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL I, 2010, :9-12