Three-dimensional action recognition using volume integrals

被引:4
作者
Diaz-Mas, Luis [1 ]
Munoz-Salinas, Rafael [1 ]
Madrid-Cuevas, F. J. [1 ]
Medina-Carnicer, R. [1 ]
机构
[1] Univ Cordoba, Dept Comp & Numer Anal, E-14071 Cordoba, Spain
关键词
Action recognition; View invariance; Multi-camera; Motion descriptor; POSTURE CLASSIFICATION;
D O I
10.1007/s10044-011-0239-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work proposes the volume integral (VI) as a new descriptor for three-dimensional action recognition. The descriptor transforms the actor's volumetric information into a two-dimensional representation by projecting the voxel data to a set of planes that maximize the discrimination of actions. Our descriptor significantly reduces the amount of data of the three-dimensional representations yet preserves the most important information. As a consequence, the action recognition process is greatly speeded up while achieving very high success rates. The method proposed is therefore especially appropriate for applications in which limitations of computing power and space are significant aspects to consider, such as real-time applications or mobile devices. Additionally, the descriptor is sensitive to reflected actions, i.e., same actions performed with different limbs can be differentiated. This paper tests the VI using several Dimensionality Reduction techniques (namely PCA, 2D-PCA, LDA) and different Machine Learning approaches (namely Clustering, SVM and HMM) so as to determine the best combination of these for the action recognition task. Experiments conducted on the public IXMAS dataset show that the VI compares favorably with state-of-the-art descriptors both in terms of classification rates and computing times.
引用
收藏
页码:289 / 298
页数:10
相关论文
共 50 条
[41]   HUMAN ACTION RECOGNITION BASED ON ACTION FORESTS MODEL USING KINECT CAMERA [J].
Chuan, Chi-Hung ;
Chen, Ying-Nong ;
Fan, Kuo-Chin .
IEEE 30TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA 2016), 2016, :914-917
[42]   Human Action Recognition Using Global Point Feature Histograms and Action Shapes [J].
Rusu, Radu Bogdan ;
Bandouch, Jan ;
Meier, Franziska ;
Essa, Irfan ;
Beetz, Michael .
ADVANCED ROBOTICS, 2009, 23 (14) :1873-1908
[43]   Action Recognition Using Nonnegative Action Component Representation and Sparse Basis Selection [J].
Wang, Haoran ;
Yuan, Chunfeng ;
Hu, Weiming ;
Ling, Haibin ;
Yang, Wankou ;
Sun, Changyin .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (02) :570-581
[44]   Human action recognition using time delay input radial basis function networks [J].
Kalhor, Davood ;
Aris, Ishak ;
Moaini, Trifa ;
Halin, Izhal Abdul .
International Journal of Simulation: Systems, Science and Technology, 2014, 15 (03) :42-53
[45]   Using Trajectory Features for Tai Chi Action Recognition [J].
Xu, Leiyang ;
Wang, Qiang ;
Yuan, Lin ;
Ma, Xiang .
2020 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, I2MTC 2020, 2020,
[46]   Action Recognition by Jointly Using Video Proposal and Trajectory [J].
Qi, Lei ;
Lu, Xiaoqiang ;
Li, Xuelong .
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON VISION, IMAGE AND SIGNAL PROCESSING (ICVISP 2018), 2018,
[47]   Using Phase Instead of Optical Flow for Action Recognition [J].
Hommos, Omar ;
Pintea, Silvia L. ;
Mettes, Pascal S. M. ;
van Gemert, Jan C. .
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT VI, 2019, 11134 :678-691
[48]   Analysis of table tennis swing using action recognition [J].
Heo, Geon ;
Ha, Jong-Eun .
Journal of Institute of Control, Robotics and Systems, 2014, 21 (01) :40-45
[49]   Bird Action Recognition in Wetlands using Deep Learning [J].
Rodriguez-Juan, Javier ;
Berenguer-Agullo, Adrian ;
Benavent-Lledo, Manuel ;
Mulero-Perez, David ;
Garcia-Rodriguez, Jose ;
Sebastian-Gonzalez, Esther .
PROCEEDINGS OF THE 2024 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR SOCIAL GOOD, GOODIT 2024, 2024, :350-357
[50]   AN IMPROVED METHOD USING KINEMATIC FEATURES FOR ACTION RECOGNITION [J].
Chen, Yuanbo ;
Zhao, Yanyun ;
Cai, Anni .
PROCEEDINGS OF 2011 INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND APPLICATION, ICCTA2011, 2011, :737-741