Improving Human Action Recognition Using Hierarchical Features And Multiple Classifier Ensembles

被引:10
作者
Bulbul, Mohammad Farhad [1 ]
Islam, Saiful [2 ]
Zhou, Yatong [3 ]
Ali, Hazrat [4 ]
机构
[1] Jashore Univ Sci & Technol, Dept Math, Jashore, Bangladesh
[2] Bangabandhu Sheikh Mujibur Rahman Sci & Technol U, Dept Math, Gopalganj, Bangladesh
[3] Hebei Univ Technol HEBUT, Sch Elect & Informat Engn, Tianjin, Peoples R China
[4] COMSATS Univ Islamabad, Dept Elect & Comp Engn, Abbottabad Campus, Abbottabad, Pakistan
关键词
human action recognition; motion history image; static history image; local binary patterns; logarithmic opinion pool; kernel extreme learning machine; EXTREME LEARNING-MACHINE; TEXTURE MEASURES; REPRESENTATION; FUSION; 2D;
D O I
10.1093/comjnl/bxz123
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a simple, fast and efficacious system to promote the human action classification outcome using the depth action sequences. Firstly, the motion history images (Mills) and static history images (SHIs) are created from the front (XOY), side (YOZ) and top (XOZ) projected scenes of each depth sequence in a 3D Euclidean space through engaging the 3D Motion Trail Model (3DMTM). Then, the Local Binary Patterns (LBPs) algorithm is operated on the Mills and Sins to learn motion and static hierarchical features to represent the action sequence. The motion and static hierarchical feature vectors are then fed into a classifier ensemble to classify action classes, where the ensemble comprises of two classifiers. Thus, each ensemble includes a pair of Kernel-based Extreme Learning Machine (KELM) or l(2)-regularized Collaborative Representation Classifier (l(2)-CRC) or Multi-class Support Vector Machine. To extensively assess the framework, we perform experiments on a couple of standard available datasets such as MSR-Action3D, UTD-MHAD and DHA. Experimental consequences demonstrate that the proposed approach gains a state-of-the-art recognition performance in comparison with other available approaches. Several statistical measurements on recognition results also indicate that the method achieves superiority when the hierarchical features are adopted with the KELM ensemble. In addition, to ensure real-time processing capability of the algorithm, the running time of major components is investigated. Based on machine dependency of the running time, the computational complexity of the system is also shown and compared with other methods. Experimental results and evaluation of the computational time and complexity reflect real-time compatibility and feasibility of the proposed system.
引用
收藏
页码:1633 / 1655
页数:23
相关论文
共 87 条
[1]  
Adhikari K, 2017, PROCEEDINGS OF THE FIFTEENTH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS - MVA2017, P81, DOI 10.23919/MVA.2017.7986795
[2]   Speaker recognition with hybrid features from a deep belief network [J].
Ali, Hazrat ;
Tran, Son N. ;
Benetos, Emmanouil ;
Garcez, Artur S. d'Avila .
NEURAL COMPUTING & APPLICATIONS, 2018, 29 (06) :13-19
[3]   Evolutionary joint selection to improve human action recognition with RGB-D devices [J].
Andre Chaaraoui, Alexandros ;
Ramon Padilla-Lopez, Jose ;
Climent-Perez, Pau ;
Florez-Revuelta, Francisco .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (03) :786-794
[4]  
[Anonymous], 2016, IJCAI
[5]  
[Anonymous], 2013, Twenty-Third International Joint Conference on Artificial Intelligence (IJCAI), DOI DOI 10.5555/2540128.2540343
[6]  
[Anonymous], 2011, Acm T. Intel. Syst. Tec., DOI DOI 10.1145/1961189.1961199
[7]   Supervised spatio-temporal kernel descriptor for human action recognition from RGB-depth videos [J].
Asadi-Aghbolaghi, Maryam ;
Kasaei, Shohreh .
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (11) :14115-14135
[8]   Dynamic 3D Hand Gesture Recognition by Learning Weighted Depth Motion Maps [J].
Azad, Reza ;
Asadi-Aghbolaghi, Maryam ;
Kasaei, Shohreh ;
Escalera, Sergio .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (06) :1729-1740
[9]  
Azary S, 2012, LECT NOTES COMPUT SC, V7432, P166, DOI 10.1007/978-3-642-33191-6_17
[10]   Multisource remote sensing data classification based on consensus and pruning [J].
Benediktsson, JA ;
Sveinsson, JR .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2003, 41 (04) :932-936