Fusing depth and colour information for human action recognition

Cited by: 29
Authors
Avola, Danilo [1, 2]
Bernardi, Marco [2]
Foresti, Gian Luca [1]
Affiliations
[1] Univ Udine, Dept Math Comp Sci & Phys, Udine, Italy
[2] Sapienza Univ, Dept Comp, Rome, Italy
Keywords
Human action recognition; Decision level fusion; Bag-of-visual-word; Naive bayes combination; Support vector machine; RGB-D; ACTION CLASSIFICATION; DISCRIMINANT-ANALYSIS; GESTURE RECOGNITION; FEATURES; LEVEL; VIDEO;
DOI
10.1007/s11042-018-6875-7
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In recent years, human action recognition systems have been increasingly developed to support a wide range of application areas, such as surveillance, behaviour analysis, and security. In particular, data fusion approaches that use depth and colour information (i.e., RGB-D data) appear promising for recognizing large classes of human actions with a high level of accuracy. However, existing data fusion approaches are mainly based on feature fusion strategies, which tend to suffer from some limitations, including the difficulty of combining different feature types and of managing missing information. To address these two problems, we propose an RGB-D data based human action recognition system supported by a decision fusion strategy. The system, starting from the well-known Joint Directors of Laboratories (JDL) data fusion model, analyses human actions separately for each channel (i.e., depth and colour). The actions are modelled as a sum of visual words by using the traditional Bag-of-Visual-Words (BoVW) model. Subsequently, on each channel, these actions are classified by using a multi-class Support Vector Machine (SVM) classifier. Finally, the classification results are fused by a Naive Bayes Combination (NBC) method. The effectiveness of the proposed system has been evaluated on three public datasets: UTKinect-Action3D, CAD-60, and LIRIS Human Activities. Experimental results, compared with key works of the current state of the art, have shown that the proposed system can be considered a concrete contribution to the action recognition field.
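The decision-level fusion step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the authors' exact formulation, so the code below assumes the common textbook form of Naive Bayes Combination: each channel's classifier contributes a likelihood estimated from its own confusion matrix, and the per-class products are normalised. The function name nbc_fuse, the Laplace smoothing term, and the toy confusion matrices are all hypothetical and chosen only for illustration.

```python
import math


def nbc_fuse(confusions, predicted, smoothing=1.0):
    """Fuse per-channel label decisions with a Naive Bayes Combination.

    confusions: one K x K count matrix per channel; confusions[i][k][s] is the
                number of validation samples of true class k that channel i
                labelled as s.
    predicted:  one integer label per channel for the sample being fused.
    smoothing:  Laplace correction (an assumption, avoids zero products).
    """
    n_classes = len(confusions[0])
    # Class priors estimated from the row totals of the first matrix.
    class_counts = [sum(row) for row in confusions[0]]
    total = sum(class_counts)
    log_support = [
        math.log((n_k + smoothing) / (total + smoothing * n_classes))
        for n_k in class_counts
    ]
    # Add log P(channel says s | true class k) for every channel.
    for cm, s in zip(confusions, predicted):
        for k in range(n_classes):
            row_total = sum(cm[k])
            log_support[k] += math.log(
                (cm[k][s] + smoothing) / (row_total + smoothing * n_classes)
            )
    # Normalise in a numerically stable way.
    peak = max(log_support)
    support = [math.exp(v - peak) for v in log_support]
    norm = sum(support)
    posterior = [v / norm for v in support]
    label = max(range(n_classes), key=posterior.__getitem__)
    return label, posterior


# Toy confusion matrices for three action classes (made-up numbers).
cm_depth = [[8, 1, 1], [2, 7, 1], [1, 1, 8]]
cm_colour = [[6, 3, 1], [1, 8, 1], [3, 1, 6]]

# The depth channel votes for class 0, the colour channel for class 1.
label, posterior = nbc_fuse([cm_depth, cm_colour], [0, 1])
print(label, posterior)
```

In this toy case the two channels disagree, and the fused decision follows the depth channel because the colour channel's confusion matrix shows it mislabels class 0 as class 1 fairly often. In a real pipeline the per-channel labels would come from the SVM classifiers, and the confusion matrices from held-out data.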
Pages: 5919-5939
Number of pages: 21