Stacked sparse autoencoder and history of binary motion image for human activity recognition

被引:29
作者
Gnouma, Mariem [1 ]
Ladjailia, Ammar [2 ,3 ]
Ejbali, Ridha [1 ]
Zaied, Mourad [1 ]
机构
[1] Univ Gabes, Natl Sch Engineers Gabes, Res Team Intelligent Machines, Gabes, Tunisia
[2] Univ Souk Ahras, Fac Sci & Technol, Souk Ahras, Algeria
[3] Univ Annaba, Algeria Dept Comp Sci, Annaba, Algeria
关键词
Human activity recognition; Silhouette extraction; History of binary motion image; Deep learning; WAVELET NETWORK;
D O I
10.1007/s11042-018-6273-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The recognition of human actions in a video sequence still remains a challenging task in the computer vision community. Several techniques have been proposed until today such as silhouette detection, local space-time features and optical flow techniques. In this paper, a supervised way followed by an unsupervised learning using the principle of the auto-encoder is proposed to address the problem. We introduce a new foreground detection architecture based on information extracted from the Gaussian mixture model (GMM) incorporating with the uniform motion of Magnitude of Optical Flow (MOF). Thus, we use a fast dynamic frame skipping technique to avoid frames that contain irrelevant motion, making it possible to decrease the computational complexity of silhouette extraction. Furthermore a new technique of representations to construct an informative concept for human action recognition based on the superposition of human silhouettes is presented. We called this approach history of binary motion image (HBMI).Our method has been evaluated by a classification on the Ixmas, Weizmann, and KTH datasets, the Sparce Stacked Auto-encoder (SSAE), an instance of a deep learning strategy, is presented for efficient human activities detection and the Softmax (SMC) for the classification. The objective of this classifier in deep learning is the learning of function hierarchies with higher-level functions at lower-level functions of the hierarchy to provide an agile, robust and simple method. The results prove the efficiency of our proposed approach with respect to the irregularity in the performance of an action shape distortion, change of point of view as well as significant changes of scale.
引用
收藏
页码:2157 / 2179
页数:23
相关论文
共 54 条
[11]  
[Anonymous], P 10 EUR C COMP VIS
[12]  
[Anonymous], 2004, 1 INT C SIGN SYST DE
[13]  
[Anonymous], ACTION ENERGY IMAGES
[14]  
[Anonymous], 2017, Human Motion Sensing and Recognition, DOI DOI 10.1007/978-3-662-53692-6_11
[15]  
[Anonymous], COMPUTER VISION CONC
[16]  
[Anonymous], REV VIDEO BASED HUMA
[17]  
[Anonymous], ICVGIP
[18]  
[Anonymous], IEEE T CIRCUITS SYST
[19]  
[Anonymous], IEEE C COMP VIS PATT
[20]  
[Anonymous], 2013, P 2013 INT C COMPUTE, DOI DOI 10.1109/ICCMA.2013.6506158