Crowd anomaly detection using Aggregation of Ensembles of fine-tuned ConvNets

被引:81
作者
Singh, Kuldeep [1 ]
Rajora, Shantanu [2 ]
Vishwakarma, Dinesh Kumar [2 ]
Tripathi, Gaurav [3 ]
Kumar, Sandeep [3 ]
Walia, Gurjit Singh [4 ]
机构
[1] Malaviya Natl Inst Technol, Dept Elect & Commun Engn, Jaipur, Rajasthan, India
[2] Delhi Technol Univ, Dept Informat Technol, Delhi, India
[3] Delhi Technol Univ, Dept Elect & Commun Engn, Delhi, India
[4] Def Res & Dev Org, Sci Anal Grp, Delhi, India
关键词
Aggregation; Anomaly detection; Convolutional Neural Network; Deep learning; Ensembles; Fine-tuning; ABNORMAL EVENT DETECTION; NEURAL-NETWORKS; LOCALIZATION;
D O I
10.1016/j.neucom.2019.08.059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomaly detection in crowded scenes plays a crucial role in automatic video surveillance to avert any casualty in the areas witnessing the high amount of footfalls. The key challenge for automatically classifying the anomalies in crowd image is the usage of feature set and techniques which can be replicated in every crowded scenario. In this paper, we propose a novel concept of Aggregation of Ensembles (AOE) for detecting an anomaly in video data showing crowded scenes, which leverage the existing capability of pre-trained ConvNets and a pool of classifiers. The proposed approach uses an ensemble of different fine-tuned Convolutional Neural Networks (CNN) based on the hypothesis that different CNN architectures learn different levels of semantic representation from crowd videos and thus an ensemble of CNNs will enable enriched feature sets to be extracted. The proposed AOE concept utilizes the fine-tuned ConvNets as fixed feature extractors to train variants of SVM classifier and then the posterior probabilities are fused to predict the anomaly in the crowd frame sequences. The experimental results show that the proposed Aggregation of Ensembles fine-tuned CNNs of various architectures achieve a higher accuracy in comparison with other established methods on benchmark datasets. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:188 / 198
页数:11
相关论文
共 40 条
[1]  
[Anonymous], 2014, Comput. Sci.
[2]  
[Anonymous], P ADV NEUR INF PROC
[3]  
[Anonymous], P CVPRW BOST
[4]  
[Anonymous], P IEEE IAFE COMP INT
[5]  
[Anonymous], 2001, P IEEE WORKSH MATH M
[6]  
[Anonymous], P CVPR
[7]   Sparse Coding Guided Spatiotemporal Feature Learning for Abnormal Event Detection in Large Videos [J].
Chu, Wenqing ;
Xue, Hongyang ;
Yao, Chengwei ;
Cai, Deng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (01) :246-255
[8]   Video Anomaly Search in Crowded Scenes via Spatio-Temporal Motion Context [J].
Cong, Yang ;
Yuan, Junsong ;
Tang, Yandong .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2013, 8 (10) :1590-1599
[9]   Abnormal event detection in crowded scenes using sparse representation [J].
Cong, Yang ;
Yuan, Junsong ;
Liu, Ji .
PATTERN RECOGNITION, 2013, 46 (07) :1851-1864
[10]  
Duchi J, 2011, J MACH LEARN RES, V12, P2121