Deep-Cascade: Cascading 3D Deep Neural Networks for Fast Anomaly Detection and Localization in Crowded Scenes

被引:268
作者
Sabokrou, Mohammad [1 ]
Fayyaz, Mohsen [1 ]
Fathy, Mahmood [2 ]
Klette, Reinhard [3 ]
机构
[1] Malek Ashtar Univ Technol, Dept ICT, Tehran 177415875, Iran
[2] Iran Univ Sci & Technol, Tehran 1684613114, Iran
[3] Auckland Univ Technol, Sch Engn Comp & Math Sci, Auckland 1142, New Zealand
关键词
Anomaly detection; deep neural network; video analysis; pedestrian scenes; EVENT DETECTION; ONLINE;
D O I
10.1109/TIP.2017.2670780
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a fast and reliable method for anomaly detection and localization in video data showing crowded scenes. Time-efficient anomaly localization is an ongoing challenge and subject of this paper. We propose a cubic-patch-based method, characterised by a cascade of classifiers, which makes use of an advanced feature-learning approach. Our cascade of classifiers has two main stages. First, a light but deep 3D auto-encoder is used for early identification of "many" normal cubic patches. This deep network operates on small cubic patches as being the first stage, before carefully resizing the remaining candidates of interest, and evaluating those at the second stage using a more complex and deeper 3D convolutional neural network (CNN). We divide the deep auto-encoder and the CNN into multiple sub-stages, which operate as cascaded classifiers. Shallow layers of the cascaded deep networks (designed as Gaussian classifiers, acting as weak single-class classifiers) detect "simple" normal patches, such as background patches and more complex normal patches, are detected at deeper layers. It is shown that the proposed novel technique (a cascade of two cascaded classifiers) performs comparable to current top-performing detection and localization methods on standard benchmarks, but outperforms those in general with respect to required computation time.
引用
收藏
页码:1992 / 2004
页数:13
相关论文
共 52 条
[1]   Robust real-time unusual event detection using multiple fixed-location monitors [J].
Adam, Amit ;
Rivlin, Ehud ;
Shimshoni, Ilan ;
Reinitz, David .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (03) :555-560
[2]  
[Anonymous], 2014, ADV NEURAL INFORM PR
[3]  
[Anonymous], PROC CVPR IEEE
[4]  
[Anonymous], ADV NEURAL INF PROCE
[5]  
[Anonymous], 2015, P BRIT MACH VIS C BM
[6]  
[Anonymous], P BMVC
[7]  
[Anonymous], P CVPR
[8]  
Antic B, 2011, IEEE I CONF COMP VIS, P2415, DOI 10.1109/ICCV.2011.6126525
[9]   Detecting abnormal human behaviour using multiple cameras [J].
Antonakaki, Panagiota ;
Kosmopoulos, Dimitrios ;
Perantonis, Stavros J. .
SIGNAL PROCESSING, 2009, 89 (09) :1723-1738
[10]   Multi-scale and real-time non-parametric approach for anomaly detection and localization [J].
Bertini, Marco ;
Del Bimbo, Alberto ;
Seidenari, Lorenzo .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2012, 116 (03) :320-329