Clustering Driven Deep Autoencoder for Video Anomaly Detection

被引:189
作者
Chang, Yunpeng [1 ]
Tu, Zhigang [1 ]
Xie, Wei [2 ]
Yuan, Junsong [3 ]
机构
[1] Wuhan Univ, Wuhan 430079, Peoples R China
[2] Cent China Normal Univ, Wuhan 430079, Peoples R China
[3] SUNY Buffalo, Buffalo, NY 14260 USA
来源
COMPUTER VISION - ECCV 2020, PT XV | 2020年 / 12360卷
关键词
Video anomaly detection; Spatio-temporal dissociation; Deep k-means cluster;
D O I
10.1007/978-3-030-58555-6_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Because of the ambiguous definition of anomaly and the complexity of real data, video anomaly detection is one of the most challenging problems in intelligent video surveillance. Since the abnormal events are usually different from normal events in appearance and/or in motion behavior, we address this issue by designing a novel convolution autoencoder architecture to separately capture spatial and temporal informative representation. The spatial part reconstructs the last individual frame (LIF), while the temporal part takes consecutive frames as input and RGB difference as output to simulate the generation of optical flow. The abnormal events which are irregular in appearance or in motion behavior lead to a large reconstruction error. Besides, we design a deep k-means cluster to force the appearance and the motion encoder to extract common factors of variation within the dataset. Experiments on some publicly available datasets demonstrate the effectiveness of our method with the state-of-the-art performance.
引用
收藏
页码:329 / 345
页数:17
相关论文
共 43 条
[1]   Latent Space Autoregression for Novelty Detection [J].
Abati, Davide ;
Porrello, Angelo ;
Calderara, Simone ;
Cucchiara, Rita .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :481-490
[2]  
[Anonymous], 2007, Advances in neural information processing systems, DOI 10.5555/2976456.2976599
[3]  
Blanchard G, 2010, J MACH LEARN RES, V11, P2973
[4]  
Chang Y., 2020, CCIS, V1180, P187, DOI [10.1007/978-981-15-3651-917, DOI 10.1007/978-981-15-3651-917]
[5]   Deep Clustering via Joint Convolutional Autoencoder Embedding and Relative Entropy Minimization [J].
Dizaji, Kamran Ghasedi ;
Herandi, Amirhossein ;
Deng, Cheng ;
Cai, Weidong ;
Huang, Heng .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5747-5756
[6]  
Fard M.M., 2018, arXiv
[7]   Memorizing Normality to Detect Anomaly: Memory-augmented Deep Autoencoder for Unsupervised Anomaly Detection [J].
Gong, Dong ;
Liu, Lingqiao ;
Le, Vuong ;
Saha, Budhaditya ;
Mansour, Moussa Reda ;
Venkatesh, Svetha ;
van den Hengel, Anton .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1705-1714
[8]  
Guo XF, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1753
[9]   Learning Temporal Regularity in Video Sequences [J].
Hasan, Mahmudul ;
Choi, Jonghyun ;
Neumann, Jan ;
Roy-Chowdhury, Amit K. ;
Davis, Larry S. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :733-742
[10]   Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge [J].
Hinami, Ryota ;
Mei, Tao ;
Satoh, Shin'ichi .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3639-3647