VIDEO ANOMALY DETECTION VIA PREDICTIVE AUTOENCODER WITH GRADIENT-BASED ATTENTION

被引:15
作者
Lai, Yuandu [1 ,2 ]
Liu, Rui [1 ,2 ]
Han, Yahong [1 ,2 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Tianjin Univ, Tianjin Key Lab Machine Learning, Tianjin, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2020年
关键词
Anomaly detection; Gradient-based attention; Predictive autoencoder;
D O I
10.1109/icme46284.2020.9102894
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Video anomaly detection is a challenging problem due to the ambiguity and diversity of anomalies in different scenes. In this paper, we present a novel framework to detect abnormal in surveillance videos. Inspired by the common deep reconstruction methods and deep prediction ones, we propose a new two-branch predictive autoencoder, including a reconstruction decoder and a prediction decoder, in which the prediction decoder is used to generate future frame and carry out anomaly detection by comparing the difference between predicted future frame and its ground truth. And the reconstruction decoder reconstructs the current frame, which can constrains the encoder to learn video representations better. Moreover, reconstruction decoder provides a gradient-based attention, which significantly helps the prediction decoder to generate higher quality future frame. Our method unifies reconstruction and prediction methods in an end-to-end framework, and it obtains impressive results with better predicted future frame on some publicly available datasets including CUHK Avenue and UCSD Pedestrian.
引用
收藏
页数:6
相关论文
共 22 条
[1]  
[Anonymous], 2016, PROC 4 INT C LEARN R
[2]  
[Anonymous], 2017, BMVC
[3]   Privacy preserving crowd monitoring: Counting people without people models or tracking [J].
Chan, Antoni B. ;
Liang, Zhang-Sheng John ;
Vasconcelos, Nuno .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, :1766-1772
[4]   Sparse Reconstruction Cost for Abnormal Event Detection [J].
Cong, Yang ;
Yuan, Junsong ;
Liu, Ji .
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, :1807-+
[5]   A Discriminative Framework for Anomaly Detection in Large Videos [J].
Del Giorno, Allison ;
Bagnell, J. Andrew ;
Hebert, Martial .
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :334-349
[6]   FlowNet: Learning Optical Flow with Convolutional Networks [J].
Dosovitskiy, Alexey ;
Fischer, Philipp ;
Ilg, Eddy ;
Haeusser, Philip ;
Hazirbas, Caner ;
Golkov, Vladimir ;
van der Smagt, Patrick ;
Cremers, Daniel ;
Brox, Thomas .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2758-2766
[7]  
Gong D., 2019, ICCV
[8]   Learning Temporal Regularity in Video Sequences [J].
Hasan, Mahmudul ;
Choi, Jonghyun ;
Neumann, Jan ;
Roy-Chowdhury, Amit K. ;
Davis, Larry S. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :733-742
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge [J].
Hinami, Ryota ;
Mei, Tao ;
Satoh, Shin'ichi .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3639-3647