Future Frame Prediction Network for Video Anomaly Detection

Cited by: 64
Authors
Luo, Weixin [1, 2, 3]
Liu, Wen [1, 2, 3]
Lian, Dongze [1, 2, 3]
Gao, Shenghua [4]
Affiliations
[1] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
[2] Chinese Acad Sci, Shanghai Inst Microsyst & Informat Technol, Shanghai 200050, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[4] ShanghaiTech Univ, Shanghai Engn Res Ctr Intelligent Vis & Imaging, Shanghai 201210, Peoples R China
Funding
National Key R&D Program of China
Keywords
Optical losses; Adaptation models; Visualization; Sensitivity; Uncertainty; Toy manufacturing industry; Training data; Video anomaly detection; prediction network; graph neural networks; meta learning; EVENT DETECTION; HISTOGRAMS; FLOW;
DOI
10.1109/TPAMI.2021.3129349
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Video anomaly detection refers to the identification of events that do not conform to expected behavior. However, almost all existing methods cast this problem as the minimization of reconstruction errors on training data that includes only normal events, which may lead to self-reconstruction and cannot guarantee a larger reconstruction error for an abnormal event. In this paper, we propose to formulate the video anomaly detection problem within a regime of video prediction. We argue that not all video prediction networks are suitable for video anomaly detection, and we introduce two principles for the design of a video prediction network for this task. Based on them, we design a video prediction network with appearance and motion constraints for video anomaly detection. Further, to promote the generalization of prediction-based video anomaly detection to novel scenes, we investigate the use of meta learning within our framework, so that our model can be quickly adapted to a new testing scene with only a few starting frames. Extensive experiments on both a toy dataset and three real datasets validate the effectiveness of our method in terms of robustness to the uncertainty in normal events and sensitivity to abnormal events.
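The prediction-based formulation in the abstract can be illustrated with a minimal sketch: a frame whose prediction differs strongly from the observed frame is treated as more likely to be abnormal. The code below is an illustration only and not the authors' method. The prediction network is replaced by hard-coded "predicted" frames, and the function names (`prediction_error`, `anomaly_scores`), the mean squared error measure, and the min-max normalization are all assumptions chosen for simplicity.

```python
def prediction_error(predicted, actual):
    """Mean squared error between two frames given as flat lists of pixel values."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("frames must be non-empty and the same size")
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(predicted)


def anomaly_scores(errors):
    """Min-max normalize per-frame errors to [0, 1]; higher means more anomalous."""
    lo, hi = min(errors), max(errors)
    if hi == lo:
        # All frames are predicted equally well, so none stands out.
        return [0.0] * len(errors)
    return [(e - lo) / (hi - lo) for e in errors]


# Hypothetical toy data: three 4-pixel frames. A real system would obtain
# `predicted` from a trained prediction network (assumed here, not shown).
actual = [[0.1, 0.2, 0.3, 0.4]] * 3
predicted = [
    [0.1, 0.2, 0.3, 0.4],  # predicted exactly
    [0.1, 0.2, 0.3, 0.5],  # small deviation
    [0.9, 0.9, 0.9, 0.9],  # poorly predicted, a candidate anomaly
]

errors = [prediction_error(p, a) for p, a in zip(predicted, actual)]
scores = anomaly_scores(errors)
```

In this toy setup the third frame receives the highest score because it is the one the stand-in predictor gets most wrong. How to threshold such scores is left open here.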
Pages: 7505-7520
Number of pages: 16