Fast and accurate novelty detection for large surveillance video

被引:3
作者
Tang, Shanjiang [1 ]
Wang, Ziyi [2 ]
Yu, Ce [3 ]
Sun, Chao [4 ]
Li, Yusen [5 ]
Xiao, Jian [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Tianjin Univ, Tianjin, Peoples R China
[3] Tianjin Univ, High Performance Comp Lab HPCL Comp Sci & Technol, Tianjin, Peoples R China
[4] Tianjin Univ, High Performance Comp Ctr, Tianjin, Peoples R China
[5] Nankai Univ, Dept Comp Sci & Secur, Ianjin, Peoples R China
关键词
Novelty detection; Big data; Surveillance videos; Optical flow; Adaptive frame sampling; ANOMALY DETECTION;
D O I
10.1007/s42514-024-00185-z
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, fast and accurate novelty detection is crucial for public safety and security in surveillance videos. Given the high accuracy of deep learning technique, deep learning based novel detection is a trend. With the huge amount of surveillance videos being generated by surveillance cameras at any time, it is challenging to make novelty detection in surveillance videos efficiently while guaranteeing the accuracy. To address it, we propose a dynamic frame sampling method called ORLNet with both the frame similarity and the intensity of the object movement considered. It is based on the two observations as follows: firstly, there is a high similarity between adjacent frames in a video data. Secondly, in practice, since novel behaviors are always generated by moving targets, we only need to focus on a small number of frames that contain key information which we call key frames. Specifically, ORLNet speeds up surveillance video by setting a reinforcement learning agent to dynamically determine the indexes of key frames at run-time and replace end-to-end inference at non-key frame positions by reusing the last key frame's calculation. Typically, it defines frame similarity as novelty energy, which is the combination of novel semantic and motion features. On the premise of calculating the distance of novel energy between frames, the calculation of key frames can be reused for other frames corresponding to similar novelty energies, which can thus accelerate novelty detection while maintain accuracy. Finally, we evaluate ORLNet experimentally with two surveillance video datasets by comparing with existing methods. Experimental results show that ORLNet reduces processing time by 42% while guaranteeing the accuracy.
引用
收藏
页码:130 / 149
页数:20
相关论文
共 39 条
[1]   Latent Space Autoregression for Novelty Detection [J].
Abati, Davide ;
Porrello, Angelo ;
Calderara, Simone ;
Cucchiara, Rita .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :481-490
[2]  
Adhikari BM., 2020, MULTIMED TOOLS APPL, V79, P14823
[3]   THE BELLMAN EQUATION FOR MINIMIZING THE MAXIMUM COST [J].
BARRON, EN ;
ISHII, H .
NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 1989, 13 (09) :1067-1090
[4]   CONTINUOUS INFERIOR VENA CAVA DIAMETER TRACKING THROUGH AN ITERATIVE KANADE-LUCAS-TOMASI-BASED ALGORITHM [J].
Belmont, Barry ;
Kessler, Ross ;
Theyyunni, Nikhil ;
Fung, Christopher ;
Huang, Robert ;
Cover, Michael ;
Ward, Kevin R. ;
Shih, Albert J. ;
Tiba, Mohamad .
ULTRASOUND IN MEDICINE AND BIOLOGY, 2018, 44 (12) :2793-2801
[5]  
Chakraborty S., 2021, IEEE T IND INFORM, V17, P1773
[6]   Anomaly detection for wind turbines based on the reconstruction of condition parameters using stacked denoising autoencoders [J].
Chen, Junsheng ;
Li, Jian ;
Chen, Weigen ;
Wang, Youyuan ;
Jiang, Tianyan .
RENEWABLE ENERGY, 2020, 147 :1469-1480
[7]  
Dynkin E.B., 1965, Markov Processes, P77
[8]  
Fan JQ, 2020, PR MACH LEARN RES, V120, P486
[9]  
Fang Y., 2021, IEEE T MULTIMEDIA, DOI [10.1109/TMM.2021.3094866, DOI 10.1109/TMM.2021.3094866]
[10]  
Ghavamzadeh M., 2006, ADV NEURAL INF PROCE, V19