Domain generalization for video anomaly detection considering diverse anomaly types

被引:1
作者
Wang, Zhiqiang [1 ]
Gu, Xiaojing [1 ]
Yan, Huaicheng [1 ]
Gu, Xingsheng [1 ]
机构
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Video anomaly detection; Domain generalization; Unsupervised learning; Domain expansion; EVENT DETECTION;
D O I
10.1007/s11760-024-03033-3
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In intelligent video surveillance, anomaly detection is conducted to identify the occurrence of abnormal events by monitoring the video captured by vision sensors, and it has important application value in public safety, industrial production process monitoring, and other fields. However, the study of video anomaly detection (VAD) models that generalize into uncharted territory remains challenging. The current approach is to use more abnormal samples to enhance the training and improve the generalization ability of the model, but this requires a large number of auxiliary datasets to fully describe the abnormal events. In addition, the ambiguity of the abnormality definition makes it impossible to effectively cover all abnormal videos. To solve this problem, we divide abnormalities into three different types based on object and behavior type: normal object and abnormal behavior (NOAB), abnormal object and normal behavior (AONB), and abnormal object and abnormal behavior (AOAB). We find that the traditional prediction-based model shows better generalization ability for NOAB. However, under the domain generalization setting, the generalization effect of AONB and AOAB decreases significantly. To solve this problem, a new spatiotemporal generalization (STG) model is proposed specifically for the detection of AONB and AOAB events involving anomalous objects. The STG model integrates contrast learning and adaptive data augmentation techniques to realize domain expansion. In addition, combining the STG model with the traditional prediction-based model, an anomaly video monitoring framework is proposed to comprehensively detect anomalies without target domain adaptation, which can improve the generalization ability of VAD models without auxiliary datasets. Extensive evaluations show that the proposed method achieves excellent performance on the benchmark datasets under the domain generalization setting.
引用
收藏
页码:3691 / 3704
页数:14
相关论文
共 37 条
[21]   Few-Shot Scene-Adaptive Anomaly Detection [J].
Lu, Yiwei ;
Yu, Frank ;
Reddy, Mahesh Kumar Krishna ;
Wang, Yang .
COMPUTER VISION - ECCV 2020, PT V, 2020, 12350 :125-141
[22]   A Revisit of Sparse Coding Based Anomaly Detection in Stacked RNN Framework [J].
Luo, Weixin ;
Liu, Wen ;
Gao, Shenghua .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :341-349
[23]   Learning Normal Dynamics in Videos with Meta Prototype Network [J].
Lv, Hui ;
Chen, Chen ;
Cui, Zhen ;
Xu, Chunyan ;
Li, Yong ;
Yang, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15420-15429
[24]  
Matsuura T, 2020, AAAI CONF ARTIF INTE, V34, P11749
[25]  
Ramachandra B, 2020, IEEE WINT CONF APPL, P2558, DOI [10.1109/WACV45572.2020.9093457, 10.1109/wacv45572.2020.9093457]
[26]   U-Net: Convolutional Networks for Biomedical Image Segmentation [J].
Ronneberger, Olaf ;
Fischer, Philipp ;
Brox, Thomas .
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241
[27]   Economic Comparison between Battery and Supercapacitor for Hourly Dispatching Wave Energy Converter Power [J].
Roy, Pranoy Kumar Singha ;
Karayaka, H. Bora ;
He, JiangBiao ;
Yu, Yi-Hsiang .
2020 52ND NORTH AMERICAN POWER SYMPOSIUM (NAPS), 2021,
[28]   Real-world Anomaly Detection in Surveillance Videos [J].
Sultani, Waqas ;
Chen, Chen ;
Shah, Mubarak .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6479-6488
[29]   Goal-based trajectory analysis for unusual behaviour detection in intelligent surveillance [J].
Tung, Frederick ;
Zelek, John S. ;
Clausi, David A. .
IMAGE AND VISION COMPUTING, 2011, 29 (04) :230-240
[30]  
Volpi R, 2018, ADV NEUR IN, V31