Multi-level feature splicing 3D network based on multi-task joint learning for video anomaly detection

被引:0
作者
Li, Yang [1 ]
Tong, Guoxiang [1 ]
机构
[1] Univ Shanghai Sci & Technol, 516 Jungong Rd, Shanghai 200093, Peoples R China
关键词
Video anomaly detection; Multi-task learning; Pseudo-anomaly; Feature splicing; Attention gating; ABNORMAL EVENT DETECTION;
D O I
10.1016/j.neucom.2025.129964
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In video anomaly detection research, deep learning is dedicated to identifying anomalous events accurately and efficiently. However, due to the scarcity and diversity of anomaly samples, previous methods have not adequately taken into account important information about location and timing. In addition, the overpowered generalization ability of the models leads to the fact that anomalies can also be well reconstructed or predicted. To address the above challenges, we propose a 3D network based on multi-level feature splicing with joint multi-task learning. The network is improved by the autoencoder (AE) as a backbone network. Firstly, we design a normal sample training task and a Gaussian noise task from a spatial perspective to enhance the reconstruction of positive samples. The frame-skipping task and the inverse sequence task of the video are designed from the temporal perspective to suppress the reconstruction ability of negative samples. Secondly, we use multi-level feature splicing in the encoding and decoding process to equip the network with the ability to explore sufficient information from the full scale. At the same time, we use an attention gating module to filter redundant features. The results show that our network is competitive with state-of-the-art methods. In terms of AUC, UCSD Ped2 achieves 99.3%, CUHK Avenue achieves 88.4%, and ShanghaiTech Campus achieves 74.2%.
引用
收藏
页数:13
相关论文
共 57 条
[1]   Latent Space Autoregression for Novelty Detection [J].
Abati, Davide ;
Porrello, Angelo ;
Calderara, Simone ;
Cucchiara, Rita .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :481-490
[2]   Collaborative Learning of Anomalies with Privacy (CLAP) for Unsupervised Video Anomaly Detection: A New Baseline [J].
Al-lahham, Anas ;
Zaheer, Muhammad Zaigham ;
Tastan, Nurbek ;
Nandakumar, Karthik .
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, :12416-12425
[3]   Synthetic Temporal Anomaly Guided End-to-End Video Anomaly Detection [J].
Astrid, Marcella ;
Zaheer, Muhammad Zaigham ;
Lee, Seung-Ik .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :207-214
[4]   Video anomaly detection with multi-scale feature and temporal information fusion [J].
Cai, Yiheng ;
Liu, Jiaqi ;
Guo, Yajun ;
Hu, Shaobin ;
Lang, Shinan .
NEUROCOMPUTING, 2021, 423 :264-273
[5]  
Chalapathy R, 2019, Arxiv, DOI [arXiv:1802.06360, DOI 10.48550/ARXIV.1802.06360]
[6]   Clustering Driven Deep Autoencoder for Video Anomaly Detection [J].
Chang, Yunpeng ;
Tu, Zhigang ;
Xie, Wei ;
Yuan, Junsong .
COMPUTER VISION - ECCV 2020, PT XV, 2020, 12360 :329-345
[7]   Sparse Coding Guided Spatiotemporal Feature Learning for Abnormal Event Detection in Large Videos [J].
Chu, Wenqing ;
Xue, Hongyang ;
Yao, Chengwei ;
Cai, Deng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (01) :246-255
[8]   Towards Interpretable Video Anomaly Detection [J].
Doshi, Keval ;
Yilmaz, Yasin .
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, :2654-2663
[9]   Anomaly Detection in Video via Self-Supervised and Multi-Task Learning [J].
Georgescu, Mariana-Iuliana ;
Barbalau, Antonio ;
Ionescu, Radu Tudor ;
Khan, Fahad Shahbaz ;
Popescu, Marius ;
Shah, Mubarak .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12737-12747
[10]   Memorizing Normality to Detect Anomaly: Memory-augmented Deep Autoencoder for Unsupervised Anomaly Detection [J].
Gong, Dong ;
Liu, Lingqiao ;
Le, Vuong ;
Saha, Budhaditya ;
Mansour, Moussa Reda ;
Venkatesh, Svetha ;
van den Hengel, Anton .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1705-1714