Weakly-Supervised Video Anomaly Detection With Snippet Anomalous Attention

被引:16
作者
Fan, Yidan [1 ]
Yu, Yongxin
Lu, Wenhuan
Han, Yahong [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Anomaly detection; Task analysis; Location awareness; Training; Optimization; Annotations; Weakly-supervised; anomaly detection; snippet anomalous attention; multi-branch supervision;
D O I
10.1109/TCSVT.2024.3350084
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With a focus on abnormal events contained within untrimmed videos, there is increasing interest among researchers in video anomaly detection. Among different video anomaly detection scenarios, weakly-supervised video anomaly detection poses a significant challenge as it lacks frame-wise labels during the training stage, only relying on video-level labels as coarse supervision. Previous methods have made attempts to either learn discriminative features in an end-to-end manner or employ a two-stage self-training strategy to generate snippet-level pseudo labels. However, both approaches have certain limitations. The former tends to overlook informative features at the snippet level, while the latter can be susceptible to noises. In this paper, we propose an Anomalous Attention mechanism for weakly-supervised anomaly detection to tackle the aforementioned problems. Our approach takes into account snippet-level encoded features without the supervision of pseudo labels. Specifically, our approach first generates snippet-level anomalous attention and then feeds it together with original anomaly scores into a Multi-branch Supervision Module. The module learns different areas of the video, including areas that are challenging to detect, and also assists the attention optimization. Experiments on benchmark datasets XD-Violence and UCF-Crime verify the effectiveness of our method. Besides, thanks to the proposed snippet-level attention, we obtain a more precise anomaly localization.
引用
收藏
页码:5480 / 5492
页数:13
相关论文
共 68 条
[1]   Adaptive Graph Convolutional Networks for Weakly Supervised Anomaly Detection in Videos [J].
Cao, Congqi ;
Zhang, Xin ;
Zhang, Shizhou ;
Wang, Peng ;
Zhang, Yanning .
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 :2497-2501
[2]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[3]  
Chen YX, 2023, AAAI CONF ARTIF INTE, P387
[4]   Look Around for Anomalies: Weakly-supervised Anomaly Detection via Context-Motion Relational Learning [J].
Cho, MyeongAh ;
Kim, Minjung ;
Hwang, Sangwon ;
Park, Chaewon ;
Lee, Kyungjae ;
Lee, Sangyoun .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :12137-12146
[5]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497
[6]   MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection [J].
Feng, Jia-Chang ;
Hong, Fa-Ting ;
Zheng, Wei-Shi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14004-14013
[7]   Visual commonsense reasoning with directional visual connections [J].
Han, Yahong ;
Wu, Aming ;
Zhu, Linchao ;
Yang, Yi .
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2021, 22 (05) :625-637
[8]   Cross-modal Consensus Network forWeakly Supervised Temporal Action Localization [J].
Hong, Fa-Ting ;
Feng, Jia-Chang ;
Xu, Dan ;
Shan, Ying ;
Zheng, Wei-Shi .
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :1591-1599
[9]   Self-Supervised Attentive Generative Adversarial Networks for Video Anomaly Detection [J].
Huang, Chao ;
Wen, Jie ;
Xu, Yong ;
Jiang, Qiuping ;
Yang, Jian ;
Wang, Yaowei ;
Zhang, David .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) :9389-9403
[10]   Abnormal Event Detection Using Deep Contrastive Learning for Intelligent Video Surveillance System [J].
Huang, Chao ;
Wu, Zhihao ;
Wen, Jie ;
Xu, Yong ;
Jiang, Qiuping ;
Wang, Yaowei .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (08) :5171-5179