Cognition Guided Video Anomaly Detection Framework for Surveillance Services

被引:0
作者
Zhang, Menghao [1 ]
Wang, Jingyu [1 ]
Qi, Qi [1 ]
Zhuang, Zirui [1 ]
Sun, Haifeng [1 ]
Liao, Jianxin [1 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Anomaly detection; Surveillance; Knowledge engineering; Task analysis; Visualization; Explosions; Data models; Multi-layer GCN; prior knowledge; prompt tuning; video anomaly detection; surveillance service;
D O I
10.1109/TSC.2024.3407588
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The aim of surveillance services is to detect anomalous events that occur in given surveillance videos. Most existing video anomaly detection methods rely on minimizing reconstruction or prediction errors due to the lack of abnormal data, which results in poor generalization and overfitting. In fact, cognitions for anomalies in surveillance videos mainly relies on crucial relationships, including ones between objects and ones between objects and scenes. Focusing on this property of anomaly detection, a Cognition Guided Video Anomaly Detection framework based on prior knowledge is proposed, called CG-VAD. CG-VAD introduces both explicit and implicit prior knowledge into the frame prediction network to let the model exploit crucial relationships. Explicit knowledge containing crucial relationships related to anomaly is introduced into the anomaly detection model through a proposed embedding network based on multi-layer Graph Convolutional Networks. Implicit knowledge in the form of learnable parameters enhances the ability of the model to learn crucial relationships through prompt tuning. By integrating prior knowledge to focus the model on the relationships associated with the anomaly, we find that CG-VAD is not only quick to adapt to new real-world scenarios, but it is also able to recognize the type of anomaly. We have conducted extensive experiments on four benchmark datasets and the results indicate that the proposed method outperforms previous methods. Specifically, CG-VAD achieves an AUROC score of 87.2$\%$% on the ShanghaiTech dataset.
引用
收藏
页码:2109 / 2123
页数:15
相关论文
共 81 条
[21]   Memorizing Normality to Detect Anomaly: Memory-augmented Deep Autoencoder for Unsupervised Anomaly Detection [J].
Gong, Dong ;
Liu, Lingqiao ;
Le, Vuong ;
Saha, Budhaditya ;
Mansour, Moussa Reda ;
Venkatesh, Svetha ;
van den Hengel, Anton .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1705-1714
[22]   CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition [J].
Gowda, Shreyank N. ;
Sevilla-Lara, Laura ;
Keller, Frank ;
Rohrbach, Marcus .
COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 :187-203
[23]   ECCVideo: A Scalable Edge Cloud Collaborative Video Analysis System [J].
Han, Qing ;
Ren, Xuebin ;
Zhao, Peng ;
Wang, Yimeng ;
Wang, Luhui ;
Yang, Xinyu ;
Zhao, Cong .
IEEE INTELLIGENT SYSTEMS, 2023, 38 (01) :34-44
[24]   Self-Supervised Attentive Generative Adversarial Networks for Video Anomaly Detection [J].
Huang, Chao ;
Wen, Jie ;
Xu, Yong ;
Jiang, Qiuping ;
Yang, Jian ;
Wang, Yaowei ;
Zhang, David .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) :9389-9403
[25]   Category Contrast for Unsupervised Domain Adaptation in Visual Tasks [J].
Huang, Jiaxing ;
Guan, Dayan ;
Xiao, Aoran ;
Lu, Shijian ;
Shao, Ling .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :1193-1204
[26]   Boosting Variational Inference With Margin Learning for Few-Shot Scene-Adaptive Anomaly Detection [J].
Huang, Xin ;
Hu, Yutao ;
Luo, Xiaoyan ;
Han, Jungong ;
Zhang, Baochang ;
Cao, Xianbin .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (06) :2813-2825
[27]   Object-centric Auto-encoders and Dummy Anomalies for Abnormal Event Detection in Video [J].
Ionescu, Radu Tudor ;
Khan, Fahad Shahbaz ;
Georgescu, Mariana-Iuliana ;
Shao, Ling .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7834-7843
[28]   Visual Prompt Tuning [J].
Jia, Menglin ;
Tang, Luming ;
Chen, Bor-Chun ;
Cardie, Claire ;
Belongie, Serge ;
Hariharan, Bharath ;
Lim, Ser-Nam .
COMPUTER VISION - ECCV 2022, PT XXXIII, 2022, 13693 :709-727
[29]   The MegaFace Benchmark: 1 Million Faces for Recognition at Scale [J].
Kemelmacher-Shlizerman, Ira ;
Seitz, Steven M. ;
Miller, Daniel ;
Brossard, Evan .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4873-4882
[30]   Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations [J].
Krishna, Ranjay ;
Zhu, Yuke ;
Groth, Oliver ;
Johnson, Justin ;
Hata, Kenji ;
Kravitz, Joshua ;
Chen, Stephanie ;
Kalantidis, Yannis ;
Li, Li-Jia ;
Shamma, David A. ;
Bernstein, Michael S. ;
Li Fei-Fei .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 123 (01) :32-73