Weakly Supervised Video Anomaly Detection via Transformer-Enabled Temporal Relation Learning

Cited by: 22
Authors
Zhang, Dasheng [1]
Huang, Chao [2]
Liu, Chengliang [2]
Xu, Yong [2,3]
Affiliations
[1] Chongqing Univ, Sch Artificial Intelligence, Chongqing 401135, Peoples R China
[2] Harbin Inst Technol, Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen 518055, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518055, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
Feature extraction; Transformers; Task analysis; Anomaly detection; Training; Surveillance; Training data; Deep learning; video anomaly detection; vision transformer; weakly-supervised learning;
DOI
10.1109/LSP.2022.3175092
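Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract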
Weakly supervised video anomaly detection is challenging because training videos lack frame-level labels. Most previous works tackle this task with the multiple instance learning paradigm, which divides a video into multiple snippets and trains a snippet classifier to distinguish anomalous snippets from normal ones using only video-level supervision. Although existing approaches have made remarkable progress, they are still limited by insufficient feature representations. In this paper, we propose a novel weakly supervised temporal relation learning framework for anomaly detection, which efficiently explores the temporal relations between snippets and enhances the discriminative power of the features using only videos with video-level labels. To this end, we design a transformer-enabled feature encoder that converts the input task-agnostic features into discriminative task-specific features by mining the semantic correlations and positional relations between video snippets. As a result, our model can detect anomalies in the current video snippet more accurately based on the learned discriminative features. Experimental results indicate that the proposed method outperforms existing state-of-the-art approaches, which demonstrates the effectiveness of our model.
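The multiple instance learning setup mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the top-k aggregation, and the hinge ranking loss with a margin are all assumptions chosen for illustration, and the snippet scores are made-up numbers standing in for the output of a snippet classifier.

```python
from typing import List


def split_into_snippets(num_frames: int, num_snippets: int) -> List[range]:
    """Divide frame indices 0..num_frames-1 into contiguous, near-equal snippets."""
    bounds = [round(i * num_frames / num_snippets) for i in range(num_snippets + 1)]
    return [range(bounds[i], bounds[i + 1]) for i in range(num_snippets)]


def video_score(snippet_scores: List[float], k: int = 1) -> float:
    """Aggregate snippet anomaly scores into one video-level score (mean of top-k).

    Assumption: top-k pooling is one common MIL aggregation, not necessarily
    the one used in the paper.
    """
    top = sorted(snippet_scores, reverse=True)[:k]
    return sum(top) / len(top)


def mil_ranking_loss(
    abnormal_scores: List[float],
    normal_scores: List[float],
    margin: float = 1.0,
    k: int = 1,
) -> float:
    """Hinge ranking loss driven only by video-level labels (hypothetical form).

    The highest-scoring snippets of a video labelled abnormal are pushed to
    score above the highest-scoring snippets of a video labelled normal.
    """
    gap = video_score(abnormal_scores, k) - video_score(normal_scores, k)
    return max(0.0, margin - gap)


# Illustrative snippet scores for one abnormal and one normal video.
snippets = split_into_snippets(num_frames=10, num_snippets=3)
loss = mil_ranking_loss([0.1, 0.9, 0.2], [0.1, 0.2, 0.1])
```

No frame-level label appears anywhere in the loss: the only supervision is which of the two videos is labelled abnormal, which is the weak-supervision constraint the abstract describes.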
Pages: 1197-1201
Number of pages: 5
Related papers
37 in total
  • [1] Quantum neural network-based multilabel image classification in high-resolution unmanned aerial vehicle imagery
    Abdel-Khalek, Sayed
    Algarni, Mariam
    Mansour, Romany F.
    Gupta, Deepak
    Ilayaraja, M.
    [J]. SOFT COMPUTING, 2023, 27 (18) : 13027 - 13038
  • [2] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
    Carreira, Joao
    Zisserman, Andrew
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
  • [3] Contrastive Attention for Video Anomaly Detection
    Chang, Shuning
    Li, Yanchao
    Shen, Shengmei
    Feng, Jiashi
    Zhou, Zhiying
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4067 - 4076
  • [4] MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection
    Feng, Jia-Chang
    Hong, Fa-Ting
    Zheng, Wei-Shi
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14004 - 14013
  • [5] A Background-Agnostic Framework With Adversarial Training for Abnormal Event Detection in Video
    Georgescu, Mariana Iuliana
    Ionescu, Radu Tudor
    Khan, Fahad Shahbaz
    Popescu, Marius
    Shah, Mubarak
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 4505 - 4523
  • [6] Anomaly Detection in Video via Self-Supervised and Multi-Task Learning
    Georgescu, Mariana-Iuliana
    Barbalau, Antonio
    Ionescu, Radu Tudor
    Khan, Fahad Shahbaz
    Popescu, Marius
    Shah, Mubarak
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12737 - 12747
  • [7] Memorizing Normality to Detect Anomaly: Memory-augmented Deep Autoencoder for Unsupervised Anomaly Detection
    Gong, Dong
    Liu, Lingqiao
    Le, Vuong
    Saha, Budhaditya
    Mansour, Moussa Reda
    Venkatesh, Svetha
    van den Hengel, Anton
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1705 - 1714
  • [8] Learning Temporal Regularity in Video Sequences
    Hasan, Mahmudul
    Choi, Jonghyun
    Neumann, Jan
    Roy-Chowdhury, Amit K.
    Davis, Larry S.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 733 - 742
  • [9] Self-Supervised Attentive Generative Adversarial Networks for Video Anomaly Detection
    Huang, Chao
    Wen, Jie
    Xu, Yong
    Jiang, Qiuping
    Yang, Jian
    Wang, Yaowei
    Zhang, David
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 9389 - 9403
  • [10] Abnormal Event Detection Using Deep Contrastive Learning for Intelligent Video Surveillance System
    Huang, Chao
    Wu, Zhihao
    Wen, Jie
    Xu, Yong
    Jiang, Qiuping
    Wang, Yaowei
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (08) : 5171 - 5179