Weakly Supervised Video Anomaly Detection via Transformer-Enabled Temporal Relation Learning

Cited by: 22

Authors:

Zhang, Dasheng ^{[1]}

Huang, Chao ^{[2]}

Liu, Chengliang ^{[2]}

Xu, Yong ^{[2,3]}

Affiliations:

[1] Chongqing Univ, Sch Artificial Intelligence, Chongqing 401135, Peoples R China

[2] Harbin Inst Technol, Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen 518055, Peoples R China

[3] Peng Cheng Lab, Shenzhen 518055, Peoples R China

Source:

IEEE SIGNAL PROCESSING LETTERS | 2022 / Vol. 29

Funding:

National Key Research and Development Program of China;

Keywords:

Feature extraction; Transformers; Task analysis; Anomaly detection; Training; Surveillance; Training data; Deep learning; video anomaly detection; vision transformer; weakly-supervised learning;

DOI:

10.1109/LSP.2022.3175092

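Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification code: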

0808 ; 0809 ;

Abstract:

Weakly supervised video anomaly detection is a challenging problem due to the lack of frame-level labels in training videos. Most previous works tackle this task with the multiple instance learning paradigm, which divides a video into multiple snippets and trains a snippet classifier to distinguish anomalous snippets from normal ones using video-level supervision. Although existing approaches have achieved remarkable progress, they are still limited by insufficient feature representations. In this paper, we propose a novel weakly supervised temporal relation learning framework for anomaly detection, which efficiently explores the temporal relations between snippets and enhances the discriminative power of features using only videos with video-level labels. To this end, we design a transformer-enabled feature encoder that converts the input task-agnostic features into discriminative task-specific features by mining the semantic correlation and positional relation between video snippets. As a result, our model can detect anomalies in the current video snippet more accurately based on the learned discriminative features. Experimental results indicate that the proposed method is superior to existing state-of-the-art approaches, which demonstrates the effectiveness of our model.
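
The multiple instance learning setup described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the identity-projection self-attention, the linear scorer, the weights `w` and `b`, the toy features, and the hinge-style ranking loss are all assumptions chosen for brevity, and positional information is omitted. It only shows how snippet features can be mixed across time, scored per snippet, and trained with video-level labels alone.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(snippets):
    # Single-head dot-product self-attention with identity projections
    # (an assumption; a real encoder would learn query/key/value maps).
    d = len(snippets[0])
    out = []
    for q in snippets:
        # Similarity of this snippet to every snippet in the same video.
        sims = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in snippets]
        weights = softmax(sims)
        # Each output feature is a weighted mix of all snippet features.
        out.append([sum(wt * v[j] for wt, v in zip(weights, snippets)) for j in range(d)])
    return out

def snippet_scores(snippets, w, b):
    # Hypothetical linear scorer followed by a sigmoid: one score per snippet.
    return [1.0 / (1.0 + math.exp(-(sum(wi * xi for wi, xi in zip(w, x)) + b)))
            for x in snippets]

def mil_ranking_loss(abnormal_scores, normal_scores, margin=1.0):
    # Hinge loss that pushes the highest score in the abnormal video (bag)
    # above the highest score in the normal video, using only video labels.
    return max(0.0, margin - max(abnormal_scores) + max(normal_scores))

# Toy 2-D snippet features: the second snippet of the abnormal video stands out.
abnormal_video = [[0.1, 0.0], [2.0, 2.0], [0.0, 0.1]]
normal_video = [[0.1, 0.0], [0.0, 0.1], [0.1, 0.1]]
w, b = [1.0, 1.0], -1.0  # hypothetical, untrained weights

abn = snippet_scores(self_attention(abnormal_video), w, b)
nor = snippet_scores(self_attention(normal_video), w, b)
loss = mil_ranking_loss(abn, nor)
print(abn, nor, loss)
```

In this sketch the video-level score is the maximum snippet score, so no frame-level label is needed; at test time the per-snippet scores themselves would serve as the anomaly scores.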

Pages: 1197-1201

Page count: 5
