Video anomaly detection is a critical component of intelligent video surveillance systems, extensively deployed and researched in industry and academia. However, existing methods generalize so strongly that they predict anomalous samples almost as well as normal ones, and they cannot fully exploit the high-level semantic and temporal contextual information in videos, resulting in unstable prediction performance. To alleviate these issues, we propose an encoder-decoder model named SMAMS, based on a spatiotemporal masked autoencoder and memory modules. First, we represent video events as spatiotemporal cubes and mask a subset of them. Then, the unmasked patches are fed into the spatiotemporal masked autoencoder to extract high-level semantic and spatiotemporal features of the video events. Next, we add multiple memory modules to store unmasked video patches at different feature layers. Finally, skip connections are introduced to compensate for the crucial information loss caused by the memory modules. Experimental results show that the proposed method outperforms state-of-the-art methods, achieving AUC scores of 99.9%, 94.8%, and 78.9% on the UCSD Ped2, CUHK Avenue, and ShanghaiTech datasets, respectively.
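The pipeline described above can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the cube size, mask ratio, linear "encoder"/"decoder", and nearest-prototype memory lookup are all simplifying assumptions chosen only to show how cube masking, memory retrieval, and a skip connection fit together.

```python
import numpy as np

rng = np.random.default_rng(0)

def patchify(video, cube=(4, 8, 8)):
    # Split a (T, H, W) clip into non-overlapping spatiotemporal cubes,
    # each flattened to a vector (assumed cube size, not the paper's).
    t, h, w = cube
    T, H, W = video.shape
    cubes = video.reshape(T // t, t, H // h, h, W // w, w)
    return cubes.transpose(0, 2, 4, 1, 3, 5).reshape(-1, t * h * w)

def memory_read(z, memory):
    # Stand-in for a memory module: replace each latent vector by its
    # nearest stored prototype under cosine similarity.
    zn = z / np.linalg.norm(z, axis=1, keepdims=True)
    mn = memory / np.linalg.norm(memory, axis=1, keepdims=True)
    idx = (zn @ mn.T).argmax(axis=1)
    return memory[idx]

# Toy 16-frame 64x64 clip in place of a real video event.
video = rng.standard_normal((16, 64, 64))
patches = patchify(video)                  # (256, 256) cube vectors
keep = rng.random(len(patches)) > 0.75     # mask ~75% of the cubes
visible = patches[keep]                    # only unmasked patches are encoded

W_enc = rng.standard_normal((patches.shape[1], 32)) * 0.1
z = visible @ W_enc                        # "encoder": a linear projection
memory = z[:10].copy()                     # memory slots seeded from normal data
z_mem = memory_read(z, memory)             # retrieve nearest prototypes
z_out = z_mem + z                          # skip connection restores lost detail

W_dec = rng.standard_normal((32, patches.shape[1])) * 0.1
recon = z_out @ W_dec                      # "decoder" back to cube space
error = float(np.mean((recon - visible) ** 2))  # reconstruction error as score
```

Because the memory holds prototypes of normal patterns, anomalous inputs retrieve poorly matching slots and yield larger reconstruction errors, which is the signal thresholded at test time; the skip connection keeps normal inputs from also being distorted by the memory bottleneck.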