Spatiotemporal Masked Autoencoder with Multi-Memory and Skip Connections for Video Anomaly Detection

被引:5
|
作者
Fu, Yan [1 ]
Yang, Bao [1 ]
Ye, Ou [1 ]
机构
[1] Xian Univ Sci & Technol, Sch Comp Sci & Technol, Xian 710054, Peoples R China
关键词
video anomaly detection; memory network; spatiotemporal masked autoencoder; vision transformer; skip connections;
D O I
10.3390/electronics13020353
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video anomaly detection is a critical component of intelligent video surveillance systems,extensively deployed and researched in industry and academia. However, existing methods have astrong generalization ability for predicting anomaly samples. They cannot utilize high-level semanticand temporal contextual information in videos, resulting in unstable prediction performance. Toalleviate this issue, we propose an encoder-decoder model named SMAMS, based on spatiotemporalmasked autoencoder and memory modules. First, we represent and mask some of the video eventsusing spatiotemporal cubes. Then, the unmasked patches are inputted into the spatiotemporalmasked autoencoder to extract high-level semantic and spatiotemporal features of the video events.Next, we add multiple memory modules to store unmasked video patches of different feature layers.Finally, skip connections are introduced to compensate for crucial information loss caused by thememory modules. Experimental results show that the proposed method outperforms state-of-the-artmethods, achieving AUC scores of 99.9%, 94.8%, and 78.9% on the UCSD Ped2, CUHK Avenue, andShanghai Tech datasets.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Unsupervised video anomaly detection based on multi-timescale trajectory prediction
    Sun, Qiyue
    Yang, Yang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 227
  • [42] Video anomaly detection with multi-scale feature and temporal information fusion
    Cai, Yiheng
    Liu, Jiaqi
    Guo, Yajun
    Hu, Shaobin
    Lang, Shinan
    NEUROCOMPUTING, 2021, 423 : 264 - 273
  • [43] OBJECT-CENTRIC AND MEMORY-GUIDED NORMALITY RECONSTRUCTION FOR VIDEO ANOMALY DETECTION
    Bergaoui, Khalil
    Naji, Yassine
    Setkov, Aleksandr
    Loesch, Angelique
    Gouiffes, Michele
    Audigier, Romaric
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2691 - 2695
  • [44] 3D-Convolutional Neural Network with Generative Adversarial Network and Autoencoder for Robust Anomaly Detection in Video Surveillance
    Shin, Wonsup
    Bu, Seok-Jun
    Cho, Sung-Bae
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2020, 30 (06)
  • [45] VIDEO ANOMALY DETECTION VIA PREDICTION NETWORK WITH ENHANCED SPATIO-TEMPORAL MEMORY EXCHANGE
    Shen, Guodong
    Ouyang, Yuqi
    Sanchez, Victor
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3728 - 3732
  • [46] AEMNet: Unsupervised Video Anomaly Detection Method Based on Attention-Enhanced Memory Networks
    Zhang, Linliang
    Yan, Lianshan
    Peng, Shouxin
    Pan, Lihu
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (08)
  • [47] Self-supervised memory-guided and attention feature fusion for video anomaly detection
    Jiang, Zitai
    Wang, Chuanxu
    Li, Jiajiong
    Zhao, Min
    Yang, Qingyang
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)
  • [48] Normal-abnormal negative impacts suppressing via normal feature memory for video anomaly detection
    Chen, Qihui
    Liang, Weijie
    Zhan, Yongzhao
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [49] Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches
    Shen, Guodong
    Ouyang, Yuqi
    Lu, Junru
    Yang, Yixuan
    Sanchez, Victor
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6865 - 6880
  • [50] Learning Appearance-Motion Synergy via Memory-Guided Event Prediction for Video Anomaly Detection
    Guo, Chongye
    Wang, Hongbo
    Xia, Yingjie
    Feng, Guorui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1519 - 1531