A novel spatio-temporal memory network for video anomaly detection

被引:1
作者
Li H. [1 ]
Chen M. [1 ]
机构
[1] School of Information Science and Technology, Nantong University, 9 Seyuan Road, Jiangsu, Nantong
基金
中国国家自然科学基金;
关键词
Auto-encoder; Feature extraction; Memory; Video anomaly detection;
D O I
10.1007/s11042-024-18957-8
中图分类号
学科分类号
摘要
Future frame prediction for anomaly detection methods based on memory networks have been extensively explored in the academic domain. Nevertheless, traditional memory-guided network techniques, which store dispersed spatial low-dimensional features, often fall short in delivering satisfactory results when applied to datasets characterized by variable scenes. This deficiency is evident in the frequent challenges faced during network convergence in the training process, resulting in unstable training outcomes. In response to this challenge, we introduce a novel Spatio-Temporal Memory Module, denoted as ST_MemAE. Our approach is designed to retain temporal correlation information within low-dimensional features, enhancing the representation of temporally closely linked features within the output of the encoder. Furthermore, we incorporate a homogeneous uncertainty function to optimize the balance of weights associated with multiple loss functions that are part of the memory module update process. As a result, our method offers improved stability in model training, faster convergence, and higher quality predictions of future frames. To validate the effectiveness of our approach, we conducted extensive experiments utilizing three distinct video anomaly detection datasets: UCSD Pedestrian 2, CUHK Avenue, and ShanghaiTech. The outcomes of these comprehensive experiments on publicly available datasets underscore the robustness of our method in accommodating diverse normal events while maintaining sensitivity to abnormal events. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:4603 / 4624
页数:21
相关论文
共 50 条
  • [21] MULTI-SCALE ANALYSIS OF CONTEXTUAL INFORMATION WITHIN SPATIO-TEMPORAL VIDEO VOLUMES FOR ANOMALY DETECTION
    Li, Nannan
    Guo, Huiwen
    Xu, Dan
    Wu, Xinyu
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2363 - 2367
  • [22] Memory-Augmented Spatial-Temporal Consistency Network for Video Anomaly Detection
    Li, Zhangxun
    Zhao, Mengyang
    Zeng, Xinhua
    Wang, Tian
    Pang, Chengxin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 95 - 107
  • [23] Video anomaly detection based on multi-scale optical flow spatio-temporal enhancement and normality mining
    He, Qiang
    Shi, Ruinian
    Chen, Linlin
    Huo, Lianzhi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (03) : 1873 - 1888
  • [24] Feature selection algorithm assisted residual channel attention spatio-temporal auto encoder for video anomaly detection
    Prasudha, M. Lakshmi
    Sukhavasi, Vidyullatha
    Neha, Kandula
    Lunawat, Poonam Shaylesh
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [25] Video anomaly detection based on cross-frame prediction mechanism and spatio-temporal memory-enhanced pseudo-3D encoder
    Wen, Xiaopeng
    Lai, Huicheng
    Gao, Guxue
    Xiao, Yang
    Wang, Tongguan
    Jia, Zhenhong
    Wang, Liejun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [26] STVDNet: spatio-temporal interactive video de-raining network
    Ouyang, Ze
    Zhao, Huihuang
    Zhang, Yudong
    Chen, Long
    VISUAL COMPUTER, 2025, 41 (04) : 2767 - 2782
  • [27] Video Fingerprint Algorithm Based on Spatio-Temporal Deep Neural Network
    Wang Dongdong
    Li Yuenan
    LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (01)
  • [28] Video Super-Resolution via a Spatio-Temporal Alignment Network
    Wen, Weilei
    Ren, Wenqi
    Shi, Yinghuan
    Nie, Yunfeng
    Zhang, Jingang
    Cao, Xiaochun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1761 - 1773
  • [29] PointSDA: Spatio-Temporal Deformable Attention Network for Point Cloud Video Modeling
    Sheng, Xiaoxiao
    Shen, Zhiqiang
    Xiao, Gang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (12): : 10946 - 10953
  • [30] DSTA-Net: Deformable Spatio-Temporal Attention Network for Video Inpainting
    Liu, Tongxing
    Qiu, Guoxin
    Xuan, Hanyu
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 771 - 775