A novel spatio-temporal memory network for video anomaly detection

被引:1
作者
Li H. [1 ]
Chen M. [1 ]
机构
[1] School of Information Science and Technology, Nantong University, 9 Seyuan Road, Jiangsu, Nantong
基金
中国国家自然科学基金;
关键词
Auto-encoder; Feature extraction; Memory; Video anomaly detection;
D O I
10.1007/s11042-024-18957-8
中图分类号
学科分类号
摘要
Future frame prediction for anomaly detection methods based on memory networks have been extensively explored in the academic domain. Nevertheless, traditional memory-guided network techniques, which store dispersed spatial low-dimensional features, often fall short in delivering satisfactory results when applied to datasets characterized by variable scenes. This deficiency is evident in the frequent challenges faced during network convergence in the training process, resulting in unstable training outcomes. In response to this challenge, we introduce a novel Spatio-Temporal Memory Module, denoted as ST_MemAE. Our approach is designed to retain temporal correlation information within low-dimensional features, enhancing the representation of temporally closely linked features within the output of the encoder. Furthermore, we incorporate a homogeneous uncertainty function to optimize the balance of weights associated with multiple loss functions that are part of the memory module update process. As a result, our method offers improved stability in model training, faster convergence, and higher quality predictions of future frames. To validate the effectiveness of our approach, we conducted extensive experiments utilizing three distinct video anomaly detection datasets: UCSD Pedestrian 2, CUHK Avenue, and ShanghaiTech. The outcomes of these comprehensive experiments on publicly available datasets underscore the robustness of our method in accommodating diverse normal events while maintaining sensitivity to abnormal events. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:4603 / 4624
页数:21
相关论文
共 50 条
  • [31] PointSDA: Spatio-Temporal Deformable Attention Network for Point Cloud Video Modeling
    Sheng, Xiaoxiao
    Shen, Zhiqiang
    Xiao, Gang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (12): : 10946 - 10953
  • [32] Dual-branch network with memory for video anomaly detection
    Dicong Wang
    Qinghua Hu
    Kaijun Wu
    Multimedia Systems, 2023, 29 : 247 - 259
  • [33] Dual-branch network with memory for video anomaly detection
    Wang, Dicong
    Hu, Qinghua
    Wu, Kaijun
    MULTIMEDIA SYSTEMS, 2023, 29 (01) : 247 - 259
  • [34] FOAD: a novel video anomaly detection focusing on objects
    Li, Hongjun
    Chen, Jinyi
    Huang, Xiezhou
    Zhang, Yuxing
    Du, Yunlong
    Chen, Junjie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 20637 - 20651
  • [35] FOAD: a novel video anomaly detection focusing on objects
    Hongjun Li
    Jinyi Chen
    Xiezhou Huang
    Yuxing Zhang
    Yunlong Du
    Junjie Chen
    Multimedia Tools and Applications, 2024, 83 : 20637 - 20651
  • [36] A Spatio-Temporal Enhanced Graph-Transformer AutoEncoder embedded pose for anomaly detection
    Zhu, Honglei
    Wei, Pengjuan
    Xu, Zhigang
    IET COMPUTER VISION, 2024, 18 (03) : 405 - 419
  • [37] Spatial-temporal graph attention network for video anomaly detection
    Chen, Haoyang
    Mei, Xue
    Ma, Zhiyuan
    Wu, Xinhong
    Wei, Yachuan
    IMAGE AND VISION COMPUTING, 2023, 131
  • [38] Spatio-temporal Video Autoencoder for Human Action Recognition
    Sousa e Santos, Anderson Carlos
    Pedrini, Helio
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 114 - 123
  • [39] Background separation network for video anomaly detection
    Ye, Qing
    Song, Zihan
    Zhao, Yuqi
    Zhang, Yongmei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (03) : 6535 - 6551
  • [40] Video Text Tracking With a Spatio-Temporal Complementary Model
    Gao, Yuzhe
    Li, Xing
    Zhang, Jiajian
    Zhou, Yu
    Jin, Dian
    Wang, Jing
    Zhu, Shenggao
    Bai, Xiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9321 - 9331