A novel spatio-temporal memory network for video anomaly detection

被引：1

作者：

Li H. ^{[1
]}

Chen M. ^{[1
]}

机构：

[1] School of Information Science and Technology, Nantong University, 9 Seyuan Road, Jiangsu, Nantong

来源：

Multimedia Tools and Applications | 2025年 / 84卷 / 8期

基金：

中国国家自然科学基金;

关键词：

Auto-encoder; Feature extraction; Memory; Video anomaly detection;

D O I：

10.1007/s11042-024-18957-8

中图分类号：

学科分类号：

摘要：

Future frame prediction for anomaly detection methods based on memory networks have been extensively explored in the academic domain. Nevertheless, traditional memory-guided network techniques, which store dispersed spatial low-dimensional features, often fall short in delivering satisfactory results when applied to datasets characterized by variable scenes. This deficiency is evident in the frequent challenges faced during network convergence in the training process, resulting in unstable training outcomes. In response to this challenge, we introduce a novel Spatio-Temporal Memory Module, denoted as ST_MemAE. Our approach is designed to retain temporal correlation information within low-dimensional features, enhancing the representation of temporally closely linked features within the output of the encoder. Furthermore, we incorporate a homogeneous uncertainty function to optimize the balance of weights associated with multiple loss functions that are part of the memory module update process. As a result, our method offers improved stability in model training, faster convergence, and higher quality predictions of future frames. To validate the effectiveness of our approach, we conducted extensive experiments utilizing three distinct video anomaly detection datasets: UCSD Pedestrian 2, CUHK Avenue, and ShanghaiTech. The outcomes of these comprehensive experiments on publicly available datasets underscore the robustness of our method in accommodating diverse normal events while maintaining sensitivity to abnormal events. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

引用

页码：4603 / 4624

页数：21

共 50 条

[21] MULTI-SCALE ANALYSIS OF CONTEXTUAL INFORMATION WITHIN SPATIO-TEMPORAL VIDEO VOLUMES FOR ANOMALY DETECTION
Li, Nannan
Guo, Huiwen
Xu, Dan
Wu, Xinyu
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2363 - 2367
[22] Memory-Augmented Spatial-Temporal Consistency Network for Video Anomaly Detection
Li, Zhangxun
Zhao, Mengyang
Zeng, Xinhua
Wang, Tian
Pang, Chengxin
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 95 - 107
[23] Video anomaly detection based on multi-scale optical flow spatio-temporal enhancement and normality mining
He, Qiang
Shi, Ruinian
Chen, Linlin
Huo, Lianzhi
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (03) : 1873 - 1888
[24] Feature selection algorithm assisted residual channel attention spatio-temporal auto encoder for video anomaly detection
Prasudha, M. Lakshmi
Sukhavasi, Vidyullatha
Neha, Kandula
Lunawat, Poonam Shaylesh
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
[25] Video anomaly detection based on cross-frame prediction mechanism and spatio-temporal memory-enhanced pseudo-3D encoder
Wen, Xiaopeng
Lai, Huicheng
Gao, Guxue
Xiao, Yang
Wang, Tongguan
Jia, Zhenhong
Wang, Liejun
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
[26] STVDNet: spatio-temporal interactive video de-raining network
Ouyang, Ze
Zhao, Huihuang
Zhang, Yudong
Chen, Long
VISUAL COMPUTER, 2025, 41 (04) : 2767 - 2782
[27] Video Fingerprint Algorithm Based on Spatio-Temporal Deep Neural Network
Wang Dongdong
Li Yuenan
LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (01)
[28] Video Super-Resolution via a Spatio-Temporal Alignment Network
Wen, Weilei
Ren, Wenqi
Shi, Yinghuan
Nie, Yunfeng
Zhang, Jingang
Cao, Xiaochun
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1761 - 1773
[29] PointSDA: Spatio-Temporal Deformable Attention Network for Point Cloud Video Modeling
Sheng, Xiaoxiao
Shen, Zhiqiang
Xiao, Gang
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (12): : 10946 - 10953
[30] DSTA-Net: Deformable Spatio-Temporal Attention Network for Video Inpainting
Liu, Tongxing
Qiu, Guoxin
Xuan, Hanyu
IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 771 - 775

← 1 2 3 4 5 →