A novel spatio-temporal memory network for video anomaly detection

被引：1

作者：

Li H. ^{[1
]}

Chen M. ^{[1
]}

机构：

[1] School of Information Science and Technology, Nantong University, 9 Seyuan Road, Jiangsu, Nantong

来源：

Multimedia Tools and Applications | 2025年 / 84卷 / 8期

基金：

中国国家自然科学基金;

关键词：

Auto-encoder; Feature extraction; Memory; Video anomaly detection;

D O I：

10.1007/s11042-024-18957-8

中图分类号：

学科分类号：

摘要：

Future frame prediction for anomaly detection methods based on memory networks have been extensively explored in the academic domain. Nevertheless, traditional memory-guided network techniques, which store dispersed spatial low-dimensional features, often fall short in delivering satisfactory results when applied to datasets characterized by variable scenes. This deficiency is evident in the frequent challenges faced during network convergence in the training process, resulting in unstable training outcomes. In response to this challenge, we introduce a novel Spatio-Temporal Memory Module, denoted as ST_MemAE. Our approach is designed to retain temporal correlation information within low-dimensional features, enhancing the representation of temporally closely linked features within the output of the encoder. Furthermore, we incorporate a homogeneous uncertainty function to optimize the balance of weights associated with multiple loss functions that are part of the memory module update process. As a result, our method offers improved stability in model training, faster convergence, and higher quality predictions of future frames. To validate the effectiveness of our approach, we conducted extensive experiments utilizing three distinct video anomaly detection datasets: UCSD Pedestrian 2, CUHK Avenue, and ShanghaiTech. The outcomes of these comprehensive experiments on publicly available datasets underscore the robustness of our method in accommodating diverse normal events while maintaining sensitivity to abnormal events. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

引用

页码：4603 / 4624

页数：21

共 50 条

[31] PointSDA: Spatio-Temporal Deformable Attention Network for Point Cloud Video Modeling
Sheng, Xiaoxiao
Shen, Zhiqiang
Xiao, Gang
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (12): : 10946 - 10953
[32] Dual-branch network with memory for video anomaly detection
Dicong Wang
Qinghua Hu
Kaijun Wu
Multimedia Systems, 2023, 29 : 247 - 259
[33] Dual-branch network with memory for video anomaly detection
Wang, Dicong
Hu, Qinghua
Wu, Kaijun
MULTIMEDIA SYSTEMS, 2023, 29 (01) : 247 - 259
[34] FOAD: a novel video anomaly detection focusing on objects
Li, Hongjun
Chen, Jinyi
Huang, Xiezhou
Zhang, Yuxing
Du, Yunlong
Chen, Junjie
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 20637 - 20651
[35] FOAD: a novel video anomaly detection focusing on objects
Hongjun Li
Jinyi Chen
Xiezhou Huang
Yuxing Zhang
Yunlong Du
Junjie Chen
Multimedia Tools and Applications, 2024, 83 : 20637 - 20651
[36] A Spatio-Temporal Enhanced Graph-Transformer AutoEncoder embedded pose for anomaly detection
Zhu, Honglei
Wei, Pengjuan
Xu, Zhigang
IET COMPUTER VISION, 2024, 18 (03) : 405 - 419
[37] Spatial-temporal graph attention network for video anomaly detection
Chen, Haoyang
Mei, Xue
Ma, Zhiyuan
Wu, Xinhong
Wei, Yachuan
IMAGE AND VISION COMPUTING, 2023, 131
[38] Spatio-temporal Video Autoencoder for Human Action Recognition
Sousa e Santos, Anderson Carlos
Pedrini, Helio
PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 114 - 123
[39] Background separation network for video anomaly detection
Ye, Qing
Song, Zihan
Zhao, Yuqi
Zhang, Yongmei
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (03) : 6535 - 6551
[40] Video Text Tracking With a Spatio-Temporal Complementary Model
Gao, Yuzhe
Li, Xing
Zhang, Jiajian
Zhou, Yu
Jin, Dian
Wang, Jing
Zhu, Shenggao
Bai, Xiang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9321 - 9331

← 1 2 3 4 5 →