Learning Spatiotemporal Features With 3DCNN and ConvGRU for Video Anomaly Detection

被引：0

作者：

Wang, Xin ^{[1
]}

Xie, Weixin ^{[1
]}

Song, Jiayi ^{[1
]}

机构：

[1] Shenzhen Univ, ATR Natl Key Lab Def Technol, Shenzhen, Peoples R China

来源：

PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) | 2018年

关键词：

3DCNN; ConvGRU; Video anomaly detection;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Video anomaly detection aims to analyze the abnormal events or behaviors from massive monitoring video data, which is extremely challenging due to the ambiguous definition of abnormal behavior and the complex monitoring scene. Feature representation based on the hand-crafted of video local spatial area is more complicated, and it is difficult to learn the essential feature from the input video. In this paper, a deep autoencoder network combined with 3DCNN and ConvGRU is proposed to learn the spatiotemporal features for video anomaly. Firstly, 3DCNN and bidirectional ConvGRU are used to encode the local-global spatial features and short-long-term temporal features in the spatiotemporal dimension. Secondly, the reconstruction branch is introduced to reconstruct video frames, while the prediction branch is utilized to make the encoder to learn the better spatiotemporal feature at the training phase. In addition, the regularization of adjacent frames in a loss function is carried on to improve the temporal feature. The weights of the C3D model trained by action recognition are transferred to 3DCNN to prevent model over fitting. Experiments on real anomaly datasets shows the effectiveness of our proposed deep model.

引用

页码：474 / 479

页数：6

共 50 条

[41] Dy-MIL: dynamic multiple-instance learning framework for video anomaly detection
Chen Li
Mo Chen
Multimedia Systems, 2024, 30
[42] Weakly Supervised Video Anomaly Detection via Transformer-Enabled Temporal Relation Learning
Zhang, Dasheng
Huang, Chao
Liu, Chengliang
Xu, Yong
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1197 - 1201
[43] Weakly Supervised Video Anomaly Detection Based on 3D Convolution and LSTM
Ma, Zhen
Machado, Jose J. M.
Tavares, Joao Manuel R. S.
SENSORS, 2021, 21 (22)
[44] LEARNING SPATIO-TEMPORAL RELATIONS WITH MULTI-SCALE INTEGRATED PERCEPTION FOR VIDEO ANOMALY DETECTION
Ye, Hongyu
Xu, Ke
Jiang, Xinghao
Sun, Tanfeng
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4020 - 4024
[45] Adversarial erasure network based on multi-instance learning for weakly supervised video anomaly detection
Song, Xin
Liu, Penghui
Li, Suyuan
Xu, Siyang
Wang, Ke
NEUROCOMPUTING, 2025, 636
[46] A critical study on the recent deep learning based semi-supervised video anomaly detection methods
Baradaran, Mohammad
Bergevin, Robert
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27761 - 27807
[47] Video Anomaly Detection via self-supervised and spatio-temporal proxy tasks learning
Yang, Qingyang
Wang, Chuanxu
Liu, Peng
Jiang, Zitai
Li, Jiajiong
PATTERN RECOGNITION, 2025, 158
[48] A critical study on the recent deep learning based semi-supervised video anomaly detection methods
Mohammad Baradaran
Robert Bergevin
Multimedia Tools and Applications, 2024, 83 : 27761 - 27807
[49] Multi-level feature splicing 3D network based on multi-task joint learning for video anomaly detection
Li, Yang
Tong, Guoxiang
NEUROCOMPUTING, 2025, 636
[50] Learning Appearance-Motion Synergy via Memory-Guided Event Prediction for Video Anomaly Detection
Guo, Chongye
Wang, Hongbo
Xia, Yingjie
Feng, Guorui
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1519 - 1531

← 1 2 3 4 5 →