Learning Spatiotemporal Features With 3DCNN and ConvGRU for Video Anomaly Detection

被引:0
|
作者
Wang, Xin [1 ]
Xie, Weixin [1 ]
Song, Jiayi [1 ]
机构
[1] Shenzhen Univ, ATR Natl Key Lab Def Technol, Shenzhen, Peoples R China
来源
PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) | 2018年
关键词
3DCNN; ConvGRU; Video anomaly detection;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video anomaly detection aims to analyze the abnormal events or behaviors from massive monitoring video data, which is extremely challenging due to the ambiguous definition of abnormal behavior and the complex monitoring scene. Feature representation based on the hand-crafted of video local spatial area is more complicated, and it is difficult to learn the essential feature from the input video. In this paper, a deep autoencoder network combined with 3DCNN and ConvGRU is proposed to learn the spatiotemporal features for video anomaly. Firstly, 3DCNN and bidirectional ConvGRU are used to encode the local-global spatial features and short-long-term temporal features in the spatiotemporal dimension. Secondly, the reconstruction branch is introduced to reconstruct video frames, while the prediction branch is utilized to make the encoder to learn the better spatiotemporal feature at the training phase. In addition, the regularization of adjacent frames in a loss function is carried on to improve the temporal feature. The weights of the C3D model trained by action recognition are transferred to 3DCNN to prevent model over fitting. Experiments on real anomaly datasets shows the effectiveness of our proposed deep model.
引用
收藏
页码:474 / 479
页数:6
相关论文
共 50 条
  • [41] Dy-MIL: dynamic multiple-instance learning framework for video anomaly detection
    Chen Li
    Mo Chen
    Multimedia Systems, 2024, 30
  • [42] Weakly Supervised Video Anomaly Detection via Transformer-Enabled Temporal Relation Learning
    Zhang, Dasheng
    Huang, Chao
    Liu, Chengliang
    Xu, Yong
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1197 - 1201
  • [43] Weakly Supervised Video Anomaly Detection Based on 3D Convolution and LSTM
    Ma, Zhen
    Machado, Jose J. M.
    Tavares, Joao Manuel R. S.
    SENSORS, 2021, 21 (22)
  • [44] LEARNING SPATIO-TEMPORAL RELATIONS WITH MULTI-SCALE INTEGRATED PERCEPTION FOR VIDEO ANOMALY DETECTION
    Ye, Hongyu
    Xu, Ke
    Jiang, Xinghao
    Sun, Tanfeng
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4020 - 4024
  • [45] Adversarial erasure network based on multi-instance learning for weakly supervised video anomaly detection
    Song, Xin
    Liu, Penghui
    Li, Suyuan
    Xu, Siyang
    Wang, Ke
    NEUROCOMPUTING, 2025, 636
  • [46] A critical study on the recent deep learning based semi-supervised video anomaly detection methods
    Baradaran, Mohammad
    Bergevin, Robert
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27761 - 27807
  • [47] Video Anomaly Detection via self-supervised and spatio-temporal proxy tasks learning
    Yang, Qingyang
    Wang, Chuanxu
    Liu, Peng
    Jiang, Zitai
    Li, Jiajiong
    PATTERN RECOGNITION, 2025, 158
  • [48] A critical study on the recent deep learning based semi-supervised video anomaly detection methods
    Mohammad Baradaran
    Robert Bergevin
    Multimedia Tools and Applications, 2024, 83 : 27761 - 27807
  • [49] Multi-level feature splicing 3D network based on multi-task joint learning for video anomaly detection
    Li, Yang
    Tong, Guoxiang
    NEUROCOMPUTING, 2025, 636
  • [50] Learning Appearance-Motion Synergy via Memory-Guided Event Prediction for Video Anomaly Detection
    Guo, Chongye
    Wang, Hongbo
    Xia, Yingjie
    Feng, Guorui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1519 - 1531