Criss-Cross Attention Based Auto Encoder for Video Anomaly Event Detection

被引:1
作者
Wang, Jiaqi [1 ]
Zhang, Jie [2 ]
Ji, Genlin [2 ]
Sheng, Bo [3 ]
机构
[1] Nanjing Normal Univ, Sch Math Sci, Nanjing 210023, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China
[3] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA
基金
美国国家科学基金会;
关键词
Video anomaly detection; bi-directional long short-term memory; convolutional autoencoder; Criss-Cross attention module; MIXTURES; NETWORKS;
D O I
10.32604/iasc.2022.029535
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The surveillance applications generate enormous video data and present challenges to video analysis for huge human labor cost. Reconstruction-based convolutional autoencoders have achieved great success in video anomaly detection for their ability of automatically detecting abnormal event. The approaches learn normal patterns only with the normal data in an unsupervised way due to the difficulty of collecting anomaly samples and obtaining anomaly annotations. But convolutional autoencoders have limitations in global feature extraction for the local receptive field of convolutional kernels. What is more, 2-dimensional convolution lacks the capability of capturing temporal information while videos change over time. In this paper, we propose a method established on Criss-Cross attention based AutoEncoder (CCAE) for capturing global visual features of sequential video frames. The method utilizes Criss-Cross attention based encoder to extract global appearance features. Another Criss-Cross attention module is embedded into bi-directional convolutional long short-term memory in hidden layer to explore global temporal features between frames. A decoder is executed to fuse global appearance and temporal features and reconstruct the frames. We perform extensive experiments on two public datasets UCSD Ped2 and CUHK Avenue. The experimental results demonstrate that CCAE achieves superior detection accuracy compared with other video anomaly detection approaches.
引用
收藏
页码:1629 / 1642
页数:14
相关论文
共 50 条
  • [21] Video anomaly detection using transformers and ensemble of convolutional auto-encoders
    Rahimpour, Seyed Mohammad
    Kazemi, Mohammad
    Moallem, Payman
    Safayani, Mehran
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 120
  • [22] Attention-guided residual frame learning for video anomaly detection
    Jun-Hyung Yu
    Jeong-Hyeon Moon
    Kyung-Ah Sohn
    Multimedia Tools and Applications, 2023, 82 : 12099 - 12116
  • [23] Review on Deep Learning Approaches for Anomaly Event Detection in Video Surveillance
    Jebur, Sabah Abdulazeez
    Hussein, Khalid A.
    Hoomod, Haider Kadhim
    Alzubaidi, Laith
    Santamaria, Jose
    ELECTRONICS, 2023, 12 (01)
  • [24] AONet: Attention network with optional activation for unsupervised video anomaly detection
    Rakhmonov, Akhrorjon Akhmadjon Ugli
    Subramanian, Barathi
    Varnousefaderani, Bahar Amirian
    Kim, Jeonghong
    ETRI JOURNAL, 2024, 46 (05) : 890 - 903
  • [25] Video anomaly detection based on cross-frame prediction mechanism and spatio-temporal memory-enhanced pseudo-3D encoder
    Wen, Xiaopeng
    Lai, Huicheng
    Gao, Guxue
    Xiao, Yang
    Wang, Tongguan
    Jia, Zhenhong
    Wang, Liejun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [26] Video anomaly detection based on scene classification
    Hongjun Li
    Xulin Shen
    Xiaohu Sun
    Yunlong Wang
    Chaobo Li
    Junjie Chen
    Multimedia Tools and Applications, 2023, 82 : 45345 - 45365
  • [27] Video anomaly detection based on scene classification
    Li, Hongjun
    Shen, Xulin
    Sun, Xiaohu
    Wang, Yunlong
    Li, Chaobo
    Chen, Junjie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (29) : 45345 - 45365
  • [28] A GAN-Based Framework Combining Memory and Self-Attention Mechanisms for Video Anomaly Detection in Online Gaming Environments
    Xiong L.-T.
    Ou B.
    Cheng Z.-P.
    Computer-Aided Design and Applications, 2024, 21 (s5): : 91 - 105
  • [29] Novel video anomaly detection method based on global-local self-attention network
    Yang J.
    Wu C.
    Zhou L.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (08): : 241 - 250
  • [30] MSAF: Multimodal Supervise-Attention Enhanced Fusion for Video Anomaly Detection
    Wei, Donglai
    Liu, Yang
    Zhu, Xiaoguang
    Liu, Jing
    Zeng, Xinhua
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2178 - 2182