Criss-Cross Attention Based Auto Encoder for Video Anomaly Event Detection

被引：1

作者：

Wang, Jiaqi ^{[1
]}

Zhang, Jie ^{[2
]}

Ji, Genlin ^{[2
]}

Sheng, Bo ^{[3
]}

机构：

[1] Nanjing Normal Univ, Sch Math Sci, Nanjing 210023, Peoples R China

[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China

[3] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA

来源：

INTELLIGENT AUTOMATION AND SOFT COMPUTING | 2022年 / 34卷 / 03期

基金：

美国国家科学基金会;

关键词：

Video anomaly detection; bi-directional long short-term memory; convolutional autoencoder; Criss-Cross attention module; MIXTURES; NETWORKS;

D O I：

10.32604/iasc.2022.029535

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The surveillance applications generate enormous video data and present challenges to video analysis for huge human labor cost. Reconstruction-based convolutional autoencoders have achieved great success in video anomaly detection for their ability of automatically detecting abnormal event. The approaches learn normal patterns only with the normal data in an unsupervised way due to the difficulty of collecting anomaly samples and obtaining anomaly annotations. But convolutional autoencoders have limitations in global feature extraction for the local receptive field of convolutional kernels. What is more, 2-dimensional convolution lacks the capability of capturing temporal information while videos change over time. In this paper, we propose a method established on Criss-Cross attention based AutoEncoder (CCAE) for capturing global visual features of sequential video frames. The method utilizes Criss-Cross attention based encoder to extract global appearance features. Another Criss-Cross attention module is embedded into bi-directional convolutional long short-term memory in hidden layer to explore global temporal features between frames. A decoder is executed to fuse global appearance and temporal features and reconstruct the frames. We perform extensive experiments on two public datasets UCSD Ped2 and CUHK Avenue. The experimental results demonstrate that CCAE achieves superior detection accuracy compared with other video anomaly detection approaches.

引用

页码：1629 / 1642

页数：14

共 50 条

[21] Video anomaly detection using transformers and ensemble of convolutional auto-encoders
Rahimpour, Seyed Mohammad
Kazemi, Mohammad
Moallem, Payman
Safayani, Mehran
COMPUTERS & ELECTRICAL ENGINEERING, 2024, 120
[22] Attention-guided residual frame learning for video anomaly detection
Jun-Hyung Yu
Jeong-Hyeon Moon
Kyung-Ah Sohn
Multimedia Tools and Applications, 2023, 82 : 12099 - 12116
[23] Review on Deep Learning Approaches for Anomaly Event Detection in Video Surveillance
Jebur, Sabah Abdulazeez
Hussein, Khalid A.
Hoomod, Haider Kadhim
Alzubaidi, Laith
Santamaria, Jose
ELECTRONICS, 2023, 12 (01)
[24] AONet: Attention network with optional activation for unsupervised video anomaly detection
Rakhmonov, Akhrorjon Akhmadjon Ugli
Subramanian, Barathi
Varnousefaderani, Bahar Amirian
Kim, Jeonghong
ETRI JOURNAL, 2024, 46 (05) : 890 - 903
[25] Video anomaly detection based on cross-frame prediction mechanism and spatio-temporal memory-enhanced pseudo-3D encoder
Wen, Xiaopeng
Lai, Huicheng
Gao, Guxue
Xiao, Yang
Wang, Tongguan
Jia, Zhenhong
Wang, Liejun
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
[26] Video anomaly detection based on scene classification
Hongjun Li
Xulin Shen
Xiaohu Sun
Yunlong Wang
Chaobo Li
Junjie Chen
Multimedia Tools and Applications, 2023, 82 : 45345 - 45365
[27] Video anomaly detection based on scene classification
Li, Hongjun
Shen, Xulin
Sun, Xiaohu
Wang, Yunlong
Li, Chaobo
Chen, Junjie
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (29) : 45345 - 45365
[28] A GAN-Based Framework Combining Memory and Self-Attention Mechanisms for Video Anomaly Detection in Online Gaming Environments
Xiong L.-T.
Ou B.
Cheng Z.-P.
Computer-Aided Design and Applications, 2024, 21 (s5): : 91 - 105
[29] Novel video anomaly detection method based on global-local self-attention network
Yang J.
Wu C.
Zhou L.
Tongxin Xuebao/Journal on Communications, 2023, 44 (08): : 241 - 250
[30] MSAF: Multimodal Supervise-Attention Enhanced Fusion for Video Anomaly Detection
Wei, Donglai
Liu, Yang
Zhu, Xiaoguang
Liu, Jing
Zeng, Xinhua
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2178 - 2182

← 1 2 3 4 5 →