Anomaly Detection Based on a 3D Convolutional Neural Network Combining Convolutional Block Attention Module Using Merged Frames

被引:10
作者
Hwang, In-Chang [1 ]
Kang, Hyun-Soo [1 ]
机构
[1] Chungbuk Natl Univ, Sch Elect & Comp Engn, Dept Informat & Commun Engn, Cheongju 28644, South Korea
关键词
video surveillance; convolutional block attention module; area under the curve; equal error rate; computer vision; UBI-Fights; 3D convolution; VIOLENCE DETECTION; EVENT DETECTION; VIDEO;
D O I
10.3390/s23239616
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
With the recent rise in violent crime, the real-time situation analysis capabilities of the prevalent closed-circuit television have been employed for the deterrence and resolution of criminal activities. Anomaly detection can identify abnormal instances such as violence within the patterns of a specified dataset; however, it faces challenges in that the dataset for abnormal situations is smaller than that for normal situations. Herein, using datasets such as UBI-Fights, RWF-2000, and UCSD Ped1 and Ped2, anomaly detection was approached as a binary classification problem. Frames extracted from each video with annotation were reconstructed into a limited number of images of 3x3, 4x3, 4x4, 5x3 sizes using the method proposed in this paper, forming an input data structure similar to a light field and patch of vision transformer. The model was constructed by applying a convolutional block attention module that included channel and spatial attention modules to a residual neural network with depths of 10, 18, 34, and 50 in the form of a three-dimensional convolution. The proposed model performed better than existing models in detecting abnormal behavior such as violent acts in videos. For instance, with the undersampled UBI-Fights dataset, our network achieved an accuracy of 0.9933, a loss value of 0.0010, an area under the curve of 0.9973, and an equal error rate of 0.0027. These results may contribute significantly to solve real-world issues such as the detection of violent behavior in artificial intelligence systems using computer vision and real-time video monitoring.
引用
收藏
页数:24
相关论文
共 85 条
[41]   Data clustering: 50 years beyond K-means [J].
Jain, Anil K. .
PATTERN RECOGNITION LETTERS, 2010, 31 (08) :651-666
[42]  
Jetley S, 2018, Arxiv, DOI arXiv:1804.02391
[43]   Multiple instance-based video anomaly detection using deep temporal encoding-decoding [J].
Kamoona, Ammar Mansoor ;
Gostar, Amirali Khodadadian ;
Bab-Hadiashar, Alireza ;
Hoseinnezhad, Reza .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 214
[44]   Video Anomaly Detection using Inflated 3D Convolution Network [J].
Koshti, Dipali ;
Kamoji, Supriya ;
Kalnad, Nehal ;
Sreekumar, Suyash ;
Bhujbal, Shreya .
PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020), 2020, :729-733
[45]  
Levoy M., 1996, Computer Graphics Proceedings. SIGGRAPH '96, P31, DOI 10.1145/237170.237199
[46]   Context-related video anomaly detection via generative adversarial network [J].
Li, Daoheng ;
Nie, Xiushan ;
Li, Xiaofeng ;
Zhang, Yu ;
Yin, Yilong .
PATTERN RECOGNITION LETTERS, 2022, 156 :183-189
[47]   Future Frame Prediction for Anomaly Detection - A New Baseline [J].
Liu, Wen ;
Luo, Weixin ;
Lian, Dongze ;
Gao, Shenghua .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6536-6545
[48]   Diversity-Measurable Anomaly Detection [J].
Liu, Wenrui ;
Chang, Hong ;
Ma, Bingpeng ;
Shan, Shiguang ;
Chen, Xilin .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :12147-12156
[49]   Abnormal Event Detection at 150 FPS in MATLAB [J].
Lu, Cewu ;
Shi, Jianping ;
Jia, Jiaya .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :2720-2727
[50]  
Luo WX, 2017, IEEE INT CON MULTI, P439, DOI 10.1109/ICME.2017.8019325