Violent Physical Behavior Detection using 3D Spatio- Temporal Convolutional Neural Networks

被引:0
作者
Xu, Xiuhong [1 ]
Liao, Zhongming [2 ]
Xu, Zhaosheng [3 ]
机构
[1] Jiangxi New Energy Technol Vocat Coll, Coll Photovolta Power Generat, Xinyu 338004, Jiangxi, Peoples R China
[2] Xinyu Coll, Acad Affairs Off, Xinyu 338004, Jiangxi, Peoples R China
[3] Xinyu Coll, Sch Math & Comp Sci, Xinyu 338004, Jiangxi, Peoples R China
关键词
-Violence detection; surveillance cameras; 3D Convolutional Neural Network (3D CNN); Spatio-temporal convolution; deep learning; abnormal behavior; RECOGNITION;
D O I
10.14569/IJACSA.2023.0140891
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
use of surveillance cameras has made it possible to analyze a huge amount of data for automated surveillance. The use of security systems in schools, hotels, hospitals, and other security areas is required to identify the violent activities that can cause social, economic, and environmental damage. Detecting the mobile objects on each frame is a fundamental phase in the analysis of the video trail and the violence recognition. Therefore, a three-step approach is presented in this article. In our method, the separation of the frames containing the motion information and the detection of the violent behavior are applied at two levels of the network. First, the people in the video frames are identified by using a convolutional neural network. In the second step, a sequence of 16 frames containing the identified people is injected into the 3D CNN. Furthermore, we optimize the 3D CNN by using the visual inference and then a neural network optimization tool that transforms the pre-trained model into an average representation. Finally, this method uses the toolbox of OPENVINO to perform the optimization operations to increase the performance. To evaluate the accuracy of our algorithm, two datasets have been analyzed, which are: Violence in Movies and Hockey Fight. The results show that the final accuracy of this analysis is equal to 99.9% and 96% from each dataset.
引用
收藏
页码:829 / 836
页数:8
相关论文
共 24 条
[1]   Abnormal behavior recognition for intelligent video surveillance systems: A review [J].
Ben Mabrouk, Amira ;
Zagrouba, Ezzeddine .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 91 :480-491
[2]  
Nievas EB, 2011, LECT NOTES COMPUT SC, V6855, P332, DOI 10.1007/978-3-642-23678-5_39
[3]  
Bilinski P, 2016, 2016 13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), P30, DOI 10.1109/AVSS.2016.7738019
[4]  
Cao C., 2016, ACTION RECOGNITION J, V1, P3
[5]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497
[6]  
Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, DOI 10.48550/ARXIV.1704.04861]
[7]   Violence detection using Oriented VIolent Flows [J].
Gao, Yuan ;
Liu, Hong ;
Sun, Xiaohu ;
Wang, Can ;
Liu, Yi .
IMAGE AND VISION COMPUTING, 2016, 48-49 :37-41
[8]   Large-scale Video Classification with Convolutional Neural Networks [J].
Karpathy, Andrej ;
Toderici, George ;
Shetty, Sanketh ;
Leung, Thomas ;
Sukthankar, Rahul ;
Fei-Fei, Li .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1725-1732
[9]   Efficient CNN based summarization of surveillance videos for resource-constrained devices [J].
Muhammad, Khan ;
Hussain, Tanveer ;
Baik, Sung Wook .
PATTERN RECOGNITION LETTERS, 2020, 130 (130) :370-375
[10]   Measuring the crime displacement and diffusion of benefit effects of open-street CCTV in South Korea [J].
Park, Hyeon Ho ;
Oh, Gyeong Seok ;
Paek, Seung Yeop .
INTERNATIONAL JOURNAL OF LAW CRIME AND JUSTICE, 2012, 40 (03) :179-191