Detection of Violent Behavior Using Neural Networks and Pose Estimation

被引:15
作者
Kwan-Loo, Kevin B. [1 ]
Ortiz-Bayliss, Jose C. [1 ]
Conant-Pablos, Santiago E. [1 ]
Terashima-Marin, Hugo [1 ]
Rad, P. [2 ]
机构
[1] Tecnol Monterrey, Sch Engn & Sci, Monterrey 64849, Mexico
[2] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
关键词
Behavioral sciences; Databases; Pose estimation; Feature extraction; Cameras; Neural networks; Security; Computer vision; Violent behavior; pedestrian detection; tracking; pose estimation; neural networks; frame buffer; computer vision;
D O I
10.1109/ACCESS.2022.3198985
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Regarding safety and security, felonies and crimes with physical violence remain a significant problem worldwide. Some solutions for pedestrian safety are guards, police car patrolling, sensors, and security cameras. Nonetheless, these methods react only when the crime takes place. In the worst cases, the damage may be irreversible when it has already occurred. Therefore, numerous methods based on Artificial Intelligence have been proposed to solve this problem. Many approaches to detect violent behavior and action recognition rely on 3D convolutional neural networks (3D-CNNs), spatio-temporal models, long short-term memory networks, pose estimation, among other implementations. However, these approaches work in a limited fashion and have not been adapted to uncontrolled environments. Thus, a significant contribution from this work is the development of an innovative solution model capable of detecting violent behavior. This approach focuses on pedestrian detection, tracking, pose estimation, and neural networks to predict pedestrian behavior in video frames. Our proposal uses a time window frame to extract joint angles, given by the pose estimation algorithm, as features for classifying behavior. Another significant contribution of this work is the creation of a new database, Kranok-NV, with a total of 3,683 normal and violent videos. This database was used to train and test the solution model. For the evaluation, we designed a protocol using 10-fold cross-validation. We obtained an accuracy slightly above 98% on the Kranok-NV database with the implemented solution model. Although the proposed solution model detects violent and normal behavior, it can be easily extended to classify other types of behavior.
引用
收藏
页码:86339 / 86352
页数:14
相关论文
共 41 条
[21]  
Khot H., 2015, INT J EMERG ENG RES, V3, P109
[22]   Multi-Layered Deep Learning Features Fusion for Human Action Recognition [J].
Kiran, Sadia ;
Khan, Muhammad Attique ;
Javed, Muhammad Younus ;
Alhaisoni, Majed ;
Tariq, Usman ;
Nam, Yunyoung ;
Damasevicius, Robertas ;
Sharif, Muhammad .
CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 69 (03) :4061-4075
[23]  
Kochan C.., COMPUTE ROLLING AVER
[24]  
Kuehne H, 2011, IEEE I CONF COMP VIS, P2556, DOI 10.1109/ICCV.2011.6126543
[25]  
Kwan Chong Loo K. B., 2020, THESIS I TECNOLOGICO
[26]   Learning realistic human actions from movies [J].
Laptev, Ivan ;
Marszalek, Marcin ;
Schmid, Cordelia ;
Rozenfeld, Benjamin .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, :3222-+
[27]   2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning [J].
Luvizon, Diogo C. ;
Picard, David ;
Tabia, Hedi .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5137-5146
[28]  
Trong NP, 2017, 2017 56TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), P699, DOI 10.23919/SICE.2017.8105762
[29]  
Osokin D, 2018, Arxiv, DOI [arXiv:1811.12004, DOI 10.48550/ARXIV.1811.12004]
[30]  
Protecto, REAL STREET FIGHT