Detection of Violent Behavior Using Neural Networks and Pose Estimation

被引:15
作者
Kwan-Loo, Kevin B. [1 ]
Ortiz-Bayliss, Jose C. [1 ]
Conant-Pablos, Santiago E. [1 ]
Terashima-Marin, Hugo [1 ]
Rad, P. [2 ]
机构
[1] Tecnol Monterrey, Sch Engn & Sci, Monterrey 64849, Mexico
[2] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
关键词
Behavioral sciences; Databases; Pose estimation; Feature extraction; Cameras; Neural networks; Security; Computer vision; Violent behavior; pedestrian detection; tracking; pose estimation; neural networks; frame buffer; computer vision;
D O I
10.1109/ACCESS.2022.3198985
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Regarding safety and security, felonies and crimes with physical violence remain a significant problem worldwide. Some solutions for pedestrian safety are guards, police car patrolling, sensors, and security cameras. Nonetheless, these methods react only when the crime takes place. In the worst cases, the damage may be irreversible when it has already occurred. Therefore, numerous methods based on Artificial Intelligence have been proposed to solve this problem. Many approaches to detect violent behavior and action recognition rely on 3D convolutional neural networks (3D-CNNs), spatio-temporal models, long short-term memory networks, pose estimation, among other implementations. However, these approaches work in a limited fashion and have not been adapted to uncontrolled environments. Thus, a significant contribution from this work is the development of an innovative solution model capable of detecting violent behavior. This approach focuses on pedestrian detection, tracking, pose estimation, and neural networks to predict pedestrian behavior in video frames. Our proposal uses a time window frame to extract joint angles, given by the pose estimation algorithm, as features for classifying behavior. Another significant contribution of this work is the creation of a new database, Kranok-NV, with a total of 3,683 normal and violent videos. This database was used to train and test the solution model. For the evaluation, we designed a protocol using 10-fold cross-validation. We obtained an accuracy slightly above 98% on the Kranok-NV database with the implemented solution model. Although the proposed solution model detects violent and normal behavior, it can be easily extended to classify other types of behavior.
引用
收藏
页码:86339 / 86352
页数:14
相关论文
共 41 条
[11]  
Chen F.., REALTIME ACTION RECO
[12]  
Chen JW, 2020, Arxiv, DOI arXiv:2008.01057
[13]   Inflated 3D ConvNet context analysis for violence detection [J].
Freire-Obregon, David ;
Barra, Paola ;
Castrillon-Santana, Modesto ;
De Marsico, Maria .
MACHINE VISION AND APPLICATIONS, 2022, 33 (01)
[14]   An attention recurrent model for human cooperation detection [J].
Freire-Obregon, David ;
Castrillon-Santana, Modesto ;
Barra, Paola ;
Bisogni, Carmen ;
Nappi, Michele .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 197
[15]  
G. Police, REV ASS
[16]   Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? [J].
Hara, Kensho ;
Kataoka, Hirokatsu ;
Satoh, Yutaka .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6546-6555
[17]  
ITALYKICKBOXING, K1 KICKB THAIB TRAIN
[18]  
Jhuang H, 2007, IEEE I CONF COMP VIS, P1253
[19]   3D Convolutional Neural Networks for Human Action Recognition [J].
Ji, Shuiwang ;
Xu, Wei ;
Yang, Ming ;
Yu, Kai .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) :221-231
[20]  
Kay W, 2017, Arxiv, DOI [arXiv:1705.06950, DOI 10.48550/ARXIV.1705.06950, 10.48550/arXiv.1705.06950]