Detection of Violent Behavior Using Neural Networks and Pose Estimation

被引:15
作者
Kwan-Loo, Kevin B. [1 ]
Ortiz-Bayliss, Jose C. [1 ]
Conant-Pablos, Santiago E. [1 ]
Terashima-Marin, Hugo [1 ]
Rad, P. [2 ]
机构
[1] Tecnol Monterrey, Sch Engn & Sci, Monterrey 64849, Mexico
[2] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
关键词
Behavioral sciences; Databases; Pose estimation; Feature extraction; Cameras; Neural networks; Security; Computer vision; Violent behavior; pedestrian detection; tracking; pose estimation; neural networks; frame buffer; computer vision;
D O I
10.1109/ACCESS.2022.3198985
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Regarding safety and security, felonies and crimes with physical violence remain a significant problem worldwide. Some solutions for pedestrian safety are guards, police car patrolling, sensors, and security cameras. Nonetheless, these methods react only when the crime takes place. In the worst cases, the damage may be irreversible when it has already occurred. Therefore, numerous methods based on Artificial Intelligence have been proposed to solve this problem. Many approaches to detect violent behavior and action recognition rely on 3D convolutional neural networks (3D-CNNs), spatio-temporal models, long short-term memory networks, pose estimation, among other implementations. However, these approaches work in a limited fashion and have not been adapted to uncontrolled environments. Thus, a significant contribution from this work is the development of an innovative solution model capable of detecting violent behavior. This approach focuses on pedestrian detection, tracking, pose estimation, and neural networks to predict pedestrian behavior in video frames. Our proposal uses a time window frame to extract joint angles, given by the pose estimation algorithm, as features for classifying behavior. Another significant contribution of this work is the creation of a new database, Kranok-NV, with a total of 3,683 normal and violent videos. This database was used to train and test the solution model. For the evaluation, we designed a protocol using 10-fold cross-validation. We obtained an accuracy slightly above 98% on the Kranok-NV database with the implemented solution model. Although the proposed solution model detects violent and normal behavior, it can be easily extended to classify other types of behavior.
引用
收藏
页码:86339 / 86352
页数:14
相关论文
共 41 条
[1]  
Abdali Al-Maamoon R., 2019, 2019 2nd Scientific Conference of Computer Sciences (SCCS), P104, DOI 10.1109/SCCS.2019.8852616
[2]  
Akti S., 2019, P 9 INT C IM PROC TH, P1, DOI DOI 10.1109/IPTA.2019.8936070
[3]   2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[4]  
Nievas EB, 2011, LECT NOTES COMPUT SC, V6855, P332, DOI 10.1007/978-3-642-23678-5_39
[5]  
Brownlee J.., MOVING AVERAGE SMOOT
[6]  
C. People, CCTV PEOPL DEM 2
[7]   OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields [J].
Cao, Zhe ;
Hidalgo, Gines ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) :172-186
[8]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[9]  
Carreira J., 2018, arXiv
[10]  
Carreira J, 2019, Arxiv, DOI arXiv:1907.06987