Violence detection in crowd videos using nuanced facial expression analysis

被引:0
作者
Sreenu, G. [1 ]
Durai, M. A. Saleem [1 ]
机构
[1] VIT, Vellore, India
来源
SYSTEMS AND SOFT COMPUTING | 2024年 / 6卷
关键词
Crowd analysis; Surveillance video; Face detection; Expression identification; Violence detection; CRBM; Dropout regularization; Logistic regression; Maximum likelihood estimation;
D O I
10.1016/j.sasc.2024.200104
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video analysis for violence detection is crucial, especially when dealing with crowd data, where the potential for severe mob attacks in sensitive areas is high. This paper proposes a solution utilizing Convolutional Restricted Boltzmann Machine (CRBM) for video analysis, integrating the strengths of Convolutional Neural Network (CNN) and Restricted Boltzmann Machine (RBM). By focusing on image patches rather than entire frames, the method addresses the challenge of object detection in crowded scenes. The CRBM combines deep-level image analysis from CNN with unsupervised feature extraction in RBM, facilitated by image convolution using Gabor filters in the hidden layer. Dropout regularization mitigates overfitting, enhancing model generality. Extracted features are inputted into an SVM classifier for face detection and a custom VGG16 model for emotion identification. Event probability is then determined through logistic regression based on facial expressions. Despite existing approaches for smart crowd behaviour identification, there remains a tradeoff between accuracy and processing time. Our proposed solution addresses this by employing proper frame preprocessing techniques for feature extraction. Validation using quantitative and qualitative metrics confirms the effectiveness of the approach.
引用
收藏
页数:12
相关论文
共 25 条
[1]   Timed-image based deep learning for action recognition in video sequences [J].
Atto, Abdourrahmane Mahamane ;
Benoit, Alexandre ;
Lambert, Patrick .
PATTERN RECOGNITION, 2020, 104
[2]  
Carreira-Perpinan Miguel A., 2005, INT C ART INT STAT
[3]   Anomaly detection in surveillance video based on bidirectional prediction [J].
Chen, Dongyue ;
Wang, Pengtao ;
Yue, Lingyi ;
Zhang, Yuxin ;
Jia, Tong .
IMAGE AND VISION COMPUTING, 2020, 98 (98)
[4]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[5]  
Febrianti R., 2021, Journal of Physics: Conference Series, V1725, DOI 10.1088/1742-6596/1725/1/012014
[6]  
Gkountakos Konstantinos, 2020, ICMR '20: Proceedings of the 2020 International Conference on Multimedia Retrieval, P276, DOI 10.1145/3372278.3390725
[7]  
Hao Wu, 2020, Journal of Physics: Conference Series, V1486, DOI 10.1088/1742-6596/1486/5/052026
[8]  
Hassner T., 2012, COMPUTER VISION PATT, DOI DOI 10.1109/CVPRW.2012.6239348
[9]   Context Based Emotion Recognition Using EMOTIC Dataset [J].
Kosti, Ronak ;
Alvarez, Jose M. ;
Recasens, Adria ;
Lapedriza, Agata .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (11) :2755-2766
[10]  
Kumar RekhilM., 2014, Int. J. Comput. Sci. Inf. Technol., V5, P7668