Violence detection in crowd videos using nuanced facial expression analysis

被引：0

作者：

Sreenu, G. ^{[1
]}

Durai, M. A. Saleem ^{[1
]}

机构：

[1] VIT, Vellore, India

来源：

SYSTEMS AND SOFT COMPUTING | 2024年 / 6卷

关键词：

Crowd analysis; Surveillance video; Face detection; Expression identification; Violence detection; CRBM; Dropout regularization; Logistic regression; Maximum likelihood estimation;

D O I：

10.1016/j.sasc.2024.200104

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video analysis for violence detection is crucial, especially when dealing with crowd data, where the potential for severe mob attacks in sensitive areas is high. This paper proposes a solution utilizing Convolutional Restricted Boltzmann Machine (CRBM) for video analysis, integrating the strengths of Convolutional Neural Network (CNN) and Restricted Boltzmann Machine (RBM). By focusing on image patches rather than entire frames, the method addresses the challenge of object detection in crowded scenes. The CRBM combines deep-level image analysis from CNN with unsupervised feature extraction in RBM, facilitated by image convolution using Gabor filters in the hidden layer. Dropout regularization mitigates overfitting, enhancing model generality. Extracted features are inputted into an SVM classifier for face detection and a custom VGG16 model for emotion identification. Event probability is then determined through logistic regression based on facial expressions. Despite existing approaches for smart crowd behaviour identification, there remains a tradeoff between accuracy and processing time. Our proposed solution addresses this by employing proper frame preprocessing techniques for feature extraction. Validation using quantitative and qualitative metrics confirms the effectiveness of the approach.

引用

页数：12

共 25 条

[1]

[Anonymous], 2005, INT C ART INT STAT

[2] Timed-image based deep learning for action recognition in video sequences [J].