Real-Time Violent Action Recognition Using Key Frames Extraction and Deep Learning

Cited by: 19
Authors
Ahmed, Muzamil [1 ,2 ]
Ramzan, Muhammad [3 ,4 ]
Khan, Hikmat Ullah [2 ]
Iqbal, Saqib [5 ]
Khan, Muhammad Attique [6 ]
Choi, Jung-In [7 ]
Nam, Yunyoung [8 ]
Kadry, Seifedine [9 ]
Affiliations
[1] Univ Lahore, Dept Comp Sci & Informat Technol, Sargodha Campus, Sargodha 40100, Pakistan
[2] COMSATS Univ Islamabad, Dept Comp Sci, Wah Campus, Wah Cantt 47040, Pakistan
[3] Univ Management & Technol, Sch Syst & Technol, Lahore 54782, Pakistan
[4] Univ Sargodha, Dept Comp Sci & Informat Technol, Sargodha 40100, Pakistan
[5] Al Ain Univ, Coll Engn, Al Ain, U Arab Emirates
[6] HITEC Univ Taxila, Dept Comp Sci, Taxila, Pakistan
[7] Ajou Univ, Appl Artificial Intelligence, Suwon, South Korea
[8] Soonchunhyang Univ, Dept Comp Sci & Engn, Asan, South Korea
[9] Beirut Arab Univ, Fac Sci, Dept Math & Comp Sci, Beirut, Lebanon
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2021 / Volume 69 / Issue 02
Funding
National Research Foundation of Singapore;
Keywords
Violence detection; violence recognition; deep learning; convolutional neural network; inception v4; keyframe extraction; CLASSIFICATION; SELECTION; RECURRENT; FUSION;
DOI
10.32604/cmc.2021.018103
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
Violence recognition is crucial because of its applications in security and law enforcement. Existing semi-automated systems rely on tedious manual surveillance, which causes human errors and makes these systems less effective. Several approaches have been proposed using trajectory-based, non-object-centric, and deep-learning-based methods. Previous studies have shown that deep learning techniques attain higher accuracy and lower error rates than other methods. However, their performance must still be improved. This study explores two state-of-the-art deep learning architectures, a sequential convolutional neural network (CNN) and Inception v4, to detect and recognize violence in video data. In the proposed framework, a keyframe extraction technique eliminates duplicate consecutive frames. This keyframing phase reduces the training data size and hence the computational cost. For feature selection and classification, the sequential CNN uses one kernel size, whereas the Inception v4 CNN uses multiple kernel sizes across the layers of the architecture. For empirical analysis, four widely used standard datasets covering diverse activities are used. The results confirm that the proposed approach attains 98% accuracy, reduces the computational cost, and outperforms existing techniques for violence detection and recognition.
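The keyframing idea in the abstract, dropping consecutive frames that duplicate the previous one, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state which similarity criterion is used, so the mean absolute pixel difference, the `threshold` parameter, and the function names below are assumptions made for illustration. Frames are represented as flat lists of grayscale values to keep the example dependency-free.

```python
from typing import List, Optional, Sequence


def mean_abs_diff(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean absolute difference between two equally sized, non-empty frames."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def extract_keyframes(frames: Sequence[Sequence[float]],
                      threshold: float = 1.0) -> List[int]:
    """Return indices of frames that differ from the last kept frame.

    `threshold` is a hypothetical tuning parameter (assumed here, not
    taken from the paper): a frame is kept only when its mean absolute
    difference from the previously kept frame exceeds it.
    """
    kept: List[int] = []
    last: Optional[Sequence[float]] = None
    for index, frame in enumerate(frames):
        if last is None or mean_abs_diff(frame, last) > threshold:
            kept.append(index)
            last = frame
    return kept


# Toy "video": five 4-pixel grayscale frames with repeated content.
video = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],      # duplicate of frame 0, dropped
    [10, 10, 10, 10],
    [10, 10, 10, 10],  # duplicate of frame 2, dropped
    [0, 0, 0, 0],
]
keyframe_indices = extract_keyframes(video)
```

In this toy input, two of the five frames are discarded, which is the mechanism by which keyframing shrinks the training set before the frames reach a classifier.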
Pages: 2217-2230
Page count: 14