Real-Time Violent Action Recognition Using Key Frames Extraction and Deep Learning

被引:18
|
作者
Ahmed, Muzamil [1 ,2 ]
Ramzan, Muhammad [3 ,4 ]
Khan, Hikmat Ullah [2 ]
Iqbal, Saqib [5 ]
Khan, Muhammad Attique [6 ]
Choi, Jung-In [7 ]
Nam, Yunyoung [8 ]
Kadry, Seifedine [9 ]
机构
[1] Univ Lahore, Dept Comp Sci & Informat Technol, Sargodha Campus, Sargodha 40100, Pakistan
[2] COMSATS Univ Islamabad, Dept Comp Sci, Wah Campus, Wah Cantt 47040, Pakistan
[3] Univ Management & Technol, Sch Syst & Technol, Lahore 54782, Pakistan
[4] Univ Sargodha, Dept Comp Sci & Informat Technol, Sargodha 40100, Pakistan
[5] Al Ain Univ, Coll Engn, Al Ain, U Arab Emirates
[6] HITEC Univ Taxila, Dept Comp Sci, Taxila, Pakistan
[7] Ajou Univ, Appl Artificial Intelligence, Suwon, South Korea
[8] Soonchunhyang Univ, Dept Comp Sci & Engn, Asan, South Korea
[9] Beirut Arab Univ, Fac Sci, Dept Math & Comp Sci, Beirut, Lebanon
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 69卷 / 02期
基金
新加坡国家研究基金会;
关键词
Violence detection; violence recognition; deep learning; convolutional neural network; inception v4; keyframe extraction; CLASSIFICATION; SELECTION; RECURRENT; FUSION;
D O I
10.32604/cmc.2021.018103
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Violence recognition is crucial because of its applications in activities related to security and lawenforcement. Existing semi-automated systems have issues such as tedious manual surveillances, which causes human errors and makes these systems less effective. Several approaches have been proposed using trajectory-based, non-object-centric, and deep-learning-based methods. Previous studies have shown that deep learning techniques attain higher accuracy and lower error rates than those of other methods. However, the their performance must be improved. This study explores the state-of-the-art deep learning architecture of convolutional neural networks (CNNs) and inception V4 to detect and recognize violence using video data. In the proposed framework, the keyframe extraction technique eliminates duplicate consecutive frames. This keyframing phase reduces the training data size and hence decreases the computational cost by avoiding duplicate frames. For feature selection and classification tasks, the applied sequential CNN uses one kernel size, whereas the inception v4CNN uses multiple kernels for different layers of the architecture. For empirical analysis, four widely used standard datasets are used with diverse activities. The results confirm that the proposed approach attains 98% accuracy, reduces the computational cost, and outperforms the existing techniques of violence detection and recognition.
引用
收藏
页码:2217 / 2230
页数:14
相关论文
共 50 条
  • [41] Real-time Driver Drowsiness Detection using Deep Learning
    Dipu, Md Tanvir Ahammed
    Hossain, Syeda Sumbul
    Arafat, Yeasir
    Rafiq, Fatama Binta
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (07) : 844 - 850
  • [42] A Real-Time Parking Space Occupancy Detection Using Deep Learning Model
    Prova, Raktim Raihan
    Shinha, Title
    Pew, Anamika Basak
    Rahman, Rashedur M.
    2022 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS), 2022, : 51 - 57
  • [43] REAL-TIME AUTOMATIC FETAL BRAIN EXTRACTION IN FETAL MRI BY DEEP LEARNING
    Salehi, Seyed Sadegh Mohseni
    Hashemi, Seyed Raein
    Velasco-Annis, Clemente
    Ouaalam, Abdelhakim
    Estroff, Judy A.
    Erdogmus, Deniz
    Warfield, Simon K.
    Gholipour, Ali
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 720 - 724
  • [44] Deep learning based real-time tourist spots detection and recognition mechanism
    Chen, Yen-Chiu
    Yu, Kun-Ming
    Kao, Tzu-Hsiang
    Hsieh, Hao-Lun
    SCIENCE PROGRESS, 2021, 104
  • [45] Deep learning-assisted real-time container corner casting recognition
    Lee, Jaecheul
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2019, 15 (01):
  • [46] Real-Time Deep Learning-Based Object Recognition in Augmented Reality
    Egipko, V
    Zhdanova, M.
    Gapon, N.
    Voronin, V.
    Semenishchev, E.
    REAL-TIME PROCESSING OF IMAGE, DEPTH, AND VIDEO INFORMATION 2024, 2024, 13000
  • [47] Deep Learning-Based Emotion Recognition from Real-Time Videos
    Zhou, Wenbin
    Cheng, Justin
    Lei, Xingyu
    Benes, Bedrich
    Adamo, Nicoletta
    HUMAN-COMPUTER INTERACTION. MULTIMODAL AND NATURAL INTERACTION, HCI 2020, PT II, 2020, 12182 : 321 - 332
  • [48] Real-Time Recognition of Signboards with Mobile Device using Deep Learning for Information Identification Support System
    Kitamura, Shigeo
    Kita, Kota
    Matsushita, Mitsunori
    SUI'18: PROCEEDINGS OF THE 2018 SYMPOSIUM ON SPATIAL USER INTERACTION, 2016, : 178 - 178
  • [49] Real-time Driver Drowsiness Detection using Deep Learning
    Dipu M.T.A.
    Hossain S.S.
    Arafat Y.
    Rafiq F.B.
    Dipu, Md. Tanvir Ahammed, 1600, Science and Information Organization (12): : 844 - 850
  • [50] Real-time masked face recognition using deep learning-based double generator network
    Sumathy, G.
    Usha, M.
    Rajakumar, S.
    Jayapriya, P.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 325 - 334