Transformer and Adaptive Threshold Sliding Window for Improving Violence Detection in Videos

被引:0
|
作者
Rendon-Segador, Fernando J. [1 ]
Alvarez-Garcia, Juan A. [1 ]
Soria-Morillo, Luis M. [1 ]
机构
[1] Univ Seville, Dept Lenguajes & Sistemas Informat, Seville 41012, Spain
关键词
deep learning; sliding window; transformer; violence detection; adaptive threshold;
D O I
10.3390/s24165429
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper presents a comprehensive approach to detect violent events in videos by combining CrimeNet, a Vision Transformer (ViT) model with structured neural learning and adversarial regularization, with an adaptive threshold sliding window model based on the Transformer architecture. CrimeNet demonstrates exceptional performance on all datasets (XD-Violence, UCF-Crime, NTU-CCTV Fights, UBI-Fights, Real Life Violence Situations, MediEval, RWF-2000, Hockey Fights, Violent Flows, Surveillance Camera Fights, and Movies Fight), achieving high AUC ROC and AUC PR values (up to 99% and 100%, respectively). However, the generalization of CrimeNet to cross-dataset experiments posed some problems, resulting in a 20-30% decrease in performance, for instance, training in UCF-Crime and testing in XD-Violence resulted in 70.20% in AUC ROC. The sliding window model with adaptive thresholding effectively solves these problems by automatically adjusting the violence detection threshold, resulting in a substantial improvement in detection accuracy. By applying the sliding window model as post-processing to CrimeNet results, we were able to improve detection accuracy by 10% to 15% in cross-dataset experiments. Future lines of research include improving generalization, addressing data imbalance, exploring multimodal representations, testing in real-world applications, and extending the approach to complex human interactions.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] An Adaptive Shot Change Detection Algorithm Using An Average of Absolute Difference Histogram within Extension Sliding Window
    Kim, Won-Hee
    Kim, Jong-Nam
    ISCE: 2009 IEEE 13TH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS, VOLS 1 AND 2, 2009, : 961 - 964
  • [42] Video Segmentation Algorithm Using Threshold and Weighting Based on Moving Sliding Window
    Kim, Won-Hee
    Jeong, Tae-Il
    Kim, Jong-Nam
    11TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY, VOLS I-III, PROCEEDINGS,: UBIQUITOUS ICT CONVERGENCE MAKES LIFE BETTER!, 2009, : 1781 - 1784
  • [43] Sliding-Window Probabilistic Threshold Aggregate Queries on Uncertain Data Streams
    Chen, Donghui
    Chen, Ling
    INFORMATION SCIENCES, 2020, 520 (520) : 353 - 372
  • [44] Dynamic Threshold based Sliding-Window Filtering Technique for RFID Data
    Tyagi, Sapna
    Ansari, A. Q.
    Khan, M. Ayoub
    2010 IEEE 2ND INTERNATIONAL ADVANCE COMPUTING CONFERENCE, 2010, : 115 - +
  • [45] Statistical Features-Based Violence Detection in Surveillance Videos
    Deepak, K.
    Vignesh, L. K. P.
    Srivathsan, G.
    Roshan, S.
    Chandrakala, S.
    COGNITIVE INFORMATICS AND SOFT COMPUTING, 2020, 1040 : 197 - 203
  • [46] Sliding Dynamic Data Window: Improving Properties of the Incremental Learning Methods
    Ardakani, Mohammad Hamed
    Escudero, Gerard
    Graells, Moises
    Espuna, Antonio
    27TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, PT B, 2017, 40B : 1663 - 1668
  • [47] VIOLENCE DETECTION IN VIDEOS BASED ON FUSING VISUAL AND AUDIO INFORMATION
    Pang, Wen-Feng
    He, Qian-Hua
    Hu, Yong-jian
    Li, Yan-Xiong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2260 - 2264
  • [48] Improving small objects detection using transformer
    Dubey, Shikha
    Olimov, Farrukh
    Rafique, Muhammad Aasim
    Jeon, Moongu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 89
  • [49] A robust sliding window adaptive filtering technique for phonocardiogram signal denoising
    Shervegar, Vishwanath Madhava
    EXPERT SYSTEMS, 2025, 42 (01)
  • [50] Sliding Window Frequent Items Detection in Wireless Sensor Networks
    Wang Shuang
    Wu Li-Na
    INDUSTRIAL ENGINEERING, MACHINE DESIGN AND AUTOMATION (IEMDA 2014) & COMPUTER SCIENCE AND APPLICATION (CCSA 2014), 2015, : 76 - 82