Transformer and Adaptive Threshold Sliding Window for Improving Violence Detection in Videos

被引:0
|
作者
Rendon-Segador, Fernando J. [1 ]
Alvarez-Garcia, Juan A. [1 ]
Soria-Morillo, Luis M. [1 ]
机构
[1] Univ Seville, Dept Lenguajes & Sistemas Informat, Seville 41012, Spain
关键词
deep learning; sliding window; transformer; violence detection; adaptive threshold;
D O I
10.3390/s24165429
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper presents a comprehensive approach to detect violent events in videos by combining CrimeNet, a Vision Transformer (ViT) model with structured neural learning and adversarial regularization, with an adaptive threshold sliding window model based on the Transformer architecture. CrimeNet demonstrates exceptional performance on all datasets (XD-Violence, UCF-Crime, NTU-CCTV Fights, UBI-Fights, Real Life Violence Situations, MediEval, RWF-2000, Hockey Fights, Violent Flows, Surveillance Camera Fights, and Movies Fight), achieving high AUC ROC and AUC PR values (up to 99% and 100%, respectively). However, the generalization of CrimeNet to cross-dataset experiments posed some problems, resulting in a 20-30% decrease in performance, for instance, training in UCF-Crime and testing in XD-Violence resulted in 70.20% in AUC ROC. The sliding window model with adaptive thresholding effectively solves these problems by automatically adjusting the violence detection threshold, resulting in a substantial improvement in detection accuracy. By applying the sliding window model as post-processing to CrimeNet results, we were able to improve detection accuracy by 10% to 15% in cross-dataset experiments. Future lines of research include improving generalization, addressing data imbalance, exploring multimodal representations, testing in real-world applications, and extending the approach to complex human interactions.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Long Text Classification Model Based on Transformer Sliding Window and Threshold Optimization
    Pan, Jin
    Chen, Yang
    Zhao, Chunlu
    Liu, Yang
    Chu, Jie
    JOURNAL OF INTERNET TECHNOLOGY, 2025, 26 (02): : 231 - 240
  • [2] Smartphone-Based Unconstrained Step Detection Fusing a Variable Sliding Window and an Adaptive Threshold
    Xu, Ying
    Li, Guofeng
    Li, Zeyu
    Yu, Hao
    Cui, Jianhui
    Wang, Jin
    Chen, Yu
    REMOTE SENSING, 2022, 14 (12)
  • [3] Improving Packet Header Compression with Adaptive Sliding Window Size
    Cha, Hyejin
    Shon, Taeshik
    Kim, Kangseok
    Hong, Manpyo
    2015 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN), 2015, : 541 - 543
  • [4] Mining Frequent Flows Based on Adaptive Threshold with a Sliding Window over Online Packet Stream
    Zhang, Zhen
    Wang, Binqiang
    Chen, Shuqiao
    Zhu, Ke
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS, 2009, : 210 - 214
  • [5] Data Efficient Video Transformer for Violence Detection
    Abdali, Almamon Rasool
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORKS AND SATELLITE (COMNETSAT 2021), 2021, : 195 - 199
  • [6] A dataset for automatic violence detection in videos
    Bianculli, Miriana
    Falcionelli, Nicola
    Sernani, Paolo
    Tomassini, Selene
    Contardo, Paolo
    Lombardi, Mara
    Dragoni, Aldo Franco
    DATA IN BRIEF, 2020, 33
  • [7] Research on Transformer Temperature Early Warning Method Based on Adaptive Sliding Window and Stacking
    Zhang, Pan
    Zhang, Qian
    Hu, Huan
    Hu, Huazhi
    Peng, Runze
    Liu, Jiaqi
    ELECTRONICS, 2025, 14 (02):
  • [8] Adaptive Sliding-Window Strategy for Vehicle Detection in Highway Environments
    Noh, SeungJong
    Shim, Daeyoung
    Jeon, Moongu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 17 (02) : 323 - 335
  • [9] MULTIMODAL VIOLENCE DETECTION IN VIDEOS
    Peixoto, Bruno
    Lavi, Bahram
    Bestagini, Paolo
    Dias, Zanoni
    Rocha, Anderson
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2957 - 2961
  • [10] Adaptive Sliding Window Load Forecasting
    Foster, Judith
    Liu, Xueqin
    McLoone, Sean
    2017 28TH IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2017,