VD-Net: An Edge Vision-Based Surveillance System for Violence Detection

被引:14
作者
Khan, Mustaqeem [1 ]
El Saddik, Abdulmotaleb [1 ,2 ]
Gueaieb, Wail [2 ]
De Masi, Giulia [3 ]
Karray, Fakhri [1 ,4 ]
机构
[1] Mohamed Bin Zayed Univ Artificial Intelligence MBZ, Dept Comp Vis, Abu Dhabi, U Arab Emirates
[2] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON K1N 6N5, Canada
[3] Technol Innovat Inst TII, Abu Dhabi, U Arab Emirates
[4] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
关键词
Surveillance; Videos; Feature extraction; Real-time systems; Motion pictures; Transformers; Industrial Internet of Things; Artificial intelligence; cloud computing; edge intelligence; Internet of Things (IoT); security; smart city; violence detection;
D O I
10.1109/ACCESS.2024.3380192
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The automation of surveillance systems, driven by the rapid development of computer vision technology, has significantly enhanced the analysis of surveillance videos, particularly in recognition of human activity, including behavior analysis and violence detection, thereby bolstering public and industrial security. Despite these advancements, detecting and analyzing violent actions remains challenging, especially for real-time surveillance systems with limited computing power. We propose an artificial intelligence-based framework called VD-Net (Violence Detection Network), enabled by Intelligent Internet-of-Things (IIoT) to detect violent behavior in public and private spaces. The model utilizes lightweight special task temporal convolutional network (ST-TCN) blocks and several bottleneck layers to focus on salient features in the input sequence. The learned features passed from the classifier to discriminate between violent and nonviolent actions. Additionally, our system is supposed to trigger an alert if violence is detected, which is then communicated to relevant departments. We checked the robustness of our system by surveillance and non-surveillance datasets and ensured a 1-4 % improvement in State-of-The-Art (SoTA) accuracy.
引用
收藏
页码:43796 / 43808
页数:13
相关论文
共 53 条
[1]   Violence Detection in Videos by Combining 3D Convolutional Neural Networks and Support Vector Machines [J].
Accattoli, Simone ;
Sernani, Paolo ;
Falcionelli, Nicola ;
Mekuria, Dagmawi Neway ;
Dragoni, Aldo Franco .
APPLIED ARTIFICIAL INTELLIGENCE, 2020, 34 (04) :329-344
[2]  
Akti G. A., 2019, 9 INT C IMAGE PROCES, P1
[3]   Fight Detection from Still Images in the Wild [J].
Akti, Seymanur ;
Ofli, Ferda ;
Imran, Muhammad ;
Ekenel, Hazim Kemal .
2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, :550-559
[4]  
[Anonymous], 2019, Vidyodaya J. Sci., V22, P7
[5]  
Aremu T, 2023, Arxiv, DOI arXiv:2207.12850
[6]   Displacement monitoring of a pedestrian bridge using 3D digital image correlation [J].
Barros, Francisco ;
Aguiar, Susana ;
Sousa, Pedro J. ;
Cachaco, Antonio ;
Tavares, Paulo J. ;
Moreira, Pedro M. G. P. ;
Ranzal, D. ;
Cardoso, N. ;
Fernandes, N. ;
Fernandes, R. ;
Henriques, R. ;
Cruz, P. M. ;
Cannizzaro, A. .
4TH INTERNATIONAL CONFERENCE ON STRUCTURAL INTEGRITY (ICSI 2021), 2022, 37 :880-887
[7]  
Nievas EB, 2011, LECT NOTES COMPUT SC, V6855, P332, DOI 10.1007/978-3-642-23678-5_39
[8]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[9]   Weakly Supervised Violence Detection in Surveillance Video [J].
Choqueluque-Roman, David ;
Camara-Chavez, Guillermo .
SENSORS, 2022, 22 (12)
[10]  
Ciampi L., 2023, Proc. IMPROVE, P37, DOI [10.5220/0011965300003497, 10.5220/0011965300003497 . INSTICC, DOI 10.5220/0011965300003497.INSTICC]