Violence Detection in Videos by Combining 3D Convolutional Neural Networks and Support Vector Machines

被引:45
作者
Accattoli, Simone [1 ]
Sernani, Paolo [1 ]
Falcionelli, Nicola [1 ]
Mekuria, Dagmawi Neway [1 ]
Dragoni, Aldo Franco [1 ]
机构
[1] Univ Politecn Marche, Dept Informat Engn, Ancona, Italy
关键词
RECOGNITION; AUDIO; IMAGE;
D O I
10.1080/08839514.2020.1723876
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video-surveillance has always been a vital tool to enforce safety in both public and private environments. Even though (smart) cameras are nowadays relatively widespread and cheap, such monitoring systems lack effectiveness in most scenarios. In addition, there is no guarantee about a human operator who monitors rare events in live video footages, forcing the use of such systems after unwanted events already took their undisturbed course, as a mere tool for investigations. Having an intelligent software to perform the task would allow to unlock the full potential of video-surveillance systems. To this end, in this paper we propose a solution based on a 3D Convolutional Neural Network that can effectively detect fights, aggressive motions and violence scenes in live video streams. Compared to state-of-the-art techniques, our method showed very promising performance on three challenging benchmark datasets: Hockey Fight, Crowd Violence and Movie Violence.
引用
收藏
页码:329 / 344
页数:16
相关论文
共 42 条
[1]   Spatio-temporal feature using optical flow based distribution for violence detection [J].
Ben Mabrouk, Amira ;
Zagrouba, Ezzeddine .
PATTERN RECOGNITION LETTERS, 2017, 92 :62-67
[2]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[3]  
Nievas EB, 2011, LECT NOTES COMPUT SC, V6855, P332, DOI 10.1007/978-3-642-23678-5_39
[4]   Convolutional neural network acceleration with hardware/software co-design [J].
Chen, Andrew Tzer-Yeu ;
Biglari-Abhari, Morteza ;
Wang, Kevin I-Kai ;
Bouzerdoum, Abdesselam ;
Tivive, Fok Hing Chi .
APPLIED INTELLIGENCE, 2018, 48 (05) :1288-1301
[5]  
Chen DT, 2008, IEEE ENG MED BIO, P5238, DOI 10.1109/IEMBS.2008.4650395
[6]  
Chen M.y., 2009, Tech. Rep. CMU-CS- 09-161
[7]  
de Souza F. D. M., 2010, Proceedings of the 23rd SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI 2010), P224, DOI 10.1109/SIBGRAPI.2010.38
[8]  
Deniz O, 2014, PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, P478
[9]  
Ding CH, 2014, LECT NOTES COMPUT SC, V8888, P551, DOI 10.1007/978-3-319-14364-4_53
[10]   Multi-stream Deep Networks for Person to Person Violence Detection in Videos [J].
Dong, Zhihong ;
Qin, Jie ;
Wang, Yunhong .
PATTERN RECOGNITION (CCPR 2016), PT I, 2016, 662 :517-531