Lightweight mobile network for real-time violence recognition

被引:5
作者
Zhang, Youshan [1 ]
Li, Yong [2 ]
Guo, Shaozhe [1 ]
机构
[1] Chinese Peoples Armed Police Force Engn Univ, Grad Student Brigade, Xian, Shaanxi, Peoples R China
[2] Chinese Peoples Armed Police Force Engn Univ, Coll Informat Engn, Xian, Shaanxi, Peoples R China
关键词
D O I
10.1371/journal.pone.0276939
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Most existing violence recognition methods have complex network structures and high cost of computation and cannot meet the requirements of large-scale deployment. The purpose of this paper is to reduce the complexity of the model to realize the application of violence recognition on mobile intelligent terminals. To solve this problem, we propose MobileNet-TSM, a lightweight network, which uses MobileNet-V2 as main structure. By incorporating temporal shift modules (TSM), which can exchange information between frames, the capability of extracting dynamic characteristics between consecutive frames is strengthened. Extensive experiments are conducted to prove the validity of this method. Our proposed model has only 8.49MB parameters and 175.86MB estimated total size. Compared with the existing methods, this method greatly reduced the model size, at the cost of an accuracy gap of about 3%. The proposed model has achieved accuracy of 97.959%, 97.5% and 87.75% on three public datasets (Crowd Violence, Hockey Fights, and RWF-2000), respectively. Based on this, we also build a real-time violence recognition application on the Android terminal. The source code and trained models are available on https://github.com/1840210289/MobileNet-TSM.git.
引用
收藏
页数:14
相关论文
共 51 条
[1]   Multi-Level Two-Stream Fusion-Based Spatio-Temporal Attention Model for Violence Detection and Localization [J].
Asad, Mujtaba ;
Jiang, He ;
Yang, Jie ;
Tu, Enmei ;
Malik, Aftab A. .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
[2]  
Nievas EB, 2011, LECT NOTES COMPUT SC, V6855, P332, DOI 10.1007/978-3-642-23678-5_39
[3]  
Bilinski P, 2016, 2016 13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), P30, DOI 10.1109/AVSS.2016.7738019
[4]  
Bin Jiang, 2019, 2019 International Conference on Intelligent Computing and its Emerging Applications (ICEA). Proceedings, P59, DOI 10.1109/ICEA.2019.8858306
[5]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[6]   VIOLENT SCENE DETECTION IN MOVIES [J].
Chen, Liang-Hua ;
Su, Chih-Wen ;
Hsu, Hsi-Wen .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2011, 25 (08) :1161-1172
[7]   RWF-2000: An Open Large Scale Video Database for Violence Detection [J].
Cheng, Ming ;
Cai, Kunjing ;
Li, Ming .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :4183-4190
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]  
Das S, 2019, 2019 4 INT C EL INF, P1
[10]  
Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878