A Novel Violent Video Detection Scheme Based on Modified 3D Convolutional Neural Networks

被引:64
作者
Song, Wei [1 ,2 ]
Zhang, Dongliang [1 ]
Zha, Xiaobing [1 ,2 ]
Yu, Jing [3 ]
Zheng, Rui [1 ]
Wang, Antai [4 ]
机构
[1] Minzu Univ China, Sch Informat Engn, Beijing 100081, Peoples R China
[2] Minzu Univ China, Natl Language Resource Monitoring & Res Ctr Minor, Beijing 100081, Peoples R China
[3] Beijing Jiaotong Univ, Sch Elect Informat Engn, Beijing 100044, Peoples R China
[4] New Jersey Inst Technol, Dept Math Sci, Newark, NJ 07102 USA
基金
美国国家科学基金会;
关键词
Violent video detection; 3D ConvNet; key frame extraction; MOVIES; SCENES; VECTOR; AUDIO;
D O I
10.1109/ACCESS.2019.2906275
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Violent video constitutes a threat to public security, and effective detection algorithms are in urgent need. In order to improve the detection accuracy of 3D convolutional neural networks (3D ConvNet), a novel violent video detection scheme based on the modified 3D ConvNet is proposed. In this paper, the preprocessing method of data is improved, and a new sampling method by using the key frame as dividing nodes is designed. Then, a random sampling method is adapted to produce the input frame sequence. With experimental evaluations on the crowd violence dataset, the results demonstrate the effectiveness of the proposed new sampling method. For three public violent detection datasets: hockey fight, movies, and crowd violence, individualized strategies are implemented to suit the varied clip length. For the short clips, the 3D ConvNet is constructed by using the uniform sampling method. For the longer clips, the new frame sampling strategy is adopted. The proposed scheme obtains competitive results: 99.62% on hockey fight, 99.97% on movies, and 94.3% on crowd violence. The experimental results show that our method is simple and effective.
引用
收藏
页码:39172 / 39179
页数:8
相关论文
共 41 条
[1]  
[Anonymous], 2014, ADV NEURAL INFORM PR
[2]  
[Anonymous], INT C AC SPEECH SIGN
[3]  
[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[4]  
[Anonymous], 2016, NETINFOSECUR
[5]  
[Anonymous], 2009, MOSIFT RECOGNIZING H
[6]  
[Anonymous], PROC CVPR IEEE
[7]  
[Anonymous], 2005, PHILLIPINE COMPUT SC
[8]  
[Anonymous], 2016, P ECCV
[9]  
Nievas EB, 2011, LECT NOTES COMPUT SC, V6855, P332, DOI 10.1007/978-3-642-23678-5_39
[10]  
Cheng W., 2003, P 5 ACM SIGMM INT WO, P109, DOI DOI 10.1145/973264.973282