Fast Violence Detection in Video

被引:0
作者
Deniz, Oscar [1 ]
Serrano, Ismael [1 ]
Bueno, Gloria [1 ]
Kim, Tae-Kyun [2 ]
机构
[1] Univ Castilla La Mancha, ETSI Ind, VISILAB Grp, Avda Camilo Jose Cela S-N, E-13071 Ciudad Real, Spain
[2] Imperial Coll, Dept Elect & Elect Engn, London SW7 2AZ, England
来源
PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2 | 2014年
关键词
Action Recognition; Violence Detection; Fight Detection; BODY MOVEMENT; PERCEPTION; MOTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Whereas the action recognition problem has become a hot topic within computer vision, the detection of fights or in general aggressive behavior has been comparatively less studied. Such capability may be extremely useful in some video surveillance scenarios like in prisons, psychiatric centers or even embedded in camera phones. Recent work has considered the well-known Bag-of-Words framework often used in generic action recognition for the specific problem of fight detection. Under this framework, spatio-temporal features are extracted from the video sequences and used for classification. Despite encouraging results in which near 90% accuracy rates were achieved for this specific task, the computational cost of extracting such features is prohibitive for practical applications, particularly in surveillance and media rating systems. The task of violence detection may have, however, specific features that can be leveraged. Inspired by psychology results that suggest that kinematic features alone are discriminant for specific actions, this work proposes a novel method which uses extreme acceleration patterns as the main feature. These extreme accelerations are efficiently estimated by applying the Radon transform to the power spectrum of consecutive frames. Experiments show that accuracy improvements of up to 12% are achieved with respect to state-of-the-art generic action recognition methods. Most importantly, the proposed method is at least 15 times faster.
引用
收藏
页码:478 / 485
页数:8
相关论文
共 26 条
[1]  
[Anonymous], P ANN M COGN SCI SOC
[2]  
[Anonymous], 2005, TECHNICAL REPORT
[3]   Convergent evidence for the visual analysis of optic flow through anisotropic attenuation of high spatial frequencies [J].
Barlow, HB ;
Olshausen, BA .
JOURNAL OF VISION, 2004, 4 (06) :415-426
[4]  
Nievas EB, 2011, LECT NOTES COMPUT SC, V6855, P332, DOI 10.1007/978-3-642-23678-5_39
[5]   Perception of human motion [J].
Blake, Randolph ;
Shiffrar, Maggie .
ANNUAL REVIEW OF PSYCHOLOGY, 2007, 58 :47-73
[6]  
Bobick A., 1996, Proceedings of the 13th International Conference on Pattern Recognition, P307, DOI 10.1109/ICPR.1996.546039
[7]  
Castellano G, 2007, LECT NOTES COMPUT SC, V4738, P71
[8]  
Chen DT, 2008, IEEE ENG MED BIO, P5238, DOI 10.1109/IEMBS.2008.4650395
[9]   VIOLENT SCENE DETECTION IN MOVIES [J].
Chen, Liang-Hua ;
Su, Chih-Wen ;
Hsu, Hsi-Wen .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2011, 25 (08) :1161-1172
[10]  
Chen MF, 2010, BIOMED CENTRAL CANC, V10, P1