UAV surveillance for violence detection and individual identification

被引：18

作者：

Srivastava, Anugrah ^{[1
]}

Badal, Tapas ^{[1
]}

Saxena, Pawan ^{[1
]}

Vidyarthi, Ankit ^{[2
]}

Singh, Rishav ^{[3
]}

机构：

[1] Bennett Univ, Comp Sci Engn Dept, Greater Noida, India

[2] Jaypee Inst Informat Technol, Dept CSE & IT, Noida, India

[3] Natl Inst Technol, Dept Comp Sci & Engn, Delhi, India

来源：

AUTOMATED SOFTWARE ENGINEERING | 2022年 / 29卷 / 01期

关键词：

Violence detection; Drone surveillance videos; Deep learning; LSTM; Transfer learning; Violent individual; VIDEO; RECOGNITION;

D O I：

10.1007/s10515-022-00323-3

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Violence detection and face recognition of the individuals involved in the violence has an influence that's noticeable on the development of automated video surveillance research. With increasing risks in society and insufficient staff to monitor them, there is an expanding demand for drones square measure and computerized video surveillance. Violence detection is expeditious and can be utilized as the method to selectively filter the surveillance videos, and identify or take note of the individual who is creating the anomaly. Individual identification from drone surveillance videos in a crowded area is difficult because of the expeditious movement, overlapping features, and bestrew backgrounds. The goal is to come with a better drone surveillance system that recognizes the violent individuals that are implicated in violence and evoke a distress signal so that fast help can be offered. This paper uses the currently developed techniques based on deep learning and proposed the concept of transfer learning using deep learning-based different hybrid models with LSTM for violence detection. Identifying individuals incriminated in violence from drone-captured images involves major issues in variations of human facial appearance, hence the paper uses a CNN model combined with image processing techniques. For testing, the drone captured video dataset is developed for an unconstrained environment. Ultimately, the features extracted from a hybrid of inception modules and residual blocks, with LSTM architecture yielded an accuracy of 97.33% and thereby proved to be noteworthy and thereby, demonstrating its superiority over other models that have been tested. For the individual identification module, the best accuracy of 99.20% obtained on our dataset, is a CNN model with residual blocks trained for face identification.

引用

页数：28

共 48 条

[1]

[Anonymous], 2018, 10 INT C ADV MULT MM

[2]

[Anonymous], 2003, P 5 ACM SIGMM INT WO, DOI DOI 10.1145/973264.973282

[3]

[Anonymous], 1989, Self-organization and associative memory

[4] Public acceptance of drones: Knowledge, attitudes, and practice [J].

Aydin, Burchan .

TECHNOLOGY IN SOCIETY, 2019, 59

[5] A Sensor Network Approach for Violence Detection in Smart Cities Using Deep Learning [J].

Baba, Marius ;

Gui, Vasile ;

Cernazanu, Cosmin ;

Pescaru, Dan .

SENSORS, 2019, 19 (07)

[6]

Nievas EB, 2011, LECT NOTES COMPUT SC, V6855, P332, DOI 10.1007/978-3-642-23678-5_39

[7] Person identification from aerial footage by a remote-controlled drone [J].

Bindemann, Markus ;

Fysh, Matthew C. ;

Sage, Sophie S. K. ;

Douglas, Kristina ;

Tummon, Hannah M. .

SCIENTIFIC REPORTS, 2017, 7

[8] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[9]

Deeba F, 2019, INT J ADV COMPUT SC, V10, P274

[10]

Deniz O, 2014, PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, P478

← 1 2 3 4 5 →