Elevating urban surveillance: A deep CCTV monitoring system for detection of anomalous events via human action recognition

被引:3
作者
Kim, Hyungmin [1 ,2 ]
Jeon, Hobeom [1 ,2 ]
Kim, Dohyung [1 ,2 ]
Kim, Jaehong [2 ]
机构
[1] Univ Sci & Technol UST, 217 Gajeong Ro, Daejeon 34113, South Korea
[2] Elect & Telecommun Res Inst ETRI, 218 Gajeong Ro, Daejeon 34129, South Korea
关键词
Social sustainability; Surveillance system; Abnormal action detection; Deep learning; HUMAN FALL DETECTION; OPTICAL-FLOW; VIOLENCE; CRIME; FEAR;
D O I
10.1016/j.scs.2024.105793
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In the face of urbanization and the widespread use of CCTV cameras, the processing of surveillance videos has gained importance. This study endeavors to create a city-wide monitoring system utilizing human action recognition that can elevate the social sustainability of citizens. The primary goal is to develop an entire framework to detect unusual events within urban environments, with a specific focus on identifying four aberrant actions: "falling," "violence," "loitering," and "intrusion.". The processing of CCTV images is vulnerable to adverse weather conditions, particularly impacting human detection and tracking when obstructions like body parts occlusion, such as during falling events. To address these challenges, the paper proposes tracking compensation techniques that boost the system's ability to detect anomalies without requiring additional training. The proposed approach demonstrates a remarkable 21.21% enhancement in detecting falling events, without compromising its handling of other event types. Overall, the system achieves an impressive average F1 score of 93% across diverse event categories. The system's effectiveness is thoroughly assessed through an extensive subway domain case study, shedding light on its robustness and adaptability for potential real-world deployment. This study also delves into transfer learning dynamics based on sample quantity and pre-training with relevant human-of-interest data.
引用
收藏
页数:17
相关论文
共 91 条
[1]  
AI-Hub, 2020, Ai-hub subway station abnormal behavior dataset
[2]  
AI-Hub, 2019, Ai-hub abnormal behavior dataset
[3]  
[Anonymous], 2010, IEEE International Conference on Pattern Recognition Workshops
[4]  
Ansariyar A., 2023, PREPRINT
[5]   ViViT: A Video Vision Transformer [J].
Arnab, Anurag ;
Dehghani, Mostafa ;
Heigold, Georg ;
Sun, Chen ;
Lucic, Mario ;
Schmid, Cordelia .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :6816-6826
[6]   Efficient Human Violence Recognition for Surveillance in Real Time [J].
Baca, Herwin Alayn Huillcen ;
Valdivia, Flor de Luz Palomino ;
Caceres, Juan Carlos Gutierrez .
SENSORS, 2024, 24 (02)
[7]  
Nievas EB, 2011, LECT NOTES COMPUT SC, V6855, P332, DOI 10.1007/978-3-642-23678-5_39
[8]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[9]   A dataset for automatic violence detection in videos [J].
Bianculli, Miriana ;
Falcionelli, Nicola ;
Sernani, Paolo ;
Tomassini, Selene ;
Contardo, Paolo ;
Lombardi, Mara ;
Dragoni, Aldo Franco .
DATA IN BRIEF, 2020, 33
[10]   Evaluation of alternative policies to combat false emergency calls [J].
Blackstone, EA ;
Buck, AJ ;
Hakim, S .
EVALUATION AND PROGRAM PLANNING, 2005, 28 (02) :233-242