Stampede detector based on deep learning models using dense optical flow

被引:0
作者
Cob-Parro, Antonio Carlos [1 ]
Losada-Gutierrez, Cristina [1 ]
Marron-Romera, Marta [1 ]
机构
[1] Univ Alcala, Dept Elect, km 33600, Alcala De Henares 28805, Barcelona, Spain
关键词
Video surveillance; Stampede detection; Optical flow; Machine learning; Deep learning; BEHAVIOR; VIDEOS; CNN;
D O I
10.1016/j.engappai.2024.109940
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The world's population has grown in recent decades, increasing social events and leading to more crowd situations with potential issues, such as bottlenecks, stampedes, or falls. In this context, this paper presents an approach for stampede detection from image sequences in low- and medium-crowd. It is based on a feature vector extracted from the dense optical flow, using the Gunner-Farneback method, and a deep learning-based classification model capable of determining, frame by frame, whether a stampede is happening. It has been evaluated on four different datasets: two widely used in the state-of-the-art- University of Minnesota (UMN) and Performance Evaluation of Tracking and Surveillance (PETS-2009)- and two new labeled datasets, Geintra-Behaviour-Analysis (GBA-Stampedes) and Geintra-Santander Multiple Actions Dataset in Cruises (GSMADC), which include realistic indoor and outdoor scenarios, as well as diverse crowd types and sizes (up to 6 people in GSMADC and a minimum of 15 in GBA). Both datasets have been made publicly available to increase the limited number of sequences for validating stampede detection in videos, with more than 43000 frames. The proposed method was evaluated across various training scenarios to test its adaptability to new environments. In the most challenging scenario, using a limited training set, our system achieved average metrics of around 99% on UMN and PETS-2009, 95% on GBA, and 91% on GSMADC. In comparison, other models achieved only 90% on UMN and PETS-2009, 80% on GBA, and below 80% on GSMADC, demonstrating the accuracy and robustness of the stampede detector across scenarios.
引用
收藏
页数:18
相关论文
共 68 条
[51]   Online real-time crowd behavior detection in video sequences [J].
Pennisi, Andrea ;
Bloisi, Domenico D. ;
Iocchi, Luca .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 144 :166-176
[52]  
Pulli K, 2012, COMMUN ACM, V55, P61, DOI 10.1145/2184319.2184337
[53]  
Raj J.S., 2019, Journal of Soft Computing Paradigm (JSCP), V1, P33, DOI [DOI 10.36548/JSCP.2019.1.004, 10.36548/jscp.2019.1.004]
[54]   Survey on Contemporary Remote Surveillance Systems for Public Safety [J].
Raty, Tomi D. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2010, 40 (05) :493-515
[55]   Plug-and-Play CNN for Crowd Motion Analysis: An Application in Abnormal Event Detection [J].
Ravanbakhsh, Mahdyar ;
Nabi, Moin ;
Mousavi, Hossein ;
Sangineto, Enver ;
Sebe, Nicu .
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :1689-1698
[56]  
Ravanbakhsh M, 2017, IEEE IMAGE PROC, P1577, DOI 10.1109/ICIP.2017.8296547
[57]   STFlow: Self-Taught Optical Flow Estimation Using Pseudo Labels [J].
Ren, Zhe ;
Luo, Wenhan ;
Yan, Junchi ;
Liao, Wenlong ;
Yang, Xiaokang ;
Yuille, Alan ;
Zha, Hongyuan .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :9113-9124
[58]  
Shafie A.A., 2009, Motion detection techniques using optical flow
[59]   Secrets of Event-Based Optical Flow [J].
Shiba, Shintaro ;
Aoki, Yoshimitsu ;
Gallego, Guillermo .
COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 :628-645
[60]   Two-channel Attention Mechanism Fusion Model of Stock Price Prediction Based on CNN-LSTM [J].
Sun, Lin ;
Xu, Wenzheng ;
Liu, Jimin .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (05)