Deep BiLSTM Attention Model for Spatial and Temporal Anomaly Detection in Video Surveillance

Cited by: 0
Authors
Natha, Sarfaraz [1, 2]
Ahmed, Fareed [1]
Siraj, Mohammad [3]
Lagari, Mehwish [1]
Altamimi, Majid [3]
Chandio, Asghar Ali [1]
Affiliations
[1] Quaid E Awam Univ, Dept Informat Technol, Nawabshah 67450, Pakistan
[2] Sir Syed Univ Engn & Technol, Dept Software Engn, Karachi 75300, Pakistan
[3] King Saud Univ, Coll Engn, Dept Elect Engn, Riyadh 11543, Saudi Arabia
Keywords
convolutional neural network; recurrent neural network; BiLSTM; multi-attention layer; anomaly detection
DOI
10.3390/s25010251
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Detecting anomalies in video surveillance plays a key role in ensuring the safety and security of public spaces. As the number of surveillance cameras grows, monitoring them manually becomes harder, which increases the demand for automated systems that detect abnormal events such as road accidents, fighting, snatching, car fires, and explosions in real time. Such systems improve detection accuracy, minimize human error, and make security operations more efficient. In this study, we propose the Composite Recurrent Bi-Attention (CRBA) model for detecting anomalies in surveillance videos. The CRBA model combines DenseNet201 for robust spatial feature extraction with BiLSTM networks that capture temporal dependencies across video frames. A multi-attention mechanism is also incorporated to direct the model's focus to critical spatiotemporal regions, which improves its ability to distinguish between normal and abnormal behaviors. By integrating these components, the CRBA model improves the detection and classification of anomalies in surveillance videos, addressing both spatial and temporal challenges. Experimental assessments demonstrate that the CRBA model achieves high accuracy on both the University of Central Florida (UCF) dataset and the newly developed Road Anomaly Dataset (RAD). The model enhances detection accuracy while also improving resource efficiency and minimizing response times in critical situations. These advantages make it a valuable tool for public safety and security operations, where rapid and accurate responses are essential.
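The abstract names an attention mechanism over per-frame features but gives no details, so the following is only a minimal sketch of one common form, dot-product attention pooling over a sequence of frame vectors. It is not the CRBA implementation. The function names, the `query` vector, and the toy feature values are all hypothetical; in a real model the frame vectors would come from the CNN and BiLSTM stages and the query would be learned.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(frame_features, query):
    """Score each frame by its dot product with `query`, normalize the
    scores with softmax, and return (weights, weighted-sum vector)."""
    scores = [sum(f * q for f, q in zip(frame, query)) for frame in frame_features]
    weights = softmax(scores)
    dim = len(frame_features[0])
    pooled = [
        sum(w * frame[d] for w, frame in zip(weights, frame_features))
        for d in range(dim)
    ]
    return weights, pooled


# Toy per-frame feature vectors (hypothetical values), standing in for
# the temporal features a BiLSTM would produce for four video frames.
frames = [
    [0.1, 0.0, 0.2],
    [0.2, 0.1, 0.1],
    [2.0, 1.5, 0.3],  # frame with unusually strong activations
    [0.1, 0.2, 0.0],
]
query = [1.0, 1.0, 0.0]  # hypothetical; a trained model would learn this

weights, pooled = attention_pool(frames, query)
print("attention weights:", [round(w, 3) for w in weights])
print("pooled feature:", [round(p, 3) for p in pooled])
```

In this toy input the third frame receives the largest weight, so it dominates the pooled vector that a downstream classifier would see. That is the sense in which attention can direct a model toward the frames most relevant to an anomaly.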
Pages: 24