Real-time crowd behavior recognition in surveillance videos based on deep learning methods

Cited by: 13
Authors
Rezaei, Fariba [1]
Yazdi, Mehran [1]
Affiliations
[1] Shiraz Univ, Sch Elect & Comp Engn, Shiraz, Iran
Keywords
Crowd behavior recognition; Deep learning; PETS2009 dataset; Conv-LSTM-AE; Anomaly detection
DOI
10.1007/s11554-021-01116-9
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automatic video surveillance of crowded public places has been an active research area for security purposes. Traditional approaches tackle crowd behavior recognition with a sequential two-stage pipeline: low-level feature extraction followed by classification. More recently, deep learning has shown promising results compared with traditional methods by extracting high-level representations and solving the problem end to end. In this paper, we investigate a deep architecture for crowd event recognition that detects seven behavior categories in the PETS2009 event recognition dataset. More specifically, we apply an integrated handcrafted and Conv-LSTM-AE method, with optical flow images as input, to extract a high-level representation of the data and perform classification. After a latent representation of the input optical flow image sequences is obtained at the bottleneck of the autoencoder (AE), the architecture splits into two separate branches: one is the AE decoder and the other is the classifier. The proposed architecture is trained jointly for representation and classification by defining two different losses. Experimental comparison with state-of-the-art methods indicates that our algorithm is promising for real-time event recognition and achieves better performance on the calculated metrics.
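The joint training described in the abstract, with one loss for the decoder branch and one for the classifier branch, can be illustrated with a minimal sketch. The abstract does not state which losses are used or how they are combined, so everything specific here is an assumption: mean squared error for reconstruction, softmax cross-entropy for classification, a weighted sum with hypothetical weights `alpha` and `beta`, and the function names themselves. Only the count of seven classes comes from the abstract.

```python
import math


def mse(target, recon):
    """Mean squared error between two equal-length flat sequences
    (assumed reconstruction loss for the decoder branch)."""
    if len(target) != len(recon):
        raise ValueError("target and reconstruction must have equal length")
    return sum((t - r) ** 2 for t, r in zip(target, recon)) / len(target)


def softmax(logits):
    """Numerically stable softmax over a list of class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Negative log-probability of the true class
    (assumed classification loss for the classifier branch)."""
    return -math.log(softmax(logits)[label])


def joint_loss(target, recon, logits, label, alpha=1.0, beta=1.0):
    """Weighted sum of the two branch losses; alpha and beta are
    hypothetical weights, not values taken from the paper."""
    return alpha * mse(target, recon) + beta * cross_entropy(logits, label)


# Toy example: a flattened optical-flow patch, its reconstruction,
# and classifier scores over the seven behavior categories.
flow = [0.0, 0.5, 1.0, 0.5]
reconstruction = [0.1, 0.4, 0.9, 0.6]
class_scores = [0.2, 1.5, 0.1, 0.0, -0.3, 0.4, 0.0]
loss = joint_loss(flow, reconstruction, class_scores, label=1)
```

In a real network both terms would be computed on the outputs of the two branches that share the bottleneck representation, so gradients from either loss update the shared encoder.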
Pages: 1669-1679
Number of pages: 11