PredGAN - a deep multi-scale video prediction framework for detecting anomalies in videos

被引:2
作者
Jamadandi, Adarsh [1 ]
Kotturshettar, Sunidhi [1 ]
Mudenagudi, Uma [1 ]
机构
[1] BV Bhoomaraddi Coll Engn & Technol, Hubli, Karnataka, India
来源
ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018) | 2018年
关键词
Anomaly detection; video frame prediction; generative adversarial networks;
D O I
10.1145/3293353.3293354
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a multi-scale video prediction framework with adversarial training for detecting anomalies in videos. Anomalous events are those which do not conform to normal behavior. Supervised learning framework cannot account for all the unusual activities since a universal definition of anomaly cannot be adopted. To tackle this problem, we propose an unsupervised approach to learn the internal representation of videos and use this learning to accurately predict the future-frames of the videos. We train our network adversarially on videos consisting of only normal activities. When our network encounters unusual or irregular activities the generated frames consists of fuzzy regions where the irregular activities are present. These fuzzy regions consequently lower the peak signal to noise ratio (PSNR) of the generated frames. The PSNR values are normalized to have values between 0 and 1 and is used as a regularity score to tag a frame as anomalous or not-anomalous. We provide quantitative and qualitative evaluation of the proposed framework and also introduce Earth Mover's Distance as a new evaluation metric to assess the quality of the images generated. We demonstrate our framework on UCSD Pedestrian dataset and show that we achieve comparable results.
引用
收藏
页数:8
相关论文
共 20 条
  • [1] [Anonymous], 2016, CORR
  • [2] [Anonymous], 2017, CORR
  • [3] [Anonymous], 2015, CORR
  • [4] [Anonymous], 2016, P 4 INT C LEARNING R
  • [5] [Anonymous], 2017, CORR
  • [6] Sparse Reconstruction Cost for Abnormal Event Detection
    Cong, Yang
    Yuan, Junsong
    Liu, Ji
    [J]. 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1807 - +
  • [7] Girod Bernd, 1993, P207
  • [8] Generative Adversarial Networks
    Goodfellow, Ian
    Pouget-Abadie, Jean
    Mirza, Mehdi
    Xu, Bing
    Warde-Farley, David
    Ozair, Sherjil
    Courville, Aaron
    Bengio, Yoshua
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
  • [9] Hasan Mahmudul, 2016, CORR
  • [10] Deep Visual-Semantic Alignments for Generating Image Descriptions
    Karpathy, Andrej
    Li Fei-Fei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 664 - 676