PredGAN - a deep multi-scale video prediction framework for detecting anomalies in videos

Cited by: 2

Authors:

Jamadandi, Adarsh [1]

Kotturshettar, Sunidhi [1]

Mudenagudi, Uma [1]

Affiliations:

[1] BV Bhoomaraddi Coll Engn & Technol, Hubli, Karnataka, India

Source:

ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018) | 2018

Keywords:

Anomaly detection; video frame prediction; generative adversarial networks

DOI:

10.1145/3293353.3293354

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper, we propose a multi-scale video prediction framework with adversarial training for detecting anomalies in videos. Anomalous events are those that do not conform to normal behavior. A supervised learning framework cannot account for all unusual activities, since no universal definition of an anomaly can be adopted. To tackle this problem, we propose an unsupervised approach that learns the internal representation of videos and uses it to accurately predict their future frames. We train our network adversarially on videos consisting of only normal activities. When the network encounters unusual or irregular activities, the generated frames contain fuzzy regions where those activities are present. These fuzzy regions lower the peak signal-to-noise ratio (PSNR) of the generated frames. The PSNR values are normalized to lie between 0 and 1 and are used as a regularity score to tag a frame as anomalous or not anomalous. We provide quantitative and qualitative evaluations of the proposed framework and also introduce the Earth Mover's Distance as a new evaluation metric to assess the quality of the generated images. We demonstrate our framework on the UCSD Pedestrian dataset and show that we achieve comparable results.
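The scoring step described in the abstract can be illustrated with a short sketch. This is not the authors' implementation: the abstract says only that PSNR values are normalized to the range 0 to 1, so the per-video min-max normalization, the threshold of 0.5, the PSNR cap for identical frames, and all function names below are assumptions made for illustration. Frames are represented as flat sequences of 8-bit pixel values.

```python
import math


def psnr(predicted, actual, max_value=255.0, cap=100.0):
    """Peak signal-to-noise ratio (dB) between two equal-length pixel sequences.

    `cap` is a hypothetical ceiling returned for identical frames, so that
    later normalization never has to handle an infinite value.
    """
    if len(predicted) != len(actual) or not actual:
        raise ValueError("frames must be non-empty and the same size")
    mse = sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)
    if mse == 0:
        return cap
    return min(cap, 10.0 * math.log10(max_value ** 2 / mse))


def regularity_scores(psnr_values):
    """Min-max normalize the PSNR values of one video to [0, 1] (assumed scheme)."""
    lo, hi = min(psnr_values), max(psnr_values)
    if hi == lo:
        # Every frame was predicted equally well; treat all as regular.
        return [1.0] * len(psnr_values)
    return [(v - lo) / (hi - lo) for v in psnr_values]


def tag_anomalies(scores, threshold=0.5):
    """Flag frames whose regularity score is below a hypothetical threshold."""
    return [s < threshold for s in scores]


# Toy example: one ground-truth frame and three predictions of decreasing quality.
actual = [100, 100, 100, 100]
predictions = [
    [101, 99, 100, 100],   # sharp prediction
    [104, 96, 100, 100],   # slightly blurred prediction
    [140, 60, 100, 100],   # badly blurred prediction
]
psnr_values = [psnr(p, actual) for p in predictions]
scores = regularity_scores(psnr_values)
flags = tag_anomalies(scores)
```

In this toy run the worst-predicted frame receives the lowest regularity score and is the only one flagged. In practice the threshold would be chosen by sweeping it over a validation set rather than fixed in advance.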

Pages: 8