Spatio-temporal predictive tasks for abnormal event detection in videos

Cited by: 6
Authors
Naji, Yassine [1, 2]
Setkov, Aleksandr [1]
Loesch, Angelique [1]
Gouiffes, Michele [2]
Audigier, Romaric [1]
Affiliations
[1] Univ Paris Saclay, CEA, List, F-91120 Palaiseau, France
[2] Univ Paris Saclay, CNRS, LISN, F-91400 Orsay, France
Source
2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022) | 2022
DOI
10.1109/AVSS56176.2022.9959669
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Abnormal event detection in videos is a challenging problem, partly due to the multiplicity of abnormal patterns and the lack of corresponding annotations. In this paper, we propose new constrained pretext tasks to learn object-level normality patterns. Our approach consists of learning a mapping between down-scaled visual queries and their corresponding normal appearance and motion characteristics at the original resolution. The proposed tasks are more challenging than the reconstruction and future-frame prediction tasks widely used in the literature, since our model learns to jointly predict spatial and temporal features rather than reconstruct them. We believe that more constrained pretext tasks lead to better learning of normality patterns. Experiments on several benchmark datasets demonstrate the effectiveness of our approach in localizing and tracking anomalies: it outperforms or matches the current state of the art on spatio-temporal evaluation metrics.
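The pretext task described in the abstract (a down-scaled query mapped to full-resolution appearance and motion targets) can be illustrated with a small sketch. This is only an illustration under stated assumptions, not the authors' implementation: the record gives no architecture or loss details, so the function names (`downscale`, `motion_target`, `make_training_pair`, `anomaly_score`), the average-pooling down-scaling, the use of an absolute frame difference as a stand-in for a real motion representation, and the mean-squared-error score are all hypothetical choices.

```python
from typing import Dict, List, Tuple

Frame = List[List[float]]  # grayscale object crop as rows of pixel intensities


def downscale(crop: Frame, factor: int) -> Frame:
    """Average-pool a crop by `factor` to form the low-resolution query (assumed scheme)."""
    h, w = len(crop), len(crop[0])
    out: Frame = []
    for i in range(0, h - h % factor, factor):
        row = []
        for j in range(0, w - w % factor, factor):
            block = [crop[i + di][j + dj]
                     for di in range(factor) for dj in range(factor)]
            row.append(sum(block) / len(block))
        out.append(row)
    return out


def motion_target(prev_crop: Frame, crop: Frame) -> Frame:
    """Stand-in motion target: absolute frame difference (an assumption, not optical flow)."""
    return [[abs(a - b) for a, b in zip(r0, r1)]
            for r0, r1 in zip(prev_crop, crop)]


def make_training_pair(prev_crop: Frame, crop: Frame,
                       factor: int = 2) -> Tuple[Frame, Dict[str, Frame]]:
    """Pair a down-scaled query with full-resolution appearance and motion targets."""
    query = downscale(crop, factor)
    targets = {"appearance": crop, "motion": motion_target(prev_crop, crop)}
    return query, targets


def anomaly_score(predicted: Frame, target: Frame) -> float:
    """Mean squared error between a predicted map and the observed one (assumed score)."""
    n = sum(len(r) for r in target)
    return sum((p - t) ** 2
               for rp, rt in zip(predicted, target)
               for p, t in zip(rp, rt)) / n


# Toy 4x4 crops: a static black previous frame and a structured current frame.
prev_crop = [[0.0] * 4 for _ in range(4)]
crop = [[1.0, 1.0, 3.0, 3.0],
        [1.0, 1.0, 3.0, 3.0],
        [5.0, 5.0, 7.0, 7.0],
        [5.0, 5.0, 7.0, 7.0]]
query, targets = make_training_pair(prev_crop, crop)
```

In this sketch, a model trained only on normal data would take `query` as input and predict both entries of `targets`; at test time a large `anomaly_score` between prediction and observation would flag the object as abnormal.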
Pages: 8