Boosting Variational Inference With Margin Learning for Few-Shot Scene-Adaptive Anomaly Detection

Cited by: 11
Authors
Huang, Xin [1]
Hu, Yutao [1]
Luo, Xiaoyan [2]
Han, Jungong [3]
Zhang, Baochang [4, 5]
Cao, Xianbin [1, 6]
Affiliations
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
[3] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3FL, Wales
[4] Beihang Univ, Sch Inst Artificial Intelligence, Beijing 100191, Peoples R China
[5] Zhongguancun Lab, Beijing 100190, Peoples R China
[6] Beihang Univ, Key Lab Adv Technol Near Space Informat Syst, Minist Ind & Informat Technol China, Beijing 100191, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Few-shot scene-adaptive anomaly detection; conditional variational auto-encoder; margin learning embedding;
DOI
10.1109/TCSVT.2022.3227716
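Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract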
Anomaly detection in surveillance videos aims to identify frames in which abnormal events occur. Existing approaches assume that the training and testing videos come from the same scene, and they generalize poorly to unseen scenes. In this paper, we propose a Variational Anomaly Detection Network (VADNet) characterized by strong scene adaptation: it can identify abnormal events in a new scene by referring to only a few normal samples, without fine-tuning. Our model embodies two major innovations. First, a novel Variational Normal Inference (VNI) module formulates image reconstruction in a conditional variational auto-encoder (CVAE) framework, which learns a probabilistic decision model instead of a traditional deterministic one. Second, a Margin Learning Embedding (MLE) module is leveraged to boost the variational inference and aid in distinguishing normal events. We theoretically demonstrate that minimizing the triplet loss in the MLE module facilitates maximizing the evidence lower bound (ELBO) of the CVAE, which promotes the convergence of VNI. By combining variational inference with margin learning, VADNet becomes much more generative and is able to handle the uncertainty caused by the changed scene and limited reference data. Extensive experiments on several datasets demonstrate that the proposed VADNet adapts to a new scene effectively without fine-tuning, significantly outperforms other methods, and establishes a new state of the art in few-shot scene-adaptive anomaly detection. We believe our method is closer to real-world application due to its strong generalization ability. The code is released at https://github.com/huangxx156/VADNet.
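The abstract names two training objectives, the triplet loss of the MLE module and the ELBO of the CVAE, without stating them. As a rough illustration only, the sketch below writes out the textbook forms of both: a Euclidean triplet margin loss, and a negative ELBO made of a squared-error reconstruction term plus the KL divergence between diagonal Gaussians. The function names, the margin value, and the unit-variance Gaussian likelihood are assumptions made here for illustration; they are not taken from the paper or from the VADNet repository.

```python
import math


def squared_distance(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Textbook triplet loss: max(0, d(a, p) - d(a, n) + margin).

    The margin value is an illustrative assumption, not the paper's setting.
    """
    d_pos = math.sqrt(squared_distance(anchor, positive))
    d_neg = math.sqrt(squared_distance(anchor, negative))
    return max(0.0, d_pos - d_neg + margin)


def gaussian_kl(mu_q, logvar_q, mu_p, logvar_p):
    """KL(q || p) between diagonal Gaussians, summed over dimensions."""
    total = 0.0
    for mq, lq, mp, lp in zip(mu_q, logvar_q, mu_p, logvar_p):
        total += 0.5 * (lp - lq + (math.exp(lq) + (mq - mp) ** 2) / math.exp(lp) - 1.0)
    return total


def negative_elbo(x, x_recon, mu_q, logvar_q, mu_p, logvar_p):
    """Negative ELBO up to an additive constant.

    Assumes a unit-variance Gaussian likelihood, so the reconstruction
    term reduces to half the squared error (an assumption of this sketch).
    """
    reconstruction = 0.5 * squared_distance(x, x_recon)
    return reconstruction + gaussian_kl(mu_q, logvar_q, mu_p, logvar_p)


if __name__ == "__main__":
    # Toy embeddings: the positive lies close to the anchor, the negative far away.
    print(triplet_margin_loss([0.0, 0.0], [0.1, 0.0], [3.0, 4.0]))
    # Toy CVAE terms with a 1-D latent; the posterior mean is shifted from the prior.
    print(negative_elbo([1.0, 2.0], [0.9, 2.1], [0.5], [0.0], [0.0], [0.0]))
```

In a conditional setting the prior parameters (`mu_p`, `logvar_p`) would be predicted from the conditioning input, for example the few reference frames of the new scene, rather than fixed; how VADNet couples the two losses is described in the paper itself, not in this sketch.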
Pages: 2813-2825
Page count: 13