An informative dual ForkNet for video anomaly detection

被引：0

作者：

Li, Hongjun ^{[1
]}

Wang, Yunlong ^{[1
]}

Wang, Yating ^{[1
]}

Chen, Junjie ^{[1
]}

机构：

[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Jiangsu, Peoples R China

来源：

NEURAL NETWORKS | 2024年 / 179卷

关键词：

Video anomaly detection; Informetrics recalibration; ForkNet structure; EVENT DETECTION;

D O I：

10.1016/j.neunet.2024.106509

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An autoencoder for video anomaly detection task is a type of algorithm with the primary purpose of learning an "informative"representation of the normal data that can be used for identifying the abnormal data by learning to reconstruct a set of input observations. Based on the encoding-decoding structure, we explore a novel dual ForkNet architecture that can dissociate and process the spatio-temporal representation. It is well-known in the information theory community that most autoencoders coding processes are inevitably accompanied by a certain loss of information. In this dual ForkNet, we focus on mitigating the information loss problem and propose a novel architectural recalibration approach, which we term the "Informetrics Recalibration"(IR). It can adaptively recalibrate latent feature representation by explicitly modeling the similarity between the corresponding feature maps of encoder and decoder, and retain more useful semantic information to generate greater differentiation between normal and abnormal events. Additionally, because the structure of the autoencoder itself determines the difficulty to obtain deep semantic information, we introduce a Secondary Encoder (SE) in each ForkNet, so as to recalibrate target features responses of latent feature representation. Our model is easy to be trained and robust to be applied, because it basically consists of some ResNet blocks without using complicated modules. Extensive experiments on the five publicly available benchmarks show that our model outperforms the existing state-of-the-art architectures, demonstrating our framework's effectiveness.

引用

页数：14

共 53 条

[1] Robust real-time unusual event detection using multiple fixed-location monitors [J].

Adam, Amit ;

Rivlin, Ehud ;

Shimshoni, Ilan ;

Reinitz, David .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (03) :555-560

[2] Memory Matching Networks for One-Shot Image Recognition [J].

Cai, Qi ;

Pan, Yingwei ;

Yao, Ting ;

Yan, Chenggang ;

Mei, Tao .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4080-4088

[3] Context Recovery and Knowledge Retrieval: A Novel Two-Stream Framework for Video Anomaly Detection [J].

Cao, Congqi ;

Lu, Yue ;

Zhang, Yanning .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :1810-1825

[4] Video anomaly detection with spatio-temporal dissociation [J].

Chang, Yunpeng ;

Tu, Zhigang ;

Xie, Wei ;

Luo, Bin ;

Zhang, Shifu ;

Sui, Haigang ;

Yuan, Junsong .

PATTERN RECOGNITION, 2022, 122

[5] Clustering Driven Deep Autoencoder for Video Anomaly Detection [J].

Chang, Yunpeng ;

Tu, Zhigang ;

Xie, Wei ;

Yuan, Junsong .

COMPUTER VISION - ECCV 2020, PT XV, 2020, 12360 :329-345

[6] Deep Clustering via Joint Convolutional Autoencoder Embedding and Relative Entropy Minimization [J].

Dizaji, Kamran Ghasedi ;

Herandi, Amirhossein ;

Deng, Cheng ;

Cai, Weidong ;

Huang, Heng .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5747-5756

[7] Video anomaly detection and localization via Gaussian Mixture Fully Convolutional Variational Autoencoder [J].

Fan, Yaxiang ;

Wen, Gongjian ;

Li, Deren ;

Qiu, Shaohua ;

Levine, Martin D. ;

Xiao, Fei .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 195

[8] Dual Attention Network for Scene Segmentation [J].

Fu, Jun ;

Liu, Jing ;

Tian, Haijie ;

Li, Yong ;

Bao, Yongjun ;

Fang, Zhiwei ;

Lu, Hanqing .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149

[9] A Background-Agnostic Framework With Adversarial Training for Abnormal Event Detection in Video [J].

Georgescu, Mariana Iuliana ;

Ionescu, Radu Tudor ;

Khan, Fahad Shahbaz ;

Popescu, Marius ;

Shah, Mubarak .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) :4505-4523

[10] Learning Appearance-Motion Synergy via Memory-Guided Event Prediction for Video Anomaly Detection [J].

Guo, Chongye ;

Wang, Hongbo ;

Xia, Yingjie ;

Feng, Guorui .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) :1519-1531

← 1 2 3 4 5 6 →