FE-VAD: High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection

被引：1

作者：

Pi, Ruoyan ^{[1
]}

Xu, Jinglin ^{[2
]}

Peng, Yuxin ^{[1
]}

机构：

[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China

[2] Univ Sci & Technol Beijing, Sch Intelligence Sci & Technol, Beijing, Peoples R China

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Video anomaly detection; weakly supervised; learning; frequency domain analysis;

D O I：

10.1109/ICME57554.2024.10688326

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weakly Supervised Video Anomaly Detection (WSVAD) aims at identifying anomaly events in videos with videolevel labels instead of frame-level ones. Previous works usually focused on modeling anomalies in spatio-temporal domains. However, there are various forms of anomaly expressions, thus modeling them only in the spatio-temporal domain is insufficient. To address this issue and comprehensively capture the diverse forms of anomalies, we propose a new approach, High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection (FE-VAD), which introduces frequency domain information to capture and analyze anomaly features at different frequency levels, facilitating the learning of local and global spatio-temporal dependencies. Our FE-VAD is composed of a temporal strengthening network (TSN) and a high-low frequency enhancement network (HLFN). TSN is utilized to enhance the anomaly features in the traditional spatio-temporal domain, and HLFN decouples and adjusts high and low-frequency information spatially and temporally. In FE-VAD, frequency domain analysis offers a complementary perspective to describe anomalous events that are challenging to detect in traditional spatio-temporal domains. Extensive experiments show that our FE-VAD method achieves state-of-the-art results on three datasets: ShanghaiTech, UCFCrime, and XD-Violence.

引用

页数：6

共 17 条

[1] WEAKLY SUPERVISED VIDEO ANOMALY DETECTION BASED ON CROSS-BATCH CLUSTERING GUIDANCE [J].

Cao, Congqi ;

Zhang, Xin ;

Zhang, Shizhou ;

Wang, Peng ;

Zhang, Yanning .

2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, :2723-2728

[2] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].

Carreira, Joao ;

Zisserman, Andrew .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733

[3]

Chen YX, 2023, AAAI CONF ARTIF INTE, P387

[4] MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection [J].

Feng, Jia-Chang ;

Hong, Fa-Ting ;

Zheng, Wei-Shi .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14004-14013

[5] An anomaly-introduced learning method for abnormal event detection [J].

He, Chengkun ;

Shao, Jie ;

Sun, Jiayu .

MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) :29573-29588

[6]

Li S, 2022, AAAI CONF ARTIF INTE, P1395

[7] Future Frame Prediction for Anomaly Detection - A New Baseline [J].

Liu, Wen ;

Luo, Weixin ;

Lian, Dongze ;

Gao, Shenghua .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6536-6545

[8] Global Spectral Filter Memory Network for Video Object Segmentation [J].

Liu, Yong ;

Yu, Ran ;

Wang, Jiahao ;

Zhao, Xinyuan ;

Wang, Yitong ;

Tang, Yansong ;

Yang, Yujiu .

COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 :648-665

[9] Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection [J].

Lv, Hui ;

Yue, Zhongqi ;

Sun, Qianru ;

Luo, Bin ;

Cui, Zhen ;

Zhang, Hanwang .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :8022-8031

[10]

Patro Badri N, 2023, arXiv

← 1 2 →