FE-VAD: High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection

被引:1
作者
Pi, Ruoyan [1 ]
Xu, Jinglin [2 ]
Peng, Yuxin [1 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Intelligence Sci & Technol, Beijing, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Video anomaly detection; weakly supervised; learning; frequency domain analysis;
D O I
10.1109/ICME57554.2024.10688326
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly Supervised Video Anomaly Detection (WSVAD) aims at identifying anomaly events in videos with videolevel labels instead of frame-level ones. Previous works usually focused on modeling anomalies in spatio-temporal domains. However, there are various forms of anomaly expressions, thus modeling them only in the spatio-temporal domain is insufficient. To address this issue and comprehensively capture the diverse forms of anomalies, we propose a new approach, High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection (FE-VAD), which introduces frequency domain information to capture and analyze anomaly features at different frequency levels, facilitating the learning of local and global spatio-temporal dependencies. Our FE-VAD is composed of a temporal strengthening network (TSN) and a high-low frequency enhancement network (HLFN). TSN is utilized to enhance the anomaly features in the traditional spatio-temporal domain, and HLFN decouples and adjusts high and low-frequency information spatially and temporally. In FE-VAD, frequency domain analysis offers a complementary perspective to describe anomalous events that are challenging to detect in traditional spatio-temporal domains. Extensive experiments show that our FE-VAD method achieves state-of-the-art results on three datasets: ShanghaiTech, UCFCrime, and XD-Violence.
引用
收藏
页数:6
相关论文
共 17 条
[1]   WEAKLY SUPERVISED VIDEO ANOMALY DETECTION BASED ON CROSS-BATCH CLUSTERING GUIDANCE [J].
Cao, Congqi ;
Zhang, Xin ;
Zhang, Shizhou ;
Wang, Peng ;
Zhang, Yanning .
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, :2723-2728
[2]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[3]  
Chen YX, 2023, AAAI CONF ARTIF INTE, P387
[4]   MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection [J].
Feng, Jia-Chang ;
Hong, Fa-Ting ;
Zheng, Wei-Shi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14004-14013
[5]   An anomaly-introduced learning method for abnormal event detection [J].
He, Chengkun ;
Shao, Jie ;
Sun, Jiayu .
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) :29573-29588
[6]  
Li S, 2022, AAAI CONF ARTIF INTE, P1395
[7]   Future Frame Prediction for Anomaly Detection - A New Baseline [J].
Liu, Wen ;
Luo, Weixin ;
Lian, Dongze ;
Gao, Shenghua .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6536-6545
[8]   Global Spectral Filter Memory Network for Video Object Segmentation [J].
Liu, Yong ;
Yu, Ran ;
Wang, Jiahao ;
Zhao, Xinyuan ;
Wang, Yitong ;
Tang, Yansong ;
Yang, Yujiu .
COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 :648-665
[9]   Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection [J].
Lv, Hui ;
Yue, Zhongqi ;
Sun, Qianru ;
Luo, Bin ;
Cui, Zhen ;
Zhang, Hanwang .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :8022-8031
[10]  
Patro Badri N, 2023, arXiv