FE-VAD: High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection

被引:1
作者
Pi, Ruoyan [1 ]
Xu, Jinglin [2 ]
Peng, Yuxin [1 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Intelligence Sci & Technol, Beijing, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Video anomaly detection; weakly supervised; learning; frequency domain analysis;
D O I
10.1109/ICME57554.2024.10688326
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly Supervised Video Anomaly Detection (WSVAD) aims at identifying anomaly events in videos with videolevel labels instead of frame-level ones. Previous works usually focused on modeling anomalies in spatio-temporal domains. However, there are various forms of anomaly expressions, thus modeling them only in the spatio-temporal domain is insufficient. To address this issue and comprehensively capture the diverse forms of anomalies, we propose a new approach, High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection (FE-VAD), which introduces frequency domain information to capture and analyze anomaly features at different frequency levels, facilitating the learning of local and global spatio-temporal dependencies. Our FE-VAD is composed of a temporal strengthening network (TSN) and a high-low frequency enhancement network (HLFN). TSN is utilized to enhance the anomaly features in the traditional spatio-temporal domain, and HLFN decouples and adjusts high and low-frequency information spatially and temporally. In FE-VAD, frequency domain analysis offers a complementary perspective to describe anomalous events that are challenging to detect in traditional spatio-temporal domains. Extensive experiments show that our FE-VAD method achieves state-of-the-art results on three datasets: ShanghaiTech, UCFCrime, and XD-Violence.
引用
收藏
页数:6
相关论文
共 17 条
[11]   Not only Look, But Also Listen: Learning Multimodal Violence Detection Under Weak Supervision [J].
Wu, Peng ;
Liu, Jing ;
Shi, Yujia ;
Sun, Yujia ;
Shao, Fangtao ;
Wu, Zhaoyang ;
Yang, Zhiwei .
COMPUTER VISION - ECCV 2020, PT XXX, 2020, 12375 :322-339
[12]  
Pu Yuwen, 2023, ARXIV
[13]   A Survey of Single-Scene Video Anomaly Detection [J].
Ramachandra, Bharathkumar ;
Jones, Michael J. ;
Vatsavai, Ranga Raju .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) :2293-2312
[14]   Real-world Anomaly Detection in Surveillance Videos [J].
Sultani, Waqas ;
Chen, Chen ;
Shah, Mubarak .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6479-6488
[15]   Long-Short Temporal Co-Teaching for Weakly Supervised Video Anomaly Detection [J].
Sun, Shengyang ;
Gong, Xiaojin .
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, :2711-2716
[16]   LIGHTWEIGHT DUAL-TASK NETWORKS FOR CROWD COUNTING IN AERIAL IMAGES [J].
Tian, Ye ;
Duan, Chengzhen ;
Zhang, Ruilin ;
Wei, Zhiwei ;
Wang, Hongpeng .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :1975-1979
[17]   Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection [J].
Zhong, Jia-Xing ;
Li, Nannan ;
Kong, Weijie ;
Liu, Shan ;
Li, Thomas H. ;
Li, Ge .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1237-1246