FE-VAD: High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection

被引：1

作者：

Pi, Ruoyan ^{[1
]}

Xu, Jinglin ^{[2
]}

Peng, Yuxin ^{[1
]}

机构：

[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China

[2] Univ Sci & Technol Beijing, Sch Intelligence Sci & Technol, Beijing, Peoples R China

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Video anomaly detection; weakly supervised; learning; frequency domain analysis;

D O I：

10.1109/ICME57554.2024.10688326

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weakly Supervised Video Anomaly Detection (WSVAD) aims at identifying anomaly events in videos with videolevel labels instead of frame-level ones. Previous works usually focused on modeling anomalies in spatio-temporal domains. However, there are various forms of anomaly expressions, thus modeling them only in the spatio-temporal domain is insufficient. To address this issue and comprehensively capture the diverse forms of anomalies, we propose a new approach, High-Low Frequency Enhanced Weakly Supervised Video Anomaly Detection (FE-VAD), which introduces frequency domain information to capture and analyze anomaly features at different frequency levels, facilitating the learning of local and global spatio-temporal dependencies. Our FE-VAD is composed of a temporal strengthening network (TSN) and a high-low frequency enhancement network (HLFN). TSN is utilized to enhance the anomaly features in the traditional spatio-temporal domain, and HLFN decouples and adjusts high and low-frequency information spatially and temporally. In FE-VAD, frequency domain analysis offers a complementary perspective to describe anomalous events that are challenging to detect in traditional spatio-temporal domains. Extensive experiments show that our FE-VAD method achieves state-of-the-art results on three datasets: ShanghaiTech, UCFCrime, and XD-Violence.

引用

页数：6

共 17 条

[11] Not only Look, But Also Listen: Learning Multimodal Violence Detection Under Weak Supervision [J].

Wu, Peng ;

Liu, Jing ;

Shi, Yujia ;

Sun, Yujia ;

Shao, Fangtao ;

Wu, Zhaoyang ;

Yang, Zhiwei .

COMPUTER VISION - ECCV 2020, PT XXX, 2020, 12375 :322-339

[12]

Pu Yuwen, 2023, ARXIV

[13] A Survey of Single-Scene Video Anomaly Detection [J].

Ramachandra, Bharathkumar ;

Jones, Michael J. ;

Vatsavai, Ranga Raju .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) :2293-2312

[14] Real-world Anomaly Detection in Surveillance Videos [J].

Sultani, Waqas ;

Chen, Chen ;

Shah, Mubarak .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6479-6488

[15] Long-Short Temporal Co-Teaching for Weakly Supervised Video Anomaly Detection [J].

Sun, Shengyang ;

Gong, Xiaojin .

2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, :2711-2716

[16] LIGHTWEIGHT DUAL-TASK NETWORKS FOR CROWD COUNTING IN AERIAL IMAGES [J].

Tian, Ye ;

Duan, Chengzhen ;

Zhang, Ruilin ;

Wei, Zhiwei ;

Wang, Hongpeng .

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :1975-1979

[17] Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection [J].

Zhong, Jia-Xing ;

Li, Nannan ;

Kong, Weijie ;

Liu, Shan ;

Li, Thomas H. ;

Li, Ge .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1237-1246

← 1 2 →