Few-shot fine-tuning with auxiliary tasks for video anomaly detection

被引：0

作者：

Lv, Jing ^{[1
]}

Liu, Zhi ^{[1
,2
]}

Li, Gongyang ^{[1
,2
,3
]}

机构：

[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai, Peoples R China

[2] Shanghai Univ, Wenzhou Inst, Wenzhou, Peoples R China

[3] Yunnan Univ Finance & Econ, Yunnan Key Lab Serv Comp, Kunming, Peoples R China

来源：

MULTIMEDIA SYSTEMS | 2025年 / 31卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Video anomaly detection; Fine-tuning; Segmentation; Optical flow;

D O I：

10.1007/s00530-025-01706-8

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Anomaly detection in surveillance videos aims to identify video frames that exhibit unexpected behavior. Most existing methods follow an unsupervised setup, training with normal videos and testing with videos from the same scene. However, in real-world deployments, the performance of existing models significantly degrades when faced with unseen scenes. To address this issue, we introduce the auxiliary tasks of segmentation and optical flow estimation into the fine-tuning process, proposing a novel Segmentation and Optical Flow Fine-tuning (SOFF) framework. This framework enables the existing models to adapt to new scenes with only a few samples for fine-tuning. To integrate these auxiliary tasks, we design a Segmentation and Flow Output Network (SFO-Net). SFO-Net enhances fine-tuning performance in unseen scenes by extracting rich shape and motion information through the execution of auxiliary tasks during the fine-tuning process. Additionally, SFO-Net can be flexibly cascaded with existing models that output images to form the SOFF framework. Experiments on multiple datasets demonstrate that our framework improves the performance of existing models when faced unseen scenes through few-shot scenes fine-tuning and achieves competitive performance.

引用

页数：14

共 43 条

[1] UBnormal: New Benchmark for Supervised Open-Set Video Anomaly Detection [J].

Acsintoae, Andra ;

Florescu, Andrei ;

Georgescu, Mariana-Iuliana ;

Mare, Tudor ;

Sumedrea, Paul ;

Ionescu, Radu Tudor ;

Khan, Fahad Shahbaz ;

Shah, Mubarak .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :20111-20121

[2]

Basharat A, 2008, PROC CVPR IEEE, P1301

[3] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

[4]

Dollar P., 2005, Proceedings. 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS) (IEEE Cat. No. 05EX1178), P65

[5] Global birdsong embeddings enable superior transfer learning for bioacoustic classification [J].

Ghani, Burooj ;

Denton, Tom ;

Kahl, Stefan ;

Klinck, Holger .

SCIENTIFIC REPORTS, 2023, 13 (01)

[6] Memorizing Normality to Detect Anomaly: Memory-augmented Deep Autoencoder for Unsupervised Anomaly Detection [J].

Gong, Dong ;

Liu, Lingqiao ;

Le, Vuong ;

Saha, Budhaditya ;

Mansour, Moussa Reda ;

Venkatesh, Svetha ;

van den Hengel, Anton .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1705-1714

[7] Coordinate Attention for Efficient Mobile Network Design [J].

Hou, Qibin ;

Zhou, Daquan ;

Feng, Jiashi .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13708-13717

[8]

Hu J, 2018, ADV NEUR IN, V31

[9]

Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]

[10] Adaptive Anomaly Detection Network for Unseen Scene Without Fine-Tuning [J].

Hu, Yutao ;

Huang, Xin ;

Luo, Xiaoyan .

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 :311-323

← 1 2 3 4 5 →