Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model

被引：9

作者：

Wu, Peng ^{[1
]}

Liu, Jing ^{[2
]}

He, Xiangteng ^{[3
]}

Peng, Yuxin ^{[3
]}

Wang, Peng ^{[1
]}

Zhang, Yanning ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Natl Engn Lab Integrated Aerosp Ground Ocean Big D, Xian 710060, Peoples R China

[2] Xidian Univ, Guangzhou Inst Technol, Guangzhou 510555, Peoples R China

[3] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100871, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024年 / 33卷

基金：

中国国家自然科学基金;

关键词：

Video anomaly retrieval; video anomaly detection; cross-modal retrieval;

D O I：

10.1109/TIP.2024.3374070

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video anomaly detection (VAD) has been paid increasing attention due to its potential applications, its current dominant tasks focus on online detecting anomalies, which can be roughly interpreted as the binary or multiple event classification. However, such a setup that builds relationships between complicated anomalous events and single labels, e.g., "vandalism", is superficial, since single labels are deficient to characterize anomalous events. In reality, users tend to search a specific video rather than a series of approximate videos. Therefore, retrieving anomalous events using detailed descriptions is practical and positive but few researches focus on this. In this context, we propose a novel task called Video Anomaly Retrieval (VAR), which aims to pragmatically retrieve relevant anomalous videos by cross-modalities, e.g., language descriptions and synchronous audios. Unlike the current video retrieval where videos are assumed to be temporally well-trimmed with short duration, VAR is devised to retrieve long untrimmed videos which may be partially relevant to the given query. To achieve this, we present two large-scale VAR benchmarks and design a model called Anomaly-Led Alignment Network (ALAN) for VAR. In ALAN, we propose an anomaly-led sampling to focus on key segments in long untrimmed videos. Then, we introduce an efficient pretext task to enhance semantic associations between video-text fine-grained representations. Besides, we leverage two complementary alignments to further match cross-modal contents. Experimental results on two benchmarks reveal the challenges of VAR task and also demonstrate the advantages of our tailored method. Captions are publicly released at https://github.com/Roc-Ng/VAR.

引用

页码：2213 / 2225

页数：13

共 50 条

[21] Generate anomalies from normal: a partial pseudo-anomaly augmented approach for video anomaly detection
Dang, Yuanjie
Chen, Jiangyun
Chen, Peng
Gao, Nan
Huan, Ruohong
Zhao, Dongdong
VISUAL COMPUTER, 2024, : 3843 - 3852
[22] Video Anomaly Detection based on Deep Generative Network
Saypadith, Savath
Onoye, Takao
2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
[23] A Fine Grained Quality Assessment of Video Anomaly Detection
Zhou, Jiang
McGuinness, Kevin
Antony, Joseph
O'Connor, Noel E.
19TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2022, 2022, : 29 - 35
[24] Video anomaly detection with spatio-temporal dissociation
Chang, Yunpeng
Tu, Zhigang
Xie, Wei
Luo, Bin
Zhang, Shifu
Sui, Haigang
Yuan, Junsong
PATTERN RECOGNITION, 2022, 122
[25] MUTUALITY ATTRIBUTE MAKES BETTER VIDEO ANOMALY DETECTION
Han, Xingshuo
Wang, Xiao
Jiang, Kui
Liu, Wei
Hu, Ruimin
Pan, Xuefeng
Xu, Xin
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2670 - 2674
[26] Anomaly Detection With Particle Filtering for Online Video Surveillance
Ata-Ur-Rehman
Tariq, Sameema
Farooq, Haroon
Jaleel, Abdul
Wasif, Syed Muhammad
IEEE ACCESS, 2021, 9 : 19457 - 19468
[27] Video Anomaly Detection Framework Based on Motion Consistency
Zhao, Caidan
Li, Xiang
Gao, Chenxing
Wu, Zhiqiang
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 802 - 807
[28] Conjoined triple deep network for video anomaly detection
Chang, Xingya
Wu, Yunhe
Deng, Shizhuo
Jia, Tong
Chen, Dongyue
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (20) : 59491 - 59518
[29] Future Frame Prediction Network for Video Anomaly Detection
Luo, Weixin
Liu, Wen
Lian, Dongze
Gao, Shenghua
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 7505 - 7520
[30] Video Anomaly Detection via Visual Cloze Tests
Yu, Guang
Wang, Siqi
Cai, Zhiping
Liu, Xinwang
Zhu, En
Yin, Jianping
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 4955 - 4969

← 1 2 3 4 5 →