Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model

Cited by: 9
Authors
Wu, Peng [1]
Liu, Jing [2]
He, Xiangteng [3]
Peng, Yuxin [3]
Wang, Peng [1]
Zhang, Yanning [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Natl Engn Lab Integrated Aerosp Ground Ocean Big D, Xian 710060, Peoples R China
[2] Xidian Univ, Guangzhou Inst Technol, Guangzhou 510555, Peoples R China
[3] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100871, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Video anomaly retrieval; video anomaly detection; cross-modal retrieval
DOI
10.1109/TIP.2024.3374070
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Video anomaly detection (VAD) has received increasing attention due to its potential applications. Its currently dominant tasks focus on detecting anomalies online, which can be roughly interpreted as binary or multi-class event classification. However, such a setup, which builds relationships between complicated anomalous events and single labels, e.g., "vandalism", is superficial, since single labels are insufficient to characterize anomalous events. In reality, users tend to search for a specific video rather than a series of approximate videos. Therefore, retrieving anomalous events using detailed descriptions is practical and valuable, but little research focuses on this. In this context, we propose a novel task called Video Anomaly Retrieval (VAR), which aims to pragmatically retrieve relevant anomalous videos via cross-modal queries, e.g., language descriptions and synchronous audio. Unlike current video retrieval, where videos are assumed to be temporally well-trimmed and of short duration, VAR is devised to retrieve long untrimmed videos that may be only partially relevant to the given query. To achieve this, we present two large-scale VAR benchmarks and design a model called Anomaly-Led Alignment Network (ALAN) for VAR. In ALAN, we propose anomaly-led sampling to focus on key segments in long untrimmed videos. Then, we introduce an efficient pretext task to enhance semantic associations between fine-grained video-text representations. In addition, we leverage two complementary alignments to further match cross-modal content. Experimental results on the two benchmarks reveal the challenges of the VAR task and also demonstrate the advantages of our tailored method. Captions are publicly released at https://github.com/Roc-Ng/VAR.
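The abstract names two ideas that a small sketch may make more concrete: sampling segments of a long video with a bias toward anomalous ones, and ranking videos against a query in a shared embedding space. The following is an illustration only and is not the ALAN implementation. The function names, the softmax weighting, the `temperature` parameter, and the use of cosine similarity are all assumptions made for this example; the abstract does not specify any of them.

```python
import math
import random


def anomaly_led_sample(scores, k, temperature=1.0, seed=0):
    """Pick k distinct segment indices, favouring high anomaly scores.

    Hypothetical helper: softmax-weighted sampling without replacement
    is an assumption, not the procedure described in the paper.
    """
    if not 0 < k <= len(scores):
        raise ValueError("k must be between 1 and the number of segments")
    rng = random.Random(seed)
    top = max(scores)
    # Softmax-style weights, shifted by the maximum for numerical stability.
    weights = [math.exp((s - top) / temperature) for s in scores]
    remaining = list(range(len(scores)))
    chosen = []
    for _ in range(k):
        pick = rng.choices(remaining, weights=[weights[i] for i in remaining], k=1)[0]
        chosen.append(pick)
        remaining.remove(pick)
    # Return indices in temporal order.
    return sorted(chosen)


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_videos(query_vec, video_vecs):
    """Return video indices ordered from most to least similar to the query."""
    sims = [cosine(query_vec, v) for v in video_vecs]
    return sorted(range(len(video_vecs)), key=lambda i: sims[i], reverse=True)


if __name__ == "__main__":
    # Toy per-segment anomaly scores for one untrimmed video (made-up values).
    segment_scores = [0.05, 0.10, 0.90, 0.85, 0.07, 0.02]
    print(anomaly_led_sample(segment_scores, k=3, temperature=0.1))

    # Toy 2-D embeddings standing in for a text query and three videos.
    print(rank_videos([1.0, 0.0], [[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]))
```

In this sketch a lower `temperature` concentrates the sample on the highest-scoring segments, while a higher one approaches uniform sampling. A real system would obtain the segment scores and the embeddings from trained models rather than from hand-written lists.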
Pages: 2213-2225
Number of pages: 13