Spatio-temporal graph-based self-labeling for video anomaly detection

被引:0
|
作者
Xing, Meng [1 ,2 ]
Feng, Zhiyong [3 ]
Su, Yong [4 ]
Zhang, Yiming [3 ]
Oh, Changjae [5 ]
Gribova, Valeriya [6 ]
Filaretoy, Vladimir Fedorovich [6 ]
Huang, Deshuang [1 ,7 ]
机构
[1] Ningbo Inst Digital Twin, Eastern Inst Technol, 568 Tongxin Rd,Zhuangshi St, Ningbo 315201, Zhejiang, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, 96 JinZhai Rd, Hefei 230026, Anhui, Peoples R China
[3] Tianjin Univ, Coll Intelligence & Comp, 135 Yaguan Rd,Haihe Educ Pk, Tianjin 300350, Peoples R China
[4] Tianjin Normal Univ, Tianjin Key Lab Wireless Mobile Commun & Power Tra, 393 Binshui West Rd, Tianjin 300387, Peoples R China
[5] Queen Mary Univ London, Ctr Intelligent Sensing, Mile End Rd, London E1 4NS, England
[6] Russian Acad Sci, Inst Automat & Control Proc, Far Eastern Branch, Radio St 5, Vladivostok 690041, Primorsky Krai, Russia
[7] Shanghai East Hosp, Inst Regenerat Med, 150 Jimo Rd, Shanghai 200120, Peoples R China
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
VAD; ST-graph; Self-labeling; Not-normal space; Object-level criterion; ABNORMAL EVENT DETECTION; CONVOLUTIONAL NETWORKS;
D O I
10.1016/j.neucom.2025.129576
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video anomaly detection (VAD) aims to identify abnormal events in a video sequence. Existing methods achieve VAD by learning the decision boundary between the normal space and the abnormal space pre-defined in the training data. However, these methods trend to neglect the distribution gap between the pre-defined abnormal space and the real one, which lead to overfitting on the normal space or bias toward the pre-defined abnormal space. In this paper, we propose a spatio-temporal graph-based self-labeling method that not only focuses on the pre-defined abnormal space but considers the real abnormal space, enabling it to capture the decision boundary between the normal space and a complementary space, called as the not-normal space. We first construct a spatio-temporal graph (ST-Graph) based on the objects of input video and utilize a spatio-temporal graph convolution network (ST-GCN) to model the interaction between objects. We then propose a self-labeling- based learning mechanism that encourages the proposed ST-GCN to record the normal events while abstaining from labeling the pseudo-abnormal events, thereby aggregating the pre-defined and real abnormal spaces into not-normal space. To evaluate the model performance on localizing anomalous objects and capturing interactions between objects, we further introduce an object-level criterion that bridges frame-level and pixel- level criteria. Our method is validated on three datasets and achieves state-of-the-art frame-level AUC results on Avenue (92.5%), and outperforms existing ST-Graph-based methods on UCSD Ped2 (96.5%) and ShanghaiTech (76.8%).
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Anomaly detection for key performance indicators by fusing self-supervised spatio-temporal graph attention networks
    Chen, Ningjiang
    Tu, Huan
    Zeng, Haoyang
    Ou, Yangjie
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [42] Analyzing Shared Bike Usage Through Graph-Based Spatio-Temporal Modeling
    Cuong, Dinh Viet
    Ngo, Vuong M.
    Cappellari, Paolo
    Roantree, Mark
    IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 5 : 115 - 131
  • [43] Spatio-Temporal Heterogeneous Graph-Based Convolutional Networks for Traffic Flow Forecasting
    Ma, Zhaobin
    Lv, Zhiqiang
    Xin, Xiaoyang
    Cheng, Zesheng
    Xia, Fengqian
    Li, Jianbo
    TRANSPORTATION RESEARCH RECORD, 2024, 2678 (08) : 120 - 133
  • [44] Graph-based approach for human action recognition using spatio-temporal features
    Ben Aoun, Najib
    Mejdoub, Mahmoud
    Ben Amar, Chokri
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (02) : 329 - 338
  • [45] A Graph-based Approach to Video Anomaly Detection from the Perspective of Superpixels
    Siemon, Mia
    Nasrollahi, Kamal
    Moeslund, Thomas B.
    FIFTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION, ICMV 2022, 2023, 12701
  • [46] Exploring the Spatio-Temporal Aware Graph for video captioning
    Xue, Ping
    Zhou, Bing
    IET COMPUTER VISION, 2022, 16 (05) : 456 - 467
  • [47] Spatio-Temporal Anomaly Detection with Graph Networks for Data Quality Monitoring of the Hadron Calorimeter
    Asres, Mulugeta Weldezgina
    Omlin, Christian Walter
    Wang, Long
    Yu, David
    Parygin, Pavel
    Dittmann, Jay
    Karapostoli, Georgia
    Seidel, Markus
    Venditti, Rosamaria
    Lambrecht, Luka
    Usai, Emanuele
    Ahmad, Muhammad
    Menendez, Javier Fernandez
    Maeshima, Kaori
    SENSORS, 2023, 23 (24)
  • [48] A Spatio-Temporal Enhanced Graph-Transformer AutoEncoder embedded pose for anomaly detection
    Zhu, Honglei
    Wei, Pengjuan
    Xu, Zhigang
    IET COMPUTER VISION, 2024, 18 (03) : 405 - 419
  • [49] Normal data based rotating machine anomaly detection using CNN with self-labeling
    Bae, Jaewoong
    Jung, Wonho
    Park, Yong-Hwa
    SMART STRUCTURES AND SYSTEMS, 2022, 29 (06) : 757 - 766
  • [50] VIDEO ANOMALY DETECTION VIA PREDICTION NETWORK WITH ENHANCED SPATIO-TEMPORAL MEMORY EXCHANGE
    Shen, Guodong
    Ouyang, Yuqi
    Sanchez, Victor
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3728 - 3732