MC-MIL: video surveillance anomaly detection with multi-instance learning and multiple overlapped cameras

被引:1
作者
Pereira S.S.L. [1 ,2 ]
Maia J.E.B. [2 ]
机构
[1] Federal Institute of Education, Science and Technology of Ceará - IFCE, CE, Aracati
[2] State University of Ceará - UECE, CE, Fortaleza
关键词
Anomaly detection; Intelligent video surveillance; Multiple cameras; Multiple instance learning;
D O I
10.1007/s00521-024-09611-3
中图分类号
学科分类号
摘要
Anomaly detection approaches have limiting aspects regarding the representativeness of the information since the video data is captured from a single perspective and may not distinguish all relevant aspects of the scene. The lack of sufficient labeled data is also a challenging aspect of building video anomaly detection approaches. Although multiple instance learning (MIL) has been explored extensively in the weakly supervised video anomaly detection (WS-VAD) literature since it is less hungry for labeled data, there are no studies that exploit multiple overlapping camera views to provide wider representativeness of vision data under MIL assumption. In this work, we show the performance of the video anomaly detection task can be improved by using multiple cameras to capture spatiotemporal information from different perspectives. We propose the approach MC-MIL (Video Anomaly Detection with Multiple Overlapped Cameras and Multiple Instance Learning) framework, which consists of a training scheme with multiple cameras under multiple instance learning for video anomaly detection. We specialize our proposed framework for the two-camera case as a proof of concept for performance evaluation. Due to the lack of datasets for this task, we relabeled the multiple-camera PETS-2009 benchmark dataset for the anomaly detection task from multiple overlapped camera views to evaluate the MC-MIL algorithm. The result shows a significant performance improvement in the AUC ROC score compared to the single-camera configuration and with the literature. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:10527 / 10543
页数:16
相关论文
共 28 条
[1]  
Deepak K., Srivathsan G., Roshan S., Chandrakala S., Deep multi-view representation learning for video anomaly detection using spatiotemporal autoencoders, Circ Syst Signal Process, 40, 3, pp. 1333-1349, (2021)
[2]  
Shreyas D., Raksha S., Prasad B., Implementation of an anomalous human activity recognition system, SN Comput Sci, 1, pp. 1-10, (2020)
[3]  
Feng J.-C., Hong F.-T., Zheng W.-S., Mist: Multiple instance self-training framework for video anomaly detection, In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14009-14018, (2021)
[4]  
Asad M., Jiang H., Yang J., Tu E., Malik A.A., Multi-stream 3d latent feature clustering for abnormality detection in videos, Appl Intell, 52, 1, pp. 1126-1143, (2022)
[5]  
Ren J., Xia F., Liu Y., Lee I., Deep video anomaly detection: Opportunities and challenges, In: 2021 International Conference on Data Mining Workshops (ICDMW), pp. 959-966, (2021)
[6]  
Kamoona A.M., Gosta A.K., Bab-Hadiashar A., Hoseinnezhad R., Multiple instance-based video anomaly detection using deep temporal encoding-decoding., (2020)
[7]  
Wan B., Fang Y., Xia X., Mei J., Weakly supervised video anomaly detection via center-guided discriminative learning, In: 2020 IEEE International Conference on Multimedia and Expo (ICME), pp. 1-6, (2020)
[8]  
Lv H., Yue Z., Sun Q., Luo B., Cui Z., Zhang H., Unbiased multiple instance learning for weakly supervised video anomaly detection, In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8022-8031, (2023)
[9]  
Herrera F., Ventura S., Bello R., Cornelis C., Zafra A., Sanchez-Tarrago D., Vluymans S., Multiple instance learning, pp. 17-33, (2016)
[10]  
Pehlivan S., Duygulu P., A new pose-based representation for recognizing actions from multiple cameras, Comput Vis Image Underst, 115, 2, pp. 140-151, (2011)