Multi-Task Learning for Video Surveillance with Limited Data

被引：17

作者：

Doshi, Keval ^{[1
]}

Yilmaz, Yasin ^{[1
]}

机构：

[1] Univ S Florida, 4202 E Fowler Ave, Tampa, FL 33620 USA

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022 | 2022年

关键词：

ANOMALY DETECTION;

D O I：

10.1109/CVPRW56347.2022.00434

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Learning from limited data in video surveillance is important for sustainable performance while adapting to new information in a scene over time or adapting to a different scene. In a real-world scene, for an anomaly detection algorithm, all possible nominal patterns and behaviors are not typically available immediately for a single training session. In contrast, labeled nominal data patterns may become available irregularly over a long time horizon, and the anomaly detection algorithm needs to quickly learn such new patterns from limited samples for acceptable performance. Otherwise, it would suffer from frequent false alarms. Additionally, the anomaly detection algorithm needs to continually learn new nominal patterns in multiple training sessions without forgetting the previous knowledge and losing performance. Cross-domain adaptability (i.e., transfer learning to another surveillance scene) is another task where the anomaly detection algorithm has to quickly learn from limited nominal training data to achieve acceptable performance. To overcome these challenges, we design a modular framework and use it to extract semantic embeddings, which we then train on by using deep metric learning. Particularly, we study these three problems (few-shot learning, continual learning, cross-domain adaptability) in a multi-task learning setting. We also compare our proposed framework to existing state-of-the-art approaches using various evaluation metrics. The empirical results indicate that the proposed approach is able to outperform the existing approaches on all three tasks for three benchmark datasets.

引用

页码：3888 / 3898

页数：11

共 54 条

[21] Making sense: data, publics and storytelling INTRODUCTION [J].

Holland, Kate ;

Park, Sora .

COMMUNICATION RESEARCH AND PRACTICE, 2020, 6 (01) :1-2

[22] A Markov Clustering Topic Model for Mining Behaviour in Video [J].

Hospedales, Timothy ;

Gong, Shaogang ;

Xiang, Tao .

2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :1165-1172

[23]

Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]

[24] Histograms of Optical Flow Orientation and Magnitude and Entropy to Detect Anomalous Events in Videos [J].

Hugo Mora Colque, Rensso Victor ;

Caetano, Carlos ;

Lustosa de Andrade, Matheus Toledo ;

Schwartz, William Robson .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (03) :673-682

[25] Detecting abnormal events in video using Narrowed Normality Clusters [J].

Ionescu, Radu Tudor ;

Smeureanu, Sorina ;

Popescu, Marius ;

Alexe, Bogdan .

2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1951-1960

[26]

Jaechul Kim, 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P2921, DOI 10.1109/CVPRW.2009.5206569

[27] Overcoming catastrophic forgetting in neural networks [J].

Kirkpatricka, James ;

Pascanu, Razvan ;

Rabinowitz, Neil ;

Veness, Joel ;

Desjardins, Guillaume ;

Rusu, Andrei A. ;

Milan, Kieran ;

Quan, John ;

Ramalho, Tiago ;

Grabska-Barwinska, Agnieszka ;

Hassabis, Demis ;

Clopath, Claudia ;

Kumaran, Dharshan ;

Hadsell, Raia .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (13) :3521-3526

[28]

Koch G., 2015, ICML DEEP LEARN WORK, V2

[29]

Kratz L, 2009, PROC CVPR IEEE, P1446, DOI 10.1109/CVPRW.2009.5206771

[30] Evaluating Real-time Anomaly Detection Algorithms - the Numenta Anomaly Benchmark [J].

Lavin, Alexander ;

Ahmad, Subutai .

2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, :38-44

← 1 2 3 4 5 6 →