Online Action Detection in Surveillance Scenarios: A Comprehensive Review and Comparative Study of State-of-the-Art Multi-Object Tracking Methods

被引:10
作者
Alikhanov, Jumabek [1 ]
Kim, Hakil [1 ]
机构
[1] Inha Univ, Dept Elect & Comp Engn, Incheon 402751, South Korea
基金
新加坡国家研究基金会;
关键词
~Video surveillance; object detection; multi-object tracking; action detection; deep learning; computer vision;
D O I
10.1109/ACCESS.2023.3292539
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Online action detection in surveillance scenarios presents considerable challenges, particularly due to the dynamically changing environments and real-time processing requirements. Within this context, Multi-Object Tracking (MOT) serves as a critical component of the online action detection pipeline. Despite the emergence of several state-of-the-art (SOTA) object trackers in recent years, a gap remains in the comprehensive evaluation of these trackers specifically for action detection in surveillance scenarios. This paper bridges this gap by offering a thorough study of SOTA MOT trackers, aimed at determining the influential factors affecting their performance in surveillance settings and identifying the trackers optimally suited for an online action detection pipeline. For relevance and rigor, we introduce SurvTrack, a new dataset derived from a subset of VIRAT-dataset explicitly designed for action detection tasks-but intended for object tracking. SurvTrack is utilized to assess these trackers under various conditions, including differing image resolutions and detector confidence thresholds. This study uncovers the distinctive strengths and weaknesses of each tracker, providing invaluable insights for researchers and practitioners in surveillance and action detection. Importantly, this work focuses on tracking methods within the action detection domain, underscoring the development of a tracker explicitly designed for action detection on pertinent datasets, such as VIRAT.
引用
收藏
页码:68079 / 68092
页数:14
相关论文
共 31 条
[1]  
Aharon N, 2022, Arxiv, DOI arXiv:2206.14651
[2]   Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics [J].
Bernardin, Keni ;
Stiefelhagen, Rainer .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
[3]  
Brostrom M., 2022, GITHUB
[4]  
Cao JK, 2023, Arxiv, DOI [arXiv:2203.14360, 10.48550/ARXIV.2203.14360]
[5]   A multibranch CNN-BiLSTM model for human activity recognition using wearable sensor data [J].
Challa, Sravan Kumar ;
Kumar, Akhilesh ;
Semwal, Vijay Bhaskar .
VISUAL COMPUTER, 2022, 38 (12) :4095-4109
[6]   Minding the Gaps in a Video Action Analysis Pipeline [J].
Chen, Jia ;
Liu, Jiang ;
Liang, Junwei ;
Hu, Ting-Yao ;
Ke, Wei ;
Barrios, Wayner ;
Huang, Dong ;
Hauptmann, Alexander G. .
2019 IEEE WINTER APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2019, :41-46
[7]   Deep learning in video multi-object tracking: A survey [J].
Ciaparrone, Gioele ;
Luque Sanchez, Francisco ;
Tabik, Siham ;
Troiano, Luigi ;
Tagliaferri, Roberto ;
Herrera, Francisco .
NEUROCOMPUTING, 2020, 381 :61-88
[8]   MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection [J].
Corona, Kellie ;
Osterdahl, Katie ;
Collins, Roderic ;
Hoogs, Anthony .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :1059-1067
[9]   DG-Labeler and DGL-MOTS Dataset: Boost the Autonomous Driving Perception [J].
Cui, Yiming ;
Cao, Zhiwen ;
Xie, Yixin ;
Jiang, Xingyu ;
Tao, Feng ;
Chen, Yingjie Victor ;
Li, Lin ;
Liu, Dongfang .
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :3411-3420
[10]   GabriellaV2: Towards better generalization in surveillance videos for Action Detection [J].
Dave, Ishan ;
Scheffer, Zacchaeus ;
Kumar, Akash ;
Shiraz, Sarah ;
Rawat, Yogesh Singh ;
Shah, Mubarak .
2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, :122-132