Granulated RCNN and Multi-Class Deep SORT for Multi-Object Detection and Tracking

被引:101
作者
Pramanik, Anima [1 ]
Pal, Sankar K. [2 ]
Maiti, J. [1 ]
Mitra, Pabitra [3 ]
机构
[1] Indian Inst Technol Kharagpur, Ind & Syst Engn, Kharagpur 721302, W Bengal, India
[2] ISI, Soft Comp, Kolkata 700108, India
[3] IIT Kharagpur, Comp Sci & Engn, Kharagpur 721302, W Bengal, India
来源
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2022年 / 6卷 / 01期
关键词
Videos; Detectors; Feature extraction; Proposals; Computational modeling; Steel; Object detection; Deep CNN; Foreground region proposal; Granulation; Object detection and tracking; Video analysis; SAR IMAGES; ENTROPY;
D O I
10.1109/TETCI.2020.3041019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, two new models, namely granulated RCNN (G-RCNN) and multi-class deep SORT (MCD-SORT), for object detection and tracking, respectively from videos are developed. Object detection has two stages: object localization (region of interest RoI) and classification. G-RCNN is an improved version of the well-known Fast RCNN and Faster RCNN for extracting RoIs by incorporating the unique concept of granulation in a deep convolutional neural network. Granulation with spatio-temporal information enables more accurate extraction of RoIs (object regions) in unsupervised mode. Compared to Fast and Faster RCNNs, G-RCNN uses (i) granules (clusters) formed over the pooling feature map, instead of its all feature values, in defining RoIs, (ii) only the positive RoIs during training, instead of the whole RoI-map, (iii) videos directly as input, rather than static images, and (iv) only the objects in RoIs, instead of the entire feature map, for performing object classification. All these lead to the improvement in real-time detection speed and accuracy. MCD-SORT is an advanced form of the popular Deep SORT. In MCD-SORT, the searching for association of objects with trajectories is restricted only within the same categories. This increases the performance in multi-class tracking. These characteristic features have been demonstrated over 37 videos containing single-class, two-class, and multi-class objects. Superiority of the models over several state-of-the-art methodologies is also established extensively, both qualitatively and quantitatively.
引用
收藏
页码:171 / 181
页数:11
相关论文
共 31 条
[1]   Trajectory-Based Surveillance Analysis: A Survey [J].
Ahmed, Sk Arif ;
Dogra, Debi Prosad ;
Kar, Samarjit ;
Roy, Partha Pratim .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (07) :1985-1997
[2]   A New Statistical-Based Kurtosis Wavelet Energy Feature for Texture Recognition of SAR Images [J].
Akbarizadeh, Gholamreza .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2012, 50 (11) :4358-4368
[3]   Measuring the Objectness of Image Windows [J].
Alexe, Bogdan ;
Deselaers, Thomas ;
Ferrari, Vittorio .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2189-2202
[4]   A Deep Learning Approach for Energy Efficient Computational Offloading in Mobile Edge Computing [J].
Ali, Zaiwar ;
Jiao, Lei ;
Baker, Thar ;
Abbas, Ghulam ;
Abbas, Ziaul Haq ;
Khaf, Sadia .
IEEE ACCESS, 2019, 7 :149623-149633
[5]  
[Anonymous], 2015, ARXIV
[6]   Confidence-Based Data Association and Discriminative Deep Appearance Learning for Robust Online Multi-Object Tracking [J].
Bae, Seung-Hwan ;
Yoon, Kuk-Jin .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (03) :595-610
[7]   Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics [J].
Bernardin, Keni ;
Stiefelhagen, Rainer .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
[8]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[9]   Granulation, rough entropy and spatiotemporal moving object detection [J].
Chakraborty, Debarati ;
Shankar, B. Uma ;
Pal, Sankar K. .
APPLIED SOFT COMPUTING, 2013, 13 (09) :4001-4009
[10]   Neighborhood Rough Filter and Intuitionistic Entropy in Unsupervised Tracking [J].
Chakraborty, Debarati Bhunia ;
Pal, Sankar K. .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (04) :2188-2200