Robust Deep Simple Online Real-time Tracking

Cited by: 2

Authors:

Belmouhcine, Abdelbadie [1]

Simon, Julien [2]

Courtrai, Luc [3]

Lefevre, Sebastien [3]

Affiliations:

[1] Univ Bretagne Sud, IFREMER, LTBH IRISA, Vannes, France

[2] IFREMER, LTBH, Lorient, France

[3] Univ Bretagne Sud, IRISA, Vannes, France

Source:

PROCEEDINGS OF THE 12TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2021) | 2021

Keywords:

Counting; DeepSORT; EfficientDet; MultiObject Tracking; SORT;

DOI:

10.1109/ISPA52656.2021.9552062

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Simple Online and Real-time Tracking (SORT) and its deep extension (DeepSORT) are simple, fast, and effective multi-object tracking-by-detection frameworks. However, they still suffer from problems such as identity switches, instance merges, and many false positives, which prevent the tracking results from being used for subsequent tasks such as counting. In this paper, we strengthen and improve the tracking using EfficientDet and DeepSORT. In our approach, the motion prediction uses appearance, and the appearance embedding uses location. First, we modify the deep detection network to predict the objects' motion in the next frame by leveraging the attention between the current image and the next image. Second, an appearance-based metric is used to associate detections with tracks after false negatives and occlusion. This metric is a learned Mahalanobis distance between two feature descriptors constructed using EfficientDet and attention given to regions of interest from their images. Finally, we count only high-confidence tracks that appear in a minimum number of frames. Our approach has been applied to a challenging real-life problem, namely seabed species tracking and counting. Our experimental results show that Robust DeepSORT reduces identity switches and merges. Thus, it improves tracking and counting evaluation measures while keeping the simplicity of the original DeepSORT.
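
The last two steps of the abstract (a learned Mahalanobis distance between feature descriptors, and counting only high-confidence tracks seen often enough) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the factorisation M = L^T L of the learned matrix, the use of mean detection confidence per track, and the threshold values `min_confidence` and `min_frames` are all assumptions made for illustration; the record does not give these details.

```python
import math


def mahalanobis_distance(x, y, L):
    """Return sqrt((x - y)^T M (x - y)) with M = L^T L.

    L stands in for a learned projection matrix (assumption: the paper's
    metric is parameterised this way); with L = identity this reduces to
    the Euclidean distance.
    """
    diff = [a - b for a, b in zip(x, y)]
    # Project the difference vector: each output entry is one row of L times diff.
    projected = [sum(l_ij * d_j for l_ij, d_j in zip(row, diff)) for row in L]
    return math.sqrt(sum(p * p for p in projected))


def count_tracks(tracks, min_confidence=0.5, min_frames=3):
    """Count tracks that are both confident and seen often enough.

    `tracks` maps a track id to the list of detection confidences, one per
    frame in which the track was matched. Both thresholds are hypothetical.
    """
    count = 0
    for scores in tracks.values():
        if len(scores) < min_frames:
            continue  # track appears in too few frames
        if sum(scores) / len(scores) >= min_confidence:
            count += 1
    return count
```

In a DeepSORT-style pipeline, such a distance would typically fill a cost matrix between existing tracks and new detections before an assignment step, and the counting filter would run once over the finished tracks.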

Pages: 138-144

Page count: 7
