Improving Multiple Object Tracking with Single Object Tracking

Cited by: 89

Authors:

Zheng, Linyu ^{[1,2]}

Tang, Ming ^{[1]}

Chen, Yingying ^{[1,2,3]}

Zhu, Guibo ^{[1,2]}

Wang, Jinqiao ^{[1,2,3]}

Lu, Hanqing ^{[1,2]}

Affiliations:

[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China

[3] ObjectEye Inc, Beijing, Peoples R China

Source:

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021

Funding:

National Natural Science Foundation of China

DOI:

10.1109/CVPR46437.2021.00248

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Despite considerable similarities between the multiple object tracking (MOT) and single object tracking (SOT) tasks, modern MOT methods have not benefited from progress in SOT to achieve satisfactory performance. The main reason is that applying multiple SOT models directly to the MOT task is inappropriate and inefficient, even though advanced SOT methods have strong discriminative power and run at high speed. In this paper, we propose a novel, end-to-end trainable MOT architecture that extends CenterNet by adding an SOT branch for tracking objects in parallel with the existing branch for object detection, allowing the MOT task to benefit from the strong discriminative power of SOT methods effectively and efficiently. Unlike most existing SOT methods, which learn to distinguish the target object from its local background, the added SOT branch trains a separate SOT model per target online to distinguish that target from its surrounding targets, giving the SOT models a new kind of discrimination. Moreover, like the detection branch, the SOT branch treats objects as points, which keeps its online learning efficient even when multiple targets are processed simultaneously. Without tricks, the proposed tracker achieves MOTAs of 0.710 and 0.686 and IDF1s of 0.719 and 0.714 on the MOT17 and MOT20 benchmarks, respectively, while running at 16 FPS on MOT17.
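For readers unfamiliar with the two scores reported in the abstract, the following is a minimal sketch of how MOTA and IDF1 are conventionally computed from error counts. It is not taken from the paper: the function names and the example counts are hypothetical, and the standard CLEAR MOT and identity-metric definitions are assumed.

```python
def mota(false_negatives, false_positives, id_switches, num_gt):
    """MOTA under the standard CLEAR MOT definition (assumed here):
    1 - (FN + FP + IDSW) / GT, with counts summed over all frames."""
    if num_gt <= 0:
        raise ValueError("num_gt must be positive")
    errors = false_negatives + false_positives + id_switches
    return 1.0 - errors / num_gt


def idf1(idtp, idfp, idfn):
    """IDF1 under the standard identity-metric definition (assumed here):
    2 * IDTP / (2 * IDTP + IDFP + IDFN)."""
    denominator = 2 * idtp + idfp + idfn
    if denominator == 0:
        raise ValueError("at least one count must be non-zero")
    return 2 * idtp / denominator


if __name__ == "__main__":
    # Hypothetical counts chosen only for illustration; they are not
    # results from the paper or from any benchmark.
    print("MOTA:", mota(false_negatives=200, false_positives=80,
                        id_switches=10, num_gt=1000))
    print("IDF1:", idf1(idtp=50, idfp=25, idfn=25))
```

MOTA penalizes detection errors and identity switches together, while IDF1 measures how consistently identities are preserved, which is why tracking papers such as this one usually report both.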

Pages: 2453-2462

Page count: 10
