ByteTrack: Multi-object Tracking by Associating Every Detection Box

被引:903
作者
Zhang, Yifu [1 ]
Sun, Peize [2 ]
Jiang, Yi [3 ]
Yu, Dongdong [3 ]
Weng, Fucheng [1 ]
Yuan, Zehuan [3 ]
Luo, Ping [2 ]
Liu, Wenyu [1 ]
Wang, Xinggang [1 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Peoples R China
[2] Univ Hong Kong, Hong Kong, Peoples R China
[3] ByteDance Inc, Beijing, Peoples R China
来源
COMPUTER VISION, ECCV 2022, PT XXII | 2022年 / 13682卷
关键词
Multi-object tracking; Data association; Detection boxes; MULTITARGET;
D O I
10.1007/978-3-031-20047-2_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-object tracking (MOT) aims at estimating bounding boxes and identities of objects in videos. Most methods obtain identities by associating detection boxes whose scores are higher than a threshold. The objects with low detection scores, e.g. occluded objects, are simply thrown away, which brings non-negligible true object missing and fragmented trajectories. To solve this problem, we present a simple, effective and generic association method, tracking by associating almost every detection box instead of only the high score ones. For the low score detection boxes, we utilize their similarities with tracklets to recover true objects and filter out the background detections. When applied to 9 different state-of-the-art trackers, our method achieves consistent improvement on IDF1 score ranging from 1 to 10 points. To put forwards the state-of-the-art performance of MOT, we design a simple and strong tracker, named ByteTrack. For the first time, we achieve 80.3 MOTA, 77.3 IDF1 and 63.1 HOTA on the test set of MOT17 with 30 FPS running speed on a single V100 GPU. ByteTrack also achieves state-of-the-art performance on MOT20, HiEve and BDD100K tracking benchmarks. The source code, pre-trained models with deploy versions and tutorials of applying to other trackers are released at https://github. com/ifzhang/ByteTrack.
引用
收藏
页码:1 / 21
页数:21
相关论文
共 91 条
[1]   Robust Online Multi-Object Tracking based on Tracklet Confidence and Online Discriminative Appearance Learning [J].
Bae, Seung-Hwan ;
Yoon, Kuk-Jin .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1218-1225
[2]   Multiple Object Tracking Using K-Shortest Paths Optimization [J].
Berclaz, Jerome ;
Fleuret, Francois ;
Tueretken, Engin ;
Fua, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (09) :1806-1819
[3]   Tracking without bells and whistles [J].
Bergmann, Philipp ;
Meinhardt, Tim ;
Leal-Taixe, Laura .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :941-951
[4]   Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics [J].
Bernardin, Keni ;
Stiefelhagen, Rainer .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
[5]   Fully-Convolutional Siamese Networks for Object Tracking [J].
Bertinetto, Luca ;
Valmadre, Jack ;
Henriques, Joao F. ;
Vedaldi, Andrea ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :850-865
[6]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[7]  
Bochinski E, 2017, 2017 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS)
[8]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[9]   Learning a Neural Solver for Multiple Object Tracking [J].
Braso, Guillem ;
Leal-Taixe, Laura .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6246-6256
[10]  
Cai J, 2022, P IEEECVF C COMPUTER, P8090