Joint Re-Detection and Re-Identification for Multi-Object Tracking

被引:3
作者
He, Jian [1 ]
Zhong, Xian [1 ,2 ]
Yuan, Jingling [1 ]
Tan, Ming [1 ]
Zhao, Shilei [1 ]
Zhong, Luo [1 ]
机构
[1] Wuhan Univ Technol, Sch Comp & Artificial Intelligence, Wuhan, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Beijing, Peoples R China
来源
MULTIMEDIA MODELING (MMM 2022), PT I | 2022年 / 13141卷
关键词
Multi-Object tracking; Trajectory re-detection; Re-identification non-maximum suppression; Missing detection;
D O I
10.1007/978-3-030-98358-1_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Within the tracking-by-detection framework, multi-object tracking (MOT) has always been plagued by missing detection. To address this problem, existing methods usually predict new positions of the trajectories first to provide more candidate bounding boxes (BBoxes), and then use non-maximum suppression (NMS) to eliminate the redundant BBoxes. However, when two BBoxes belonging to different objects have a significant intersection over union (IoU) due to occlusion, NMS will mistakenly filter out the one with lower confidence score, and these methods ignore the missing detection caused by NMS. We propose a joint re-detection and re-identification tracker (JDI) for MOT, consisting of two components, trajectory re-detection and NMS with re-identification (ReID). Specifically, the trajectory re-detection could predict new position of the trajectory in detection, a more reliable way than motion model (MM), based on feature matching. Furthermore, we propose to embed ReID features into NMS and take the similarity of the ReID features as an additional necessary condition to determine whether two BBoxes are the same object. Based on the "overlap degree" calculated by IoU and the similarity of ReID features, accurate filtering can be achieved through double-checking. We demonstrate the effectiveness of our tracking components with ablative experiments and surpass the state-of-the-art methods on the three tracking benchmarks MOT16, MOT17, and MOT20.
引用
收藏
页码:364 / 376
页数:13
相关论文
共 31 条
[1]  
Andriyenko A, 2012, PROC CVPR IEEE, P1926, DOI 10.1109/CVPR.2012.6247893
[2]   Tracking without bells and whistles [J].
Bergmann, Philipp ;
Meinhardt, Tim ;
Leal-Taixe, Laura .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :941-951
[3]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[4]   Soft-NMS - Improving Object Detection With One Line of Code [J].
Bodla, Navaneeth ;
Singh, Bharat ;
Chellappa, Rama ;
Davis, Larry S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570
[5]   Learning a Neural Solver for Multiple Object Tracking [J].
Braso, Guillem ;
Leal-Taixe, Laura .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6246-6256
[6]  
Chen LJ, 2018, 2018 3RD INTERNATIONAL CONFERENCE ON SYSTEM RELIABILITY AND SAFETY (ICSRS), P1, DOI [10.1109/ICSRS.2018.8688869, 10.1109/ICSRS.2018.00009]
[7]   FAMNet: Joint Learning of Feature, Affinity and Multi-dimensional Assignment for Online Multiple Object Tracking [J].
Chu, Peng ;
Ling, Haibin .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6171-6180
[8]   Learning a Proposal Classifier for Multiple Object Tracking [J].
Dai, Peng ;
Weng, Renliang ;
Choi, Wongun ;
Zhang, Changshui ;
He, Zhangping ;
Ding, Wei .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2443-2452
[9]  
Datal N., 2005, 2005 IEEE COMP VIS P, P886, DOI 10.1109/CVPR.2005.177
[10]  
Fan Wang, 2021, MultiMedia Modeling. 27th International Conference, MMM 2021. Proceedings. Lecture Notes in Computer Science (LNCS 12573), P49, DOI 10.1007/978-3-030-67835-7_5