Multi-object tracking with Siamese-RPN and adaptive matching strategy

被引:8
作者
Gao, Xinwen [1 ,2 ]
Shen, Zhuo [2 ,3 ]
Yang, Yumeng [2 ,3 ]
机构
[1] Shanghai Univ, Inst Mech & Elect Engn & Automat, Shanghai, Peoples R China
[2] Shanghai Univ, SHU SUCG Res Ctr Bldg Industrializat, Shanghai, Peoples R China
[3] Shanghai Univ, SILC Business Sch, Shanghai, Peoples R China
关键词
Multiple object tracking; Siamese RPN; Joint detection module; Adaptive matching strategy; MULTITARGET;
D O I
10.1007/s11760-021-02041-x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The multiple object tracking (MOT) task has always been a research hot point in computer vision. However, most current MOT algorithms do not pay enough attention to the prediction module. Also, in data association, they use manual debugging to determine the matching threshold. In this paper, we propose a new MOT algorithm. By introducing the Siamese RPN network as a predictor in the advanced detection module, the algorithm greatly enhances the adaptability to complex and diverse application scenarios while improving accuracy. Simultaneously, by analyzing the distance matrix in the data association module, we design a simple adaptive threshold determination method, which saves a lot of redundant experiments in the debugging process and avoids manual intervention. Combined with the self-designed matching strategy, the MOT algorithm with high accuracy and adaptability to more complex and diverse application scenarios such as nonlinear and high-speed is realized. Finally, the effectiveness and advantages of each module are verified on the MOT16, MOT17, and MOT20 benchmarks.
引用
收藏
页码:965 / 973
页数:9
相关论文
共 33 条
  • [1] Tracking without bells and whistles
    Bergmann, Philipp
    Meinhardt, Tim
    Leal-Taixe, Laura
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 941 - 951
  • [2] Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics
    Bernardin, Keni
    Stiefelhagen, Rainer
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
  • [3] Bertinetto L., 2016, NEURIPS, P523
  • [4] Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
  • [5] Bochinski E, 2018, 2018 15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), P435
  • [6] Bochinski E, 2017, 2017 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS)
  • [7] Bochkovskiy A., 2020, ARXIV 200410934
  • [8] Learning a Neural Solver for Multiple Object Tracking
    Braso, Guillem
    Leal-Taixe, Laura
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6246 - 6256
  • [9] Cheng L., 2018, 2018 IEEE International Magnetics Conference (INTERMAG), DOI 10.1109/INTMAG.2018.8508819
  • [10] Han S., 2020, ARXIV PREPRINT ARXIV