Siamese Cascaded Region Proposal Networks for Real-Time Visual Tracking

被引:349
作者
Fan, Heng [1 ]
Ling, Haibin [1 ]
机构
[1] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00814
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, the regionproposal networks (RPN) have been combinedwith the Siamese networkfor tracking,and shown excellent accuracy with high efficiency. Nevertheless, previously proposedone-stage Siamese-RPNtrackersdegenerate in presence of similar distractorsand large scale variation. Addressing these issues, we propose a multi-stage tracking framework, Siamese Cascaded RPN (C-RPN), which consists of a sequence of RPNs cascadedfrom deep high-level to shallow low-level layers in a Siamese network. Comparedto previous solutions, C-RPN has severaladvantages: (1) Each RPN is trained using the outputs of RPN in the previous stage. Such process stimulates hardnegative sampling, resulting in more balanced training samples. Consequently, the RPNs are sequentially more discriminative in distinguishingdifficult background (i.e., similar distractors). (2) Multi-level features arefully leveragedthrough a novelfeature transferblock (FTB)for each RPNfurther improving the discriminabilityof C-RPN using both high-level semantic and low-level spatial information. (3) With multiple steps of regressions, C-RPN progressively refines the location and shape of the target in each RPN with adjusted anchor boxes in the previous stage, which makes localization more accurate. C-RPN is trained end-to-end with the multi-task lossfunction. In inference, C-RPN is deployed as it is, without any temporaladaption,for real-time tracking. In extensive experiments on OTB-2013, OTB-2015, VOT2016, VOT-2017, LaSOT and TrackingNet, C-RPN consistently achieves state-of-the-artresultsand runs in real-time.
引用
收藏
页码:7944 / 7953
页数:10
相关论文
共 59 条
  • [1] [Anonymous], 2018, LASOT HIGH QUALITY B
  • [2] [Anonymous], 2018, P COMPUTER VISION PA
  • [3] [Anonymous], 2017, CVPR
  • [4] [Anonymous], 2013, CVPR
  • [5] [Anonymous], 2016, ECCVW
  • [6] [Anonymous], 2015, CVPR
  • [7] [Anonymous], 2016, CVPR
  • [8] [Anonymous], 2016, CVPR
  • [9] [Anonymous], 2015, ICCV
  • [10] [Anonymous], 2015, ICCV