Siamese Cascaded Region Proposal Networks for Real-Time Visual Tracking

Cited by: 371

Authors:

Fan, Heng [1]

Ling, Haibin [1]

Affiliation:

[1] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA

Source:

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019) | 2019

DOI:

10.1109/CVPR.2019.00814

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Recently, region proposal networks (RPN) have been combined with the Siamese network for tracking, and have shown excellent accuracy with high efficiency. Nevertheless, previously proposed one-stage Siamese-RPN trackers degenerate in the presence of similar distractors and large scale variation. Addressing these issues, we propose a multi-stage tracking framework, Siamese Cascaded RPN (C-RPN), which consists of a sequence of RPNs cascaded from deep high-level to shallow low-level layers in a Siamese network. Compared to previous solutions, C-RPN has several advantages: (1) Each RPN is trained using the outputs of the RPN in the previous stage. Such a process stimulates hard negative sampling, resulting in more balanced training samples. Consequently, the RPNs are sequentially more discriminative in distinguishing difficult background (i.e., similar distractors). (2) Multi-level features are fully leveraged through a novel feature transfer block (FTB) for each RPN, further improving the discriminability of C-RPN using both high-level semantic and low-level spatial information. (3) With multiple steps of regression, C-RPN progressively refines the location and shape of the target in each RPN with the adjusted anchor boxes from the previous stage, which makes localization more accurate. C-RPN is trained end-to-end with the multi-task loss function. In inference, C-RPN is deployed as it is, without any temporal adaption, for real-time tracking. In extensive experiments on OTB-2013, OTB-2015, VOT-2016, VOT-2017, LaSOT and TrackingNet, C-RPN consistently achieves state-of-the-art results and runs in real-time.
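Points (1) and (3) of the abstract can be illustrated with a minimal sketch of the cascade control flow: each stage scores the anchors passed on by the previous stage, drops those it confidently rejects as background, and regresses the rest toward the target. This is not the authors' implementation. The `Stage` interface, the `neg_threshold` value, the `toy_stage` stand-in for a trained RPN, and the center/log-scale offset parametrization (borrowed from common RPN practice) are all assumptions made for illustration.

```python
import math
from dataclasses import dataclass
from typing import Callable, List, Tuple

Box = Tuple[float, float, float, float]      # (cx, cy, w, h)
Delta = Tuple[float, float, float, float]    # (dx, dy, dw, dh)

# Hypothetical stage interface: given an anchor box, return a
# foreground score in [0, 1] and regression offsets for that box.
Stage = Callable[[Box], Tuple[float, Delta]]


@dataclass
class Anchor:
    box: Box
    score: float = 1.0


def refine(box: Box, delta: Delta) -> Box:
    """Apply offsets with the usual center / log-scale parametrization (assumed)."""
    cx, cy, w, h = box
    dx, dy, dw, dh = delta
    return (cx + dx * w, cy + dy * h, w * math.exp(dw), h * math.exp(dh))


def cascade(anchors: List[Box], stages: List[Stage],
            neg_threshold: float = 0.05) -> List[Anchor]:
    """Run anchors through the stages in order.

    Anchors whose foreground score falls below `neg_threshold` are
    treated as easy negatives and dropped, so later stages only see
    harder candidates; surviving anchors are refined before being
    handed to the next stage.
    """
    current = [Anchor(b) for b in anchors]
    for stage in stages:
        survivors = []
        for a in current:
            score, delta = stage(a.box)
            if score < neg_threshold:
                continue  # easy negative: filtered out of later stages
            survivors.append(Anchor(refine(a.box, delta), score))
        current = survivors
    return current


def toy_stage(target: Box) -> Stage:
    """Stand-in for a trained RPN: scores by center distance to `target`
    and regresses halfway toward it. Purely illustrative."""
    tx, ty, tw, th = target

    def stage(box: Box) -> Tuple[float, Delta]:
        cx, cy, w, h = box
        dist = abs(cx - tx) + abs(cy - ty)
        score = 1.0 / (1.0 + dist / 10.0)
        delta = (0.5 * (tx - cx) / w, 0.5 * (ty - cy) / h,
                 0.5 * math.log(tw / w), 0.5 * math.log(th / h))
        return score, delta

    return stage


if __name__ == "__main__":
    target = (60.0, 60.0, 40.0, 40.0)
    anchors = [(50.0, 50.0, 20.0, 20.0), (500.0, 500.0, 20.0, 20.0)]
    for a in cascade(anchors, [toy_stage(target)] * 2):
        print(a)
```

In this toy run the far-away anchor is rejected in the first stage, while the nearby anchor is moved closer to the target at every stage. In the paper each stage is a learned RPN fed by features fused through the FTB, which this sketch does not model.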

Pages: 7944-7953

Page count: 10
