Siamese Instance Search for Tracking

Cited by: 940
Authors
Tao, Ran [1]
Gavves, Efstratios [1]
Smeulders, Arnold W. M. [1]
Affiliations
[1] QUVA Lab, Sci Pk 904, NL-1098 XH Amsterdam, Netherlands
Source
2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) | 2016
Keywords
DOI
10.1109/CVPR.2016.158
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we present a tracker, which is radically different from state-of-the-art trackers: we apply no model updating, no occlusion detection, no combination of trackers, no geometric matching, and still deliver state-of-the-art tracking performance, as demonstrated on the popular online tracking benchmark (OTB) and six very challenging YouTube videos. The presented tracker simply matches the initial patch of the target in the first frame with candidates in a new frame and returns the most similar patch by a learned matching function. The strength of the matching function comes from being extensively trained generically, i.e., without any data of the target, using a Siamese deep neural network, which we design for tracking. Once learned, the matching function is used as is, without any adapting, to track previously unseen targets. It turns out that the learned matching function is so powerful that a simple tracker built upon it, coined Siamese INstance search Tracker, SINT, which only uses the original observation of the target from the first frame, suffices to reach state-of-the-art performance. Further, we show the proposed tracker even allows for target re-identification after the target was absent for a complete video shot.
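The matching loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The function `embed` is a hypothetical stand-in for the learned Siamese branch (here it only mean-centres and L2-normalises raw pixels), and `candidate_boxes`, with its `radius` and `step` parameters, is an assumed sampling scheme. What the sketch does share with the abstract is its structure: the first-frame patch is encoded once, never updated, and each new frame returns the candidate most similar to it.

```python
import numpy as np


def embed(patch):
    """Hypothetical stand-in for the learned matching network.

    Returns mean-centred, L2-normalised pixels, so that the dot product
    of two embeddings is a normalised correlation score.
    """
    v = patch.astype(np.float64).ravel()
    v = v - v.mean()
    return v / (np.linalg.norm(v) + 1e-12)


def crop(frame, box):
    """Cut the (x, y, w, h) box out of a 2-D frame."""
    x, y, w, h = box
    return frame[y:y + h, x:x + w]


def candidate_boxes(prev_box, frame_shape, radius=8, step=2):
    """Same-size boxes on a grid around the previous position (assumed scheme)."""
    x, y, w, h = prev_box
    height, width = frame_shape[:2]
    boxes = []
    for dy in range(-radius, radius + 1, step):
        for dx in range(-radius, radius + 1, step):
            nx, ny = x + dx, y + dy
            # Keep only candidates that lie fully inside the frame.
            if nx >= 0 and ny >= 0 and nx + w <= width and ny + h <= height:
                boxes.append((nx, ny, w, h))
    return boxes


def track(frames, init_box):
    """Match the first-frame target against candidates in each later frame."""
    query = embed(crop(frames[0], init_box))  # computed once, never updated
    boxes = [init_box]
    for frame in frames[1:]:
        cands = candidate_boxes(boxes[-1], frame.shape)
        scores = [float(query @ embed(crop(frame, b))) for b in cands]
        boxes.append(cands[int(np.argmax(scores))])
    return boxes
```

Calling `track(frames, (x, y, w, h))` on a list of 2-D arrays returns one box per frame. Replacing `embed` with a trained network would leave the loop unchanged, which is the point the abstract makes about the tracker's simplicity.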
Pages: 1420-1429
Page count: 10