Understanding and Diagnosing Visual Tracking Systems

被引:294
作者
Wang, Naiyan [1 ]
Shi, Jianping [2 ,3 ]
Yeung, Dit-Yan [1 ]
Jia, Jiaya [3 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Hong Kong, Peoples R China
[2] Linkface Grp Ltd, Beijing, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
来源
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2015年
关键词
ROBUST OBJECT TRACKING;
D O I
10.1109/ICCV.2015.355
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several benchmark datasets for visual tracking research have been created in recent years. Despite their usefulness, whether they are sufficient for understanding and diagnosing the strengths and weaknesses of different trackers remains questionable. To address this issue, we propose a framework by breaking a tracker down into five constituent parts, namely, motion model, feature extractor, observation model, model updater, and ensemble post-processor. We then conduct ablative experiments on each component to study how it affects the overall result. Surprisingly, our findings are discrepant with some common beliefs in the visual tracking research community. We find that the feature extractor plays the most important role in a tracker. On the other hand, although the observation model is the focus of many studies, we find that it often brings no significant improvement. Moreover, the motion model and model updater contain many details that could affect the result. Also, the ensemble post-processor can improve the result substantially when the constituent trackers have high diversity. Based on our findings, we put together some very elementary building blocks to give a basic tracker which is competitive in performance to the state-of-the-art trackers. We believe our framework can provide a solid baseline when conducting controlled experiments for visual tracking research.
引用
收藏
页码:3101 / 3109
页数:9
相关论文
共 43 条
  • [1] Adam A., 2006, IEEE C COMPUTER VISI, V1, P798, DOI [DOI 10.1109/CVPR.2006.256, 10.1109/CVPR.2006.256]
  • [2] [Anonymous], 2015, CVPR
  • [3] [Anonymous], 2013, CVPR
  • [4] [Anonymous], BMVC
  • [5] [Anonymous], 2014, ARXIV14047584
  • [6] [Anonymous], 2006, BMVC06
  • [7] [Anonymous], 2014, ECCV WORKSH
  • [8] [Anonymous], 2015, ARXIV150206796
  • [9] [Anonymous], 2014, P INT C MACH LEARN
  • [10] [Anonymous], 2014, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2013.230