Visual tracking via weakly supervised learning from multiple imperfect oracles

被引:65
作者
Zhong, Bineng [1 ,6 ]
Yao, Hongxun [2 ]
Chen, Sheng [3 ]
Ji, Rongrong [4 ]
Chin, Tat-Jun [5 ]
Wang, Hanzi [6 ]
机构
[1] Huaqiao Univ, Dept Comp Sci & Engn, Fujian, Peoples R China
[2] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin, Peoples R China
[3] Oregon State Univ, Corvallis, OR 97331 USA
[4] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
[5] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia
[6] Xiamen Univ, Sch Informat Sci & Technol, Xiamen 361005, Peoples R China
基金
中国博士后科学基金;
关键词
Visual tracking; Weakly supervised learning; Information fusion; Online learning; Adaptive appearance model; Drift problem; Online evaluation;
D O I
10.1016/j.patcog.2013.10.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Notwithstanding many years of progress, visual tracking is still a difficult but important problem. Since most top-performing tracking methods have their strengths and weaknesses and are suited for handling only a certain type of variation, one of the next challenges is to integrate all these methods and address the problem of long-term persistent tracking in ever-changing environments. Towards this goal, we consider visual tracking in a novel weakly supervised learning scenario where (possibly noisy) labels but no ground truth are provided by multiple imperfect oracles (i.e., different trackers). These trackers naturally have intrinsic diversity due to their different design strategies, and we propose a probabilistic method to simultaneously infer the most likely object position by considering the outputs of all trackers, and estimate the accuracy of each tracker. An online evaluation strategy of trackers and a heuristic training data selection scheme are adopted to make the inference more effective and efficient. Consequently, the proposed method can avoid the pitfalls of purely single tracking methods and get reliably labeled samples to incrementally update each tracker (if it is an appearance-adaptive tracker) to capture the appearance changes. Extensive experiments on challenging video sequences demonstrate the robustness and effectiveness of the proposed method. Crown Copyright (C) 2013 Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1395 / 1410
页数:16
相关论文
共 57 条
[1]  
Adam A., 2006, P CVPR
[2]  
Ahn L., 2008, SCIENCE
[3]  
[Anonymous], P ECCV
[4]  
[Anonymous], P CVPR
[5]  
[Anonymous], P CVPR
[6]  
Avidan S., 2007, P TPAMI
[7]  
Babenko B., 2011, P TPAMI
[8]  
Bai T.X., 2012, P PR
[9]  
Birchfield S., 1998, P CVPR
[10]  
Collins R., 2005, P TPAMI