Deep Metric Learning for Visual Tracking

被引:56
作者
Hu, Junlin [1 ]
Lu, Jiwen [2 ]
Tan, Yap-Peng [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
基金
新加坡国家研究基金会;
关键词
Deep learning; metric learning; visual tracking; OBJECT TRACKING; REPRESENTATIONS; MODEL;
D O I
10.1109/TCSVT.2015.2477936
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a deep metric learning (DML) approach for robust visual tracking under the particle filter framework. Unlike most existing appearance-based visual trackers, which use hand-crafted similarity metrics, our DML tracker learns a nonlinear distance metric to classify the target object and background regions using a feed-forward neural network architecture. Since there are usually large variations in visual objects caused by varying deformations, illuminations, occlusions, motions, rotations, scales, and cluttered backgrounds, conventional linear similarity metrics cannot work well in such scenarios. To address this, our proposed DML tracker first learns a set of hierarchical nonlinear transformations in the feed-forward neural network to project both the template and particles into the same feature space where the intra-class variations of positive training pairs are minimized and the interclass variations of negative training pairs are maximized simultaneously. Then, the candidate that is most similar to the template in the learned deep network is identified as the true target. Experiments on the benchmark data set including 51 challenging videos show that our DML tracker achieves a very competitive performance with the state-of-the-art trackers.
引用
收藏
页码:2056 / 2068
页数:13
相关论文
共 60 条
[1]  
Andrew G., 2013, P 30 INT C INT C MAC, P1247
[2]  
[Anonymous], 2006, BMVC06
[3]  
[Anonymous], 2012, PROC CVPR IEEE
[4]  
[Anonymous], 2013, DRUG ALCOHOL DEPEND
[5]  
Babenko B, 2009, PROC CVPR IEEE, P983, DOI 10.1109/CVPRW.2009.5206737
[6]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[7]   EigenTracking: Robust matching and tracking of articulated objects using a view-based representation [J].
Black, MJ ;
Jepson, AD .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1998, 26 (01) :63-84
[8]  
Cai Xinyuan., 2012, ACM MULTIMEDIA MM 12, P749, DOI DOI 10.1145/2393347.2396303
[9]   Kernel-based object tracking [J].
Comaniciu, D ;
Ramesh, V ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (05) :564-577
[10]  
Davis J.V., 2007, P 24 INT C MACHINE L, P209, DOI DOI 10.1145/1273496.1273523