A deep features based generative model for visual tracking

Cited by: 10
Authors
Feng, Ping [2]
Xu, Chunyan [3]
Zhao, Zhiqiang [2]
Liu, Fang [2]
Guo, Jingjuan [2]
Yuan, Caihong [2]
Wang, Tianjiang [2]
Duan, Kui [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Hosp, Wuhan 430074, Hubei, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Hubei, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
Keywords
Visual tracking; Deep features; Generative model; Object tracking
DOI
10.1016/j.neucom.2018.05.007
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we propose a novel visual tracking algorithm built on a generative model framework. To make the algorithm robust to challenging appearance changes, we describe the tracked object's appearance with deep features. The features are extracted from a Convolutional Neural Network (CNN) that is modified from the VGG-M network, with fewer convolutional layers and sequence-exclusive fully connected layers. During pretraining, we add a special convolutional layer, called the coefficients layer, before the fully connected layers. During tracking, after the network has been pretrained, we remove the coefficients layer and update only the fully connected layers, conditionally. To determine the target's new position, we compute a composite similarity score that combines three kinds of similarity with different weights. The first is the similarity between candidates and the target in the first frame; the second is the similarity between candidates and the tracking result in the previous frame. The third relates to important variations in object appearance during tracking: we design a simple mechanism that builds a collection of historical templates, recorded whenever the object's appearance changes substantially. Using the similarities between candidates and these historical templates alleviates the drift problem to some extent, because similar appearances sometimes recur and the recorded templates provide important information. To compute all similarities, we use the outputs of the convolutional part before the fully connected layers as features and weight them with the filter weights of the coefficients layer. Finally, the candidate with the highest score is taken as the new target in the current frame. Evaluation on the CVPR2013 Online Object Tracking Benchmark shows that our algorithm achieves outstanding performance compared with state-of-the-art trackers. (C) 2018 Elsevier B.V. All rights reserved.
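The scoring step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the similarity measure, the weight values, or how the historical templates are aggregated, so the cosine similarity, the weights (0.4, 0.4, 0.2), the maximum over historical templates, and all function names below are assumptions made for illustration only.

```python
import math


def weighted_cosine(a, b, coeff):
    """Cosine similarity after scaling each feature by a coefficient.

    ASSUMPTION: cosine similarity is used here for illustration; the
    abstract does not name the similarity measure.
    """
    wa = [c * x for c, x in zip(coeff, a)]
    wb = [c * x for c, x in zip(coeff, b)]
    dot = sum(x * y for x, y in zip(wa, wb))
    norm = math.sqrt(sum(x * x for x in wa)) * math.sqrt(sum(y * y for y in wb))
    return dot / norm if norm > 0 else 0.0


def composite_score(cand, first, last, history, coeff, weights=(0.4, 0.4, 0.2)):
    """Weighted sum of the three kinds of similarity for one candidate.

    ASSUMPTION: the weights and the max over historical templates are
    hypothetical choices, not values from the paper.
    """
    w_first, w_last, w_hist = weights
    s_first = weighted_cosine(cand, first, coeff)   # vs. first-frame target
    s_last = weighted_cosine(cand, last, coeff)     # vs. previous-frame result
    s_hist = max((weighted_cosine(cand, h, coeff) for h in history), default=0.0)
    return w_first * s_first + w_last * s_last + w_hist * s_hist


def select_target(candidates, first, last, history, coeff, weights=(0.4, 0.4, 0.2)):
    """Return the index of the highest-scoring candidate and all scores."""
    scores = [composite_score(c, first, last, history, coeff, weights)
              for c in candidates]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Toy usage with 3-dimensional feature vectors (real features would be
# the CNN's convolutional outputs).
candidates = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
best, scores = select_target(
    candidates,
    first=[1.0, 0.0, 0.0],
    last=[1.0, 0.0, 0.0],
    history=[[0.0, 1.0, 0.0]],
    coeff=[1.0, 1.0, 1.0],
)
```

In this toy case the first candidate matches both the first-frame target and the previous-frame result, so it outscores the second candidate, which matches only a historical template.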
Pages: 245-254
Number of pages: 10