Learning spatio-temporal context via hierarchical features for visual tracking

被引:8
作者
Cao, Yi [1 ]
Ji, Hongbing [1 ]
Zhang, Wenbo [1 ]
Xue, Fei [1 ]
机构
[1] Xidian Univ, Sch Elect Engn, POB 229, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Visual tracking; Convolutional neural network; Transfer learning; Spatio-temporal context; Dynamic training confidence map; Training confidence index; OBJECT TRACKING;
D O I
10.1016/j.image.2018.04.010
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Spatio-temporal context (STC) based visual tracking algorithms have demonstrated remarkable tracking capabilities in recent years. In this paper, we propose an improved STC method, which seamlessly integrates capabilities of the powerful feature representations and mappings from the convolutional neural networks (CNNs) based on the theory of transfer learning. Firstly, the dynamic training confidence map, obtained from a mapping neural network using transferred CNN features, rather than the fixed training confidence map is utilized in our tracker to adapt the practical tracking scenes better. Secondly, we exploit hierarchical features from both the original image intensity and the transferred CNN features to construct context prior models. In order to enhance the accuracy and robustness of our tracker, we simultaneously transfer the fine-grained and semantic features from deep networks. Thirdly, we adopt the training confidence index (TCI), reflected from the dynamic training confidence map, to guide the updating process. It can determine whether back propagations should be conducted in the mapping neural network, and whether the STC model should be updated. The introduction of the dynamic training confidence map could effectively deal with the problem of location ambiguity further in our tracker. Overall, the comprehensive experimental results illustrate that the tracking capability of our tracker is competitive against several state-of-the-art trackers, especially the baseline STC tracker, on the existing OTB-2015 and UAV123 visual tracking benchmarks.
引用
收藏
页码:50 / 65
页数:16
相关论文
共 55 条
[1]  
[Anonymous], 2017, ARXIV170305020
[2]  
[Anonymous], 2006, BMVC06
[3]  
[Anonymous], 2012, Technical Report
[4]  
Bolme DS, 2010, PROC CVPR IEEE, P2544, DOI 10.1109/CVPR.2010.5539960
[5]   Visual tracking via adaptive multi-task feature learning with calibration and identification [J].
Chen, Pengguang ;
Zhang, Xingming ;
Mao, Aihua ;
Xiong, Jianbin .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 49 :17-24
[6]  
Chen Z., 2015, Comput. Sci., V53, P68
[7]  
Danelljan M., 2014, Proceedings of the British Machine Vision Conference (BMVC), P1
[8]   Adaptive Color Attributes for Real-Time Visual Tracking [J].
Danelljan, Martin ;
Khan, Fahad Shahbaz ;
Felsberg, Michael ;
van de Weijer, Joost .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1090-1097
[9]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]  
Dinh TB, 2011, PROC CVPR IEEE, P1177, DOI 10.1109/CVPR.2011.5995733