Local Semantic Siamese Networks for Fast Tracking

Cited by: 115
Authors
Liang, Zhiyuan [1]
Shen, Jianbing [1]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
Funding
Beijing Natural Science Foundation;
Keywords
Visual object tracking; Siamese deep network; local feature representation; OBJECT TRACKING;
DOI
10.1109/TIP.2019.2959256
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Learning a powerful feature representation is critical for constructing a robust Siamese tracker. However, most existing Siamese trackers learn global appearance features of the entire object and therefore usually suffer from drift caused by partial occlusion or non-rigid appearance deformation. In this paper, we propose a new Local Semantic Siamese (LSSiam) network that extracts more robust features to address these drift problems, since local semantic features contain more fine-grained, partial information. We learn the semantic features during offline training by adding a classification branch to the classical Siamese framework. To further enhance the feature representation, we design a generally focal logistic loss to mine hard negative samples. During online tracking, we remove the classification branch and propose an efficient template updating strategy that avoids a heavy computational load. As a result, the proposed tracker runs at 100 frames per second (FPS), far beyond the real-time requirement. Extensive experiments on popular benchmarks demonstrate that the proposed LSSiam tracker achieves state-of-the-art performance at high speed. Our source code is available at https://github.com/shenjianbing/LSSiam.
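The abstract does not give the form of the focal logistic loss, so the following is only a minimal sketch of the general idea. It assumes the common focal modulation factor (1 - p)^gamma applied to a standard logistic loss, where p is the predicted probability of the correct label. The function names and the `gamma` parameter are hypothetical and are not taken from the paper or its released code.

```python
import math


def logistic_loss(score, label):
    """Standard logistic loss log(1 + exp(-label * score)), label in {+1, -1}."""
    z = -label * score
    # Numerically stable softplus: log(1 + exp(z))
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def focal_logistic_loss(score, label, gamma=2.0):
    """Logistic loss scaled by (1 - p)^gamma (assumed focal form, not the paper's).

    p = sigmoid(label * score) is the probability assigned to the correct
    label, so well-classified (easy) samples are down-weighted and hard
    samples dominate the loss.
    """
    base = logistic_loss(score, label)
    p = math.exp(-base)  # since base = -log(p)
    return (1.0 - p) ** gamma * base


def mean_focal_loss(scores, labels, gamma=2.0):
    """Average the focal logistic loss over paired scores and labels."""
    losses = [focal_logistic_loss(s, y, gamma) for s, y in zip(scores, labels)]
    return sum(losses) / len(losses)
```

With `gamma=0` the sketch reduces to the plain logistic loss. With larger `gamma`, a confidently wrong negative (a high response score at a background location) contributes far more to the loss than a correctly rejected one, which is the sense in which such a loss mines hard negatives.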
Pages: 3351-3364
Page count: 14