Region-based High-resolution Siamese Network for Robust Visual Tracking

被引:1
作者
Li, Chunbao [1 ]
Yang, Bo [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
来源
PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON BIOMEDICAL SIGNAL AND IMAGE PROCESSING (ICBIP 2019) | 2019年
关键词
Visual tracking; Appearance variations; High-resolution representation; Siamese neural networks; Position-sensitive score maps; OBJECT TRACKING;
D O I
10.1145/3354031.3354051
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Visual tracking is an active and challenging research topic in computer vision, as objects often undergo significant appearance variations caused by occlusion, deformation and background clutter. In recent years, many convolutional neural network based trackers have achieved impressive performance by integrating multi-layer features. However, in order to conduct multi-scale feature fusion, most of these trackers recover high-resolution presentations from low-resolution representations produced by a high-to-low resolution network, which tend to result in inaccurate feature maps or lose of details of the target object. In this paper, we propose an end-to-end region-based high-resolution fully convolutional Siamese network for tracking. In the tracker, we propose to extract the spatial information and semantic information of the target object using a high-resolution network that maintains rich high-resolution representations of the target object through the whole process. Furthermore, a set of position-sensitive score maps are obtained for all regions of the target template, and an adaptive weighting method is proposed to fuse score maps of multiple regions. Experimental results on the OTB50 and OTB100 benchmark datasets demonstrate that our tracker performs better than several state-of-the-art trackers while running in real-time.
引用
收藏
页码:107 / 112
页数:6
相关论文
共 17 条
  • [1] Deformable part-based tracking by coupled global and local correlation filters
    Akin, Osman
    Erdem, Erkut
    Erdem, Aykut
    Mikolajczyk, Krystian
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 38 : 763 - 774
  • [2] Staple: Complementary Learners for Real-Time Tracking
    Bertinetto, Luca
    Valmadre, Jack
    Golodetz, Stuart
    Miksik, Ondrej
    Torr, Philip H. S.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1401 - 1409
  • [3] Fully-Convolutional Siamese Networks for Object Tracking
    Bertinetto, Luca
    Valmadre, Jack
    Henriques, Joao F.
    Vedaldi, Andrea
    Torr, Philip H. S.
    [J]. COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 : 850 - 865
  • [4] Instance-Sensitive Fully Convolutional Networks
    Dai, Jifeng
    He, Kaiming
    Li, Yi
    Ren, Shaoqing
    Sun, Jian
    [J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 534 - 549
  • [5] Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036
  • [6] Dong Z, 2018, INT CONF POW ELECTR, P459, DOI 10.23919/IPEC.2018.8507611
  • [7] Fan H., 2017, P IEEE C COMP VIS PA, P42
  • [8] Deep visual tracking: Review and experimental comparison
    Li, Peixia
    Wang, Dong
    Wang, Lijun
    Lu, Huchuan
    [J]. PATTERN RECOGNITION, 2018, 76 : 323 - 338
  • [9] Hierarchical Convolutional Features for Visual Tracking
    Ma, Chao
    Huang, Jia-Bin
    Yang, Xiaokang
    Yang, Ming-Hsuan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3074 - 3082
  • [10] Sun K., 2019, P IEEE C COMP VIS PA