A Memory Model Based on the Siamese Network for Long-Term Tracking

Cited by: 4
Authors
Lee, Hankyeol [1]
Choi, Seokeon [1]
Kim, Changick [1]
Affiliations
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon, South Korea
Source
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I | 2019 / Vol. 11129
Keywords
Long-term tracking; Atkinson-Shiffrin Memory Model; Siamese network; Regional Maximum Activation of Convolutions
DOI
10.1007/978-3-030-11009-3_5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose a novel memory model that uses deep convolutional features for long-term tracking and handles challenging issues such as visual deformation and target disappearance. Inspired by the Atkinson-Shiffrin Memory Model (ASMM), our memory model is separated into short-term and long-term stores. In the tracking step, the bounding box of the target is estimated from the Siamese features obtained from both memory stores, which accommodates changes in the visual appearance of the target. In the re-detection step, we use features only from the long-term store to alleviate the drift problem. Here, we adopt a coarse-to-fine strategy to detect the target in the entire image without depending on its previous position. Finally, we employ Regional Maximum Activation of Convolutions (R-MAC) as the key criterion. Our tracker achieves an F-score of 0.52 on the LTB35 dataset, which is 0.04 higher than that of the state-of-the-art algorithm.
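The split between the two stores can be illustrated with a minimal sketch. It is not the authors' implementation: plain Python lists and cosine similarity stand in for the Siamese features and the R-MAC comparison, and the class name `TwoStoreMemory`, the `reliable` flag, and the short-term capacity are hypothetical choices made only for illustration.

```python
from collections import deque
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class TwoStoreMemory:
    """Hypothetical two-store memory (illustrative assumption only)."""

    def __init__(self, short_capacity=3):
        # Short-term store: a bounded queue of recent target appearances.
        self.short_term = deque(maxlen=short_capacity)
        # Long-term store: appearances considered trustworthy.
        self.long_term = []

    def remember(self, feature, reliable=False):
        """Store a feature; the 'reliable' promotion rule is an assumption."""
        self.short_term.append(feature)
        if reliable:
            self.long_term.append(feature)

    def tracking_score(self, candidate):
        # Tracking consults both stores to follow appearance changes.
        templates = list(self.short_term) + self.long_term
        return max((cosine(candidate, t) for t in templates), default=0.0)

    def redetection_score(self, candidate):
        # Re-detection consults only the long-term store to limit drift.
        return max((cosine(candidate, t) for t in self.long_term), default=0.0)


memory = TwoStoreMemory()
memory.remember([1.0, 0.0], reliable=True)  # e.g. the initial template
memory.remember([0.0, 1.0])                 # a recent, unverified appearance
```

In this toy setup a candidate that matches only the recent appearance scores highly during tracking but not during re-detection, which mirrors the abstract's point that unverified short-term appearances should not steer the search for a lost target.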
Pages: 100-115
Number of pages: 16
Related papers
34 in total
  • [1] A Lukelie, 2017, arXiv:1711.09594
  • [2] [Anonymous], 2017, CVPR
  • [3] [Anonymous], 2016, INT C LEARNING REPRE
  • [4] [Anonymous], 2016, CVPR
  • [5] [Anonymous], 2017, arXiv:1708.00153
  • [6] Atkinson R., 1968, Psychology of Learning and Motivation, V2, P89, DOI 10.1016/S0079-7421(08)60422-3
  • [7] Fully-Convolutional Siamese Networks for Object Tracking
    Bertinetto, Luca
    Valmadre, Jack
    Henriques, Joao F.
    Vedaldi, Andrea
    Torr, Philip H. S.
    [J]. COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914: 850-865
  • [8] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005: 886-893
  • [9] Danelljan M., 2014, P BRIT MACH VIS C
  • [10] Learning Spatially Regularized Correlation Filters for Visual Tracking
    Danelljan, Martin
    Hager, Gustav
    Khan, Fahad Shahbaz
    Felsberg, Michael
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015: 4310-4318