Spatially Attentive Visual Tracking Using Multi-Model Adaptive Response Fusion

被引:58
作者
Zhang, Jianming [2 ]
Wu, You [2 ]
Feng, Wenjun [2 ]
Wang, Jin [1 ,2 ]
机构
[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Hunan, Peoples R China
[2] Changsha Univ Sci & Technol, Hunan Prov Key Lab Intelligent Proc Big Data Tran, Changsha 410114, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive updating strategy; convolutional neural network; correlation filter; multi-model adaptive response fusion mechanism; object tracking; spatial attention map; OBJECT TRACKING; NETWORKS; MODEL;
D O I
10.1109/ACCESS.2019.2924944
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent years have witnessed the top performances of integrating multi-level features from the pre-trained convolutional neural network (CNN) into correlation filters framework. However, they still suffer from background interference in detection stage due to the large search region and contamination of training samples caused by inaccurate tracking. In this paper, to suppress the interference of background features in target detection stage, an effective spatial attention map (SAM) is proposed to differently weight the multi-hierarchical convolutional features from search region to obtain the attentional features. This way helps to reduce the filter values corresponding to background features. Moreover, we construct multiple elementary correlation filter (ECF) models on multi-hierarchical deep features from CNN to track the target in parallel. To further improve the tracking stability, a multi-model adaptive response fusion (MAF) mechanism is presented. The mechanism can adaptively choose the outputs of reliable ECF models for adaptive weighted fusion by evaluating the confidences of response maps generated by attentional features convolved with ECF models. Finally, to adapt the target appearance changes in the following frames and avoid model corruption, we propose an adaptive updating strategy for the updates of the SAM and ECF models. We perform comprehensive experiments on OTB-2013 and OTB-2015 datasets and the experimental results show the superiority of our algorithm over other 12 state-of-the-art approaches.
引用
收藏
页码:83873 / 83887
页数:15
相关论文
共 53 条
  • [1] [Anonymous], 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition, DOI [10.1109/CVPR.2016.465, DOI 10.1109/CVPR.2016.465]
  • [2] [Anonymous], 2012, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2012.6247881
  • [3] Staple: Complementary Learners for Real-Time Tracking
    Bertinetto, Luca
    Valmadre, Jack
    Golodetz, Stuart
    Miksik, Ondrej
    Torr, Philip H. S.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1401 - 1409
  • [4] Fully-Convolutional Siamese Networks for Object Tracking
    Bertinetto, Luca
    Valmadre, Jack
    Henriques, Joao F.
    Vedaldi, Andrea
    Torr, Philip H. S.
    [J]. COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 : 850 - 865
  • [5] Bolme DS, 2010, PROC CVPR IEEE, P2544, DOI 10.1109/CVPR.2010.5539960
  • [6] RETRACTED: The visual object tracking algorithm research based on adaptive combination kernel (Retracted Article)
    Chen, Yuantao
    Wang, Jin
    Xia, Runlong
    Zhang, Qian
    Cao, Zhouhong
    Yang, Kai
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 10 (12) : 4855 - 4867
  • [7] Attentional Correlation Filter Network for Adaptive Visual Tracking
    Choi, Jongwon
    Chang, Hyung Jin
    Yun, Sangdoo
    Fischer, Tobias
    Demiris, Yiannis
    Choi, Jin Young
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4828 - 4837
  • [8] Visual Tracking Using Attention-Modulated Disintegration and Integration
    Choi, Jongwon
    Chang, Hyung Jin
    Jeong, Jiyeoup
    Demiris, Yiannis
    Choi, Jin Young
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4321 - 4330
  • [9] Danelljan M., 2014, P BRIT MACH VIS C NO
  • [10] ECO: Efficient Convolution Operators for Tracking
    Danelljan, Martin
    Bhat, Goutam
    Khan, Fahad Shahbaz
    Felsberg, Michael
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6931 - 6939