Learning Localization-Aware Target Confidence for Siamese Visual Tracking

被引:20
作者
Nie, Jiahao [1 ]
He, Zhiwei [1 ]
Yang, Yuxiang [2 ]
Gao, Mingyu [1 ]
Dong, Zhekang [3 ]
机构
[1] Hangzhou Dianzi Univ, Sch Elect Informat, Hangzhou 310018, Peoples R China
[2] Univ Sci & Technol China, Sch Control Sci & Engn, Hefei 230052, Peoples R China
[3] Zhejiang Univ, Sch Elect Engn, Hangzhou 310058, Peoples R China
基金
中国国家自然科学基金;
关键词
Target tracking; Task analysis; Feature extraction; Training; Location awareness; Visualization; Smoothing methods; Localization-aware components; Siamese tracking paradigm; task misalignment; OBJECT TRACKING;
D O I
10.1109/TMM.2022.3206668
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Siamese tracking paradigm has achieved great success, providing effective appearance discrimination and size estimation by classification and regression. While such a paradigm typically optimizes the classification and regression independently, leading to task misalignment (accurate prediction boxes have no high target confidence scores). In this paper, to alleviate this misalignment, we propose a novel tracking paradigm, called SiamLA. Within this paradigm, a series of simple, yet effective localization-aware components are introduced to generate localization-aware target confidence scores. Specifically, with the proposed localization-aware dynamic label (LADL) loss and localization-aware label smoothing (LALS) strategy, collaborative optimization between the classification and regression is achieved, enabling classification scores to be aware of location state, not just appearance similarity. Besides, we propose a separate localization-aware quality prediction (LAQP) branch to produce location quality scores to further modify the classification scores. To guide a more reliable modification, a novel localization-aware feature aggregation (LAFA) module is designed and embedded into this branch. Consequently, the resulting target confidence scores are more discriminative for the location state, allowing accurate prediction boxes tend to be predicted as high scores. Extensive experiments are conducted on six challenging benchmarks, including GOT10 k, TrackingNet, LaSOT, TNL2K, OTB100 and VOT2018. Our SiamLA achieves competitive performance in terms of both accuracy and efficiency. Furthermore, a stability analysis reveals that our tracking paradigm is relatively stable, implying that the paradigm is potential for real-world applications.
引用
收藏
页码:6194 / 6206
页数:13
相关论文
共 50 条
  • [21] Learning target-aware correlation filters for visual tracking
    Li, Dongdong
    Wen, Gongjian
    Kuai, Yangliu
    Xiao, Jingjing
    Porikli, Fatih
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 58 : 149 - 159
  • [22] Target Salient Confidence for Visual Tracking
    Chen, Hongkai
    Zhao, Xiaoguang
    Tan, Min
    2014 7TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP 2014), 2014, : 436 - 441
  • [23] Target-Distractor Aware Deep Tracking With Discriminative Enhancement Learning Loss
    Zhang, Huanlong
    Cheng, Liyun
    Zhang, Tianzhu
    Wang, Yanfeng
    Zhang, W. J.
    Zhang, Jie
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6267 - 6278
  • [24] Fast Learning of Spatially Regularized and Content Aware Correlation Filter for Visual Tracking
    Han, Ruize
    Feng, Wei
    Wang, Song
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7128 - 7140
  • [25] Visual Sensor Target Tracking and Localization Method for Automatic Excavators
    Liu, Guangxu
    Wang, Qingfeng
    Wang, Tao
    Li, Bingcheng
    Xi, Xiangshuo
    IEEE SENSORS JOURNAL, 2024, 24 (14) : 22814 - 22829
  • [26] Siamese Attentional Cascade Keypoints Network for Visual Object Tracking
    Wang, Ershen
    Wang, Donglei
    Huang, Yufeng
    Tong, Gang
    Xu, Song
    Pang, Tao
    IEEE ACCESS, 2021, 9 : 7243 - 7254
  • [27] SiamSampler: Video-Guided Sampling for Siamese Visual Tracking
    Li, Peixia
    Chen, Boyu
    Bai, Lei
    Qiao, Lei
    Li, Bo
    Ouyang, Wanli
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1752 - 1761
  • [28] Visual Tracking Based on Siamese Network of Fused Score Map
    Xu, Liang
    Wang, Liejun
    Zhang, Yaqin
    Cheng, Shuli
    IEEE ACCESS, 2019, 7 : 151389 - 151398
  • [29] Inverted Residual Siamese Visual Tracking With Feature Crossing Network
    Zhang, Feng
    Qian, Xiaoyan
    Han, Lei
    Shen, Yi
    IEEE ACCESS, 2021, 9 : 27158 - 27166
  • [30] Visual Tracking With Siamese Network Based on Fast Attention Network
    Qin, Lin
    Yang, Yang
    Huang, Dandan
    Zhu, Naibo
    Yang, Han
    Xu, Zhisong
    IEEE ACCESS, 2022, 10 : 35632 - 35642