Learning Localization-Aware Target Confidence for Siamese Visual Tracking

被引:20
作者
Nie, Jiahao [1 ]
He, Zhiwei [1 ]
Yang, Yuxiang [2 ]
Gao, Mingyu [1 ]
Dong, Zhekang [3 ]
机构
[1] Hangzhou Dianzi Univ, Sch Elect Informat, Hangzhou 310018, Peoples R China
[2] Univ Sci & Technol China, Sch Control Sci & Engn, Hefei 230052, Peoples R China
[3] Zhejiang Univ, Sch Elect Engn, Hangzhou 310058, Peoples R China
基金
中国国家自然科学基金;
关键词
Target tracking; Task analysis; Feature extraction; Training; Location awareness; Visualization; Smoothing methods; Localization-aware components; Siamese tracking paradigm; task misalignment; OBJECT TRACKING;
D O I
10.1109/TMM.2022.3206668
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Siamese tracking paradigm has achieved great success, providing effective appearance discrimination and size estimation by classification and regression. While such a paradigm typically optimizes the classification and regression independently, leading to task misalignment (accurate prediction boxes have no high target confidence scores). In this paper, to alleviate this misalignment, we propose a novel tracking paradigm, called SiamLA. Within this paradigm, a series of simple, yet effective localization-aware components are introduced to generate localization-aware target confidence scores. Specifically, with the proposed localization-aware dynamic label (LADL) loss and localization-aware label smoothing (LALS) strategy, collaborative optimization between the classification and regression is achieved, enabling classification scores to be aware of location state, not just appearance similarity. Besides, we propose a separate localization-aware quality prediction (LAQP) branch to produce location quality scores to further modify the classification scores. To guide a more reliable modification, a novel localization-aware feature aggregation (LAFA) module is designed and embedded into this branch. Consequently, the resulting target confidence scores are more discriminative for the location state, allowing accurate prediction boxes tend to be predicted as high scores. Extensive experiments are conducted on six challenging benchmarks, including GOT10 k, TrackingNet, LaSOT, TNL2K, OTB100 and VOT2018. Our SiamLA achieves competitive performance in terms of both accuracy and efficiency. Furthermore, a stability analysis reveals that our tracking paradigm is relatively stable, implying that the paradigm is potential for real-world applications.
引用
收藏
页码:6194 / 6206
页数:13
相关论文
共 50 条
  • [41] FPSiamRPN: Feature Pyramid Siamese Network With Region Proposal Network for Target Tracking
    Rao, Yunbo
    Cheng, Yiming
    Xue, Junmin
    Pu, Jiansu
    Wang, Qiujie
    Jin, Rize
    Wang, Qifei
    IEEE ACCESS, 2020, 8 : 176158 - 176169
  • [42] On the Use of Deep Reinforcement Learning for Visual Tracking: A Survey
    Cruciata, Giorgio
    Lo Presti, Liliana
    La Cascia, Marco
    IEEE ACCESS, 2021, 9 : 120880 - 120900
  • [43] Visual and Language Collaborative Learning for RGBT Object Tracking
    Wang, Jiahao
    Liu, Fang
    Jiao, Licheng
    Gao, Yingjia
    Wang, Hao
    Li, Shuo
    Li, Lingling
    Chen, Puhua
    Liu, Xu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12770 - 12781
  • [44] Object Tracking Using Siamese Network-Based Reinforcement Learning
    Park, Sung Jun
    Hwang, Seung Jun
    Baek, Joong-Hwan
    IEEE ACCESS, 2022, 10 : 63339 - 63352
  • [45] Visual Tracking by Adaptive Continual Meta-Learning
    Choi, Janghoon
    Baik, Sungyong
    Choi, Myungsub
    Kwon, Junseok
    Lee, Kyoung Mu
    IEEE ACCESS, 2022, 10 : 9022 - 9035
  • [46] Target-Aware Transformer Tracking
    Zheng, Yuhui
    Zhang, Yan
    Xiao, Bin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4542 - 4551
  • [47] Hierarchical Attention Siamese Network for Thermal Infrared Target Tracking
    Yuan, Di
    Liao, Donghai
    Huang, Feng
    Qiu, Zhaobing
    Shu, Xiu
    Tian, Chunwei
    Liu, Qiao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [48] Learning Adaptive Target-and-Surrounding Soft Mask for Correlation Filter Based Visual Tracking
    Zhang, Ke
    Wang, Wuwei
    Wang, Jingyu
    Wang, Qi
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3708 - 3721
  • [49] Learning Semantic-Aware Local Features for Long Term Visual Localization
    Fan, Bin
    Zhou, Junjie
    Feng, Wensen
    Pu, Huayan
    Yang, Yuzhu
    Kong, Qingqun
    Wu, Fuchao
    Liu, Hongmin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4842 - 4855
  • [50] Learning Channel-Aware Correlation Filters for Robust Object Tracking
    Nai, Ke
    Li, Zhiyong
    Wang, Haidong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7843 - 7857