Learning Localization-Aware Target Confidence for Siamese Visual Tracking

被引：20

作者：

Nie, Jiahao ^{[1
]}

He, Zhiwei ^{[1
]}

Yang, Yuxiang ^{[2
]}

Gao, Mingyu ^{[1
]}

Dong, Zhekang ^{[3
]}

机构：

[1] Hangzhou Dianzi Univ, Sch Elect Informat, Hangzhou 310018, Peoples R China

[2] Univ Sci & Technol China, Sch Control Sci & Engn, Hefei 230052, Peoples R China

[3] Zhejiang Univ, Sch Elect Engn, Hangzhou 310058, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2023年 / 25卷

基金：

中国国家自然科学基金;

关键词：

Target tracking; Task analysis; Feature extraction; Training; Location awareness; Visualization; Smoothing methods; Localization-aware components; Siamese tracking paradigm; task misalignment; OBJECT TRACKING;

D O I：

10.1109/TMM.2022.3206668

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Siamese tracking paradigm has achieved great success, providing effective appearance discrimination and size estimation by classification and regression. While such a paradigm typically optimizes the classification and regression independently, leading to task misalignment (accurate prediction boxes have no high target confidence scores). In this paper, to alleviate this misalignment, we propose a novel tracking paradigm, called SiamLA. Within this paradigm, a series of simple, yet effective localization-aware components are introduced to generate localization-aware target confidence scores. Specifically, with the proposed localization-aware dynamic label (LADL) loss and localization-aware label smoothing (LALS) strategy, collaborative optimization between the classification and regression is achieved, enabling classification scores to be aware of location state, not just appearance similarity. Besides, we propose a separate localization-aware quality prediction (LAQP) branch to produce location quality scores to further modify the classification scores. To guide a more reliable modification, a novel localization-aware feature aggregation (LAFA) module is designed and embedded into this branch. Consequently, the resulting target confidence scores are more discriminative for the location state, allowing accurate prediction boxes tend to be predicted as high scores. Extensive experiments are conducted on six challenging benchmarks, including GOT10 k, TrackingNet, LaSOT, TNL2K, OTB100 and VOT2018. Our SiamLA achieves competitive performance in terms of both accuracy and efficiency. Furthermore, a stability analysis reveals that our tracking paradigm is relatively stable, implying that the paradigm is potential for real-world applications.

引用

页码：6194 / 6206

页数：13

共 50 条

[41] FPSiamRPN: Feature Pyramid Siamese Network With Region Proposal Network for Target Tracking
Rao, Yunbo
Cheng, Yiming
Xue, Junmin
Pu, Jiansu
Wang, Qiujie
Jin, Rize
Wang, Qifei
IEEE ACCESS, 2020, 8 : 176158 - 176169
[42] On the Use of Deep Reinforcement Learning for Visual Tracking: A Survey
Cruciata, Giorgio
Lo Presti, Liliana
La Cascia, Marco
IEEE ACCESS, 2021, 9 : 120880 - 120900
[43] Visual and Language Collaborative Learning for RGBT Object Tracking
Wang, Jiahao
Liu, Fang
Jiao, Licheng
Gao, Yingjia
Wang, Hao
Li, Shuo
Li, Lingling
Chen, Puhua
Liu, Xu
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12770 - 12781
[44] Object Tracking Using Siamese Network-Based Reinforcement Learning
Park, Sung Jun
Hwang, Seung Jun
Baek, Joong-Hwan
IEEE ACCESS, 2022, 10 : 63339 - 63352
[45] Visual Tracking by Adaptive Continual Meta-Learning
Choi, Janghoon
Baik, Sungyong
Choi, Myungsub
Kwon, Junseok
Lee, Kyoung Mu
IEEE ACCESS, 2022, 10 : 9022 - 9035
[46] Target-Aware Transformer Tracking
Zheng, Yuhui
Zhang, Yan
Xiao, Bin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4542 - 4551
[47] Hierarchical Attention Siamese Network for Thermal Infrared Target Tracking
Yuan, Di
Liao, Donghai
Huang, Feng
Qiu, Zhaobing
Shu, Xiu
Tian, Chunwei
Liu, Qiao
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
[48] Learning Adaptive Target-and-Surrounding Soft Mask for Correlation Filter Based Visual Tracking
Zhang, Ke
Wang, Wuwei
Wang, Jingyu
Wang, Qi
Li, Xuelong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3708 - 3721
[49] Learning Semantic-Aware Local Features for Long Term Visual Localization
Fan, Bin
Zhou, Junjie
Feng, Wensen
Pu, Huayan
Yang, Yuzhu
Kong, Qingqun
Wu, Fuchao
Liu, Hongmin
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4842 - 4855
[50] Learning Channel-Aware Correlation Filters for Robust Object Tracking
Nai, Ke
Li, Zhiyong
Wang, Haidong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7843 - 7857

← 1 2 3 4 5 →