Siamada: visual tracking based on Siamese adaptive learning network

被引:0
作者
Xin Lu
Fusheng Li
Wanqi Yang
机构
[1] University of Electronic Science and Technology of China,School of Automation Engineering
[2] University of Electronic Science and Technology of China,Yangtze Delta Region Institute (Huzhou)
来源
Neural Computing and Applications | 2024年 / 36卷
关键词
Siamese trackers; Anchor assignment; Localization branch; Multi-task learning;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, Siamese trackers based on region proposal networks (RPN) have gained a lot of popularity. However, the design of RPN requires manual tuning of parameters such as object-anchor intersection over union (IoU) and relative weights for different tasks, which is a difficult and expensive process for model training. To address this issue, we propose a novel Siamese adaptive learning network (SiamAda) for visual tracking, allowing the model trained in a flexible way. Rather than IoU-based anchor assignment, the proposed network uses spatial alignment and model learning status as criteria for anchor quality evaluation, and a Gaussian mixture distribution for adaptive assignment. Moreover, aiming at the inconsistency problem between classification confidence and localization accuracy, a localization branch is designed to predict the IoU for each candidate anchor box, responsible for localization quality assessment. Furthermore, to avoid the tricky relative weight tuning between each task’s loss, multi-task learning with homoscedastic uncertainty is employed to adaptively weigh these multiple losses. Extensive experiments on challenging benchmarks, namely OTB2015, VOT2018, DTB70, UAV20L, GOT-10k and LaSOT validate the superiority of our tracker. The ablation studies also illustrate the advantage of each strategy presented in this paper.
引用
收藏
页码:7639 / 7656
页数:17
相关论文
共 50 条
[21]   Discriminative Siamese Tracker Based on Multi-Channel-Aware and Adaptive Hierarchical Deep Features [J].
Zhang, Huanlong ;
Duan, Rui ;
Zheng, Anping ;
Zhang, Jie ;
Li, Linwei ;
Wang, Fengxian .
SYMMETRY-BASEL, 2021, 13 (12)
[22]   Nocal-Siam: Refining Visual Features and Response With Advanced Non-Local Blocks for Real-Time Siamese Tracking [J].
Tan, Huibin ;
Zhang, Xiang ;
Zhang, Zhipeng ;
Lan, Long ;
Zhang, Wenju ;
Luo, Zhigang .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :2656-2668
[23]   Multi-task Learning Based Keywords Weighted Siamese Model for Semantic Retrieval [J].
Kuang, Mengmeng ;
Chen, Zhenhong ;
Wang, Weiyan ;
Kang, Lie ;
Yan, Qiang ;
Tang, Min ;
Hao, Penghui .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III, 2023, 13937 :86-98
[24]   Robust Visual Tracking via Structured Multi-Task Sparse Learning [J].
Zhang, Tianzhu ;
Ghanem, Bernard ;
Liu, Si ;
Ahuja, Narendra .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 101 (02) :367-383
[25]   Robust Visual Tracking via Structured Multi-Task Sparse Learning [J].
Tianzhu Zhang ;
Bernard Ghanem ;
Si Liu ;
Narendra Ahuja .
International Journal of Computer Vision, 2013, 101 :367-383
[26]   Visual Odometry Algorithm Based on Deep Learning [J].
Zhang Zaiteng ;
Zhang Rongfen ;
Liu Yuhong .
LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (04)
[27]   Multi-Task Hierarchical Feature Learning for Real-Time Visual Tracking [J].
Kuai, Yangliu ;
Wen, Gongjian ;
Li, Dongdong .
IEEE SENSORS JOURNAL, 2019, 19 (05) :1961-1968
[28]   Resource-Efficient Adaptive-Network Inference Framework with Knowledge Distillation-based Unified Learning [J].
Gaire, Rebati ;
Tabrizchi, Sepehr ;
Roohi, Arman .
2024 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2024, :508-513
[29]   STAN: Stage-Adaptive Network for Multi-Task Recommendation by Learning User Lifecycle-Based Representation [J].
Li, Wanda ;
Zheng, Wenhao ;
Xiao, Xuanji ;
Wang, Suhang .
PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, :602-612
[30]   Dual adaptive learning multi-task multi-view for graph network representation learning [J].
Han, Beibei ;
Wei, Yingmei ;
Wang, Qingyong ;
Wan, Shanshan .
NEURAL NETWORKS, 2023, 162 :297-308