Siamada: visual tracking based on Siamese adaptive learning network

被引:0
作者
Xin Lu
Fusheng Li
Wanqi Yang
机构
[1] University of Electronic Science and Technology of China,School of Automation Engineering
[2] University of Electronic Science and Technology of China,Yangtze Delta Region Institute (Huzhou)
来源
Neural Computing and Applications | 2024年 / 36卷
关键词
Siamese trackers; Anchor assignment; Localization branch; Multi-task learning;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, Siamese trackers based on region proposal networks (RPN) have gained a lot of popularity. However, the design of RPN requires manual tuning of parameters such as object-anchor intersection over union (IoU) and relative weights for different tasks, which is a difficult and expensive process for model training. To address this issue, we propose a novel Siamese adaptive learning network (SiamAda) for visual tracking, allowing the model trained in a flexible way. Rather than IoU-based anchor assignment, the proposed network uses spatial alignment and model learning status as criteria for anchor quality evaluation, and a Gaussian mixture distribution for adaptive assignment. Moreover, aiming at the inconsistency problem between classification confidence and localization accuracy, a localization branch is designed to predict the IoU for each candidate anchor box, responsible for localization quality assessment. Furthermore, to avoid the tricky relative weight tuning between each task’s loss, multi-task learning with homoscedastic uncertainty is employed to adaptively weigh these multiple losses. Extensive experiments on challenging benchmarks, namely OTB2015, VOT2018, DTB70, UAV20L, GOT-10k and LaSOT validate the superiority of our tracker. The ablation studies also illustrate the advantage of each strategy presented in this paper.
引用
收藏
页码:7639 / 7656
页数:17
相关论文
共 50 条
[41]   Simultaneous Learning of Nonlinear Manifolds Based on the Bottleneck Neural Network [J].
Seyyedsalehi, Seyyede Zohreh ;
Seyyedsalehi, Seyyed Ali .
NEURAL PROCESSING LETTERS, 2014, 40 (02) :191-209
[42]   Simultaneous Learning of Nonlinear Manifolds Based on the Bottleneck Neural Network [J].
Seyyede Zohreh Seyyedsalehi ;
Seyyed Ali Seyyedsalehi .
Neural Processing Letters, 2014, 40 :191-209
[43]   Multi-task Learning for Joint Re-identification, Team Affiliation, and Role Classification for Sports Visual Tracking [J].
Mansourian, Amir M. ;
Somers, Vladimir ;
De Vleeschouwer, Christophe ;
Kasaei, Shohreh .
PROCEEDINGS OF THE 6TH INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT ANALYSIS IN SPORTS, MMSPORTS 2023, 2023, :103-112
[44]   Deep adaptive learning for writer identification based on single handwritten word images [J].
He, Sheng ;
Schomaker, Lambert .
PATTERN RECOGNITION, 2019, 88 :64-74
[45]   Multi-Task Hierarchical Learning Based Network Traffic Analytics [J].
Barut, Onur ;
Luo, Yan ;
Zhang, Tong ;
Li, Weigang ;
Li, Peilong .
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[46]   Network Representation Learning Framework Based on Adversarial Graph Convolutional Networks [J].
Chen M. ;
Liu Y. .
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (11) :1042-1050
[47]   Design space exploration of neural network accelerator based on transfer learning [J].
Wu Y. ;
Zhi T. ;
Song X. ;
Li X. .
High Technology Letters, 2023, 29 (04) :416-426
[48]   TMLNet: Triad Multitask Learning Network for multiobjective based change detection [J].
Chouhan, Avinash ;
Sur, Arijit ;
Chutia, Dibyajyoti ;
Aggarwal, Shiv Prasad .
NEUROCOMPUTING, 2024, 599
[49]   Commonality and individuality based graph learning network for EEG emotion recognition [J].
Zhang, Tengxu ;
Zhou, Haiyan .
JOURNAL OF NEURAL ENGINEERING, 2025, 22 (03)
[50]   Chinese spelling correction based on Long Short-Term Memory Network-enhanced Transformer and dynamic adaptive weighted multi-task learning [J].
Xu, Mingying ;
Liu, Jie ;
Peng, Kui ;
Li, Zhen .
NATURAL LANGUAGE PROCESSING, 2025,