SiamAda: visual tracking based on Siamese adaptive learning network

Authors
Xin Lu
Fusheng Li
Wanqi Yang
Affiliations
[1] University of Electronic Science and Technology of China, School of Automation Engineering
[2] University of Electronic Science and Technology of China, Yangtze Delta Region Institute (Huzhou)
Source
Neural Computing and Applications | 2024, Vol. 36
Keywords
Siamese trackers; Anchor assignment; Localization branch; Multi-task learning
Abstract
Siamese trackers based on region proposal networks (RPN) have recently become popular. However, designing an RPN requires manual tuning of parameters such as the object-anchor intersection over union (IoU) and the relative weights of the different tasks, which makes model training difficult and expensive. To address this issue, we propose a novel Siamese adaptive learning network (SiamAda) for visual tracking, which allows the model to be trained in a flexible way. Instead of IoU-based anchor assignment, the proposed network uses spatial alignment and model learning status as criteria for evaluating anchor quality, and a Gaussian mixture distribution for adaptive assignment. Moreover, to address the inconsistency between classification confidence and localization accuracy, a localization branch is designed to predict the IoU of each candidate anchor box and thereby assess localization quality. Furthermore, to avoid tricky tuning of the relative weights between the task losses, multi-task learning with homoscedastic uncertainty is employed to weigh the losses adaptively. Extensive experiments on challenging benchmarks, namely OTB2015, VOT2018, DTB70, UAV20L, GOT-10k and LaSOT, validate the superiority of our tracker. The ablation studies also illustrate the advantage of each strategy presented in this paper.
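The abstract does not give the exact form of the uncertainty-based loss weighting, so the following is only a minimal sketch of one widely used formulation of homoscedastic-uncertainty weighting, in which each task loss L_i is scaled by exp(-s_i) and regularized by s_i, with s_i = log(sigma_i^2) a learnable scalar. The function names and the three-task example (classification, regression, IoU prediction) are assumptions for illustration, not the authors' implementation.

```python
import math


def uncertainty_weighted_loss(losses, log_vars):
    """Combine task losses as sum_i exp(-s_i) * L_i + s_i.

    losses   : per-task loss values L_i (hypothetical example inputs)
    log_vars : per-task log-variances s_i = log(sigma_i^2); in training
               these would be learnable parameters, here plain floats.
    """
    if len(losses) != len(log_vars):
        raise ValueError("losses and log_vars must have the same length")
    return sum(math.exp(-s) * loss + s for loss, s in zip(losses, log_vars))


def optimal_log_var(loss):
    """For a fixed positive loss L, exp(-s) * L + s is minimized at s = log(L)."""
    return math.log(loss)


# Hypothetical loss values for classification, box regression, IoU prediction.
task_losses = [0.8, 2.5, 0.3]
equal = uncertainty_weighted_loss(task_losses, [0.0, 0.0, 0.0])
tuned = uncertainty_weighted_loss(
    task_losses, [optimal_log_var(loss) for loss in task_losses]
)
print(equal, tuned)
```

With all s_i set to zero the combination reduces to a plain sum of the losses; letting each s_i adapt down-weights tasks with large loss values, which is the sense in which the relative weights no longer need manual tuning.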
Pages: 7639-7656
Number of pages: 17