Multi-modal multi-task feature fusion for RGBT tracking

被引:25
作者
Cai, Yujue [1 ]
Sui, Xiubao [1 ]
Gu, Guohua [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210014, Peoples R China
基金
中国国家自然科学基金;
关键词
RGBT tracking; Auxiliary learning; Contrastive learning; Semantic matching; Instance segmentation; NETWORK;
D O I
10.1016/j.inffus.2023.101816
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RGBT tracking has received more and more attention in recent years, and in this paper, we propose a multi-task auxiliary learning framework for RGBT tracking. Specifically, we simplify the tracking task to an instance classification task and make it the primary task of the framework. We designed three auxiliary tasks and used a hard-parameter sharing approach to jointly train multiple tasks, hoping that the primary task would benefit from them. The three auxiliary tasks are contrastive instance discrimination, one-shot instance segmentation, and instance semantic matching. The contrastive instance discrimination method promotes the classification process of the primary task by constraining the features in the representation space. One-shot instance segmentation trains the network in a weakly supervised way to focus on more fine-grained features. In addition, in order to make the network pay more attention to the invariant features of instance target during tracking, we introduce a semantic matching task to alleviate the model drift problem caused by time change. Based on the results on three RGBT tracking benchmarks, the proposed framework is not inferior to the state-of-the-art trackers.
引用
收藏
页数:17
相关论文
共 72 条
[1]  
Alonso H.M., 2016, arXiv
[2]  
[Anonymous], 27 INT C MACH LEARN
[3]   Learning Discriminative Model Prediction for Tracking [J].
Bhat, Goutam ;
Danelljan, Martin ;
Van Gool, Luc ;
Timofte, Radu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6181-6190
[4]  
Bingel J, 2017, Arxiv, DOI arXiv:1702.08303
[5]  
Chen XL, 2020, Arxiv, DOI arXiv:2003.04297
[6]  
Chen Z, 2018, PR MACH LEARN RES, V80
[7]   Challenge-Aware RGBT Tracking [J].
Li, Chenglong ;
Liu, Lei ;
Lu, Andong ;
Ji, Qing ;
Tang, Jin .
COMPUTER VISION - ECCV 2020, PT XXII, 2020, 12367 :222-237
[8]   Self-support Few-Shot Semantic Segmentation [J].
Fan, Qi ;
Pei, Wenjie ;
Tai, Yu-Wing ;
Tang, Chi-Keung .
COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 :701-719
[9]  
Fifty Christopher, 2021, ADV NEUR IN, V34
[10]  
Gao Yuan, 2019, P IEEECVF INT C COMP