Multibranch Adaptive Fusion Network for RGBT Tracking

被引:15
作者
Li, Yadong [1 ]
Lai, Huicheng [1 ]
Wang, Liejun [1 ]
Jia, Zhenhong [1 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, Autonomous Univ Key Lab Signal & Informat Proc, Urumqi 830046, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolution; Target tracking; Kernel; Feature extraction; Aggregates; Sensors; Adaptive systems; RGBT tracking; multiscale feature; multibranch; adaptive fusion;
D O I
10.1109/JSEN.2022.3154657
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
RGBT tracking has been increasingly investigated in visual tracking due to the strong complementary nature of visible and infrared images. However, in the established RGBT tracking algorithms, multiscale information has not been well exploited and utilized, which limits the performance of the tracker. In this paper, a novel multibranch adaptive fusion network is proposed, which aggregates multiscale information from multiple branches. Specifically, our backbone network draws on the modified VGG-M. To extract the multiscale features, we design a multiscale adapter, which adds two small convolution kernel branches to the backbone in each layer and each modality in a parallel manner. We also design a multibranch fusion module to adaptively aggregate the features from multiple branches and the previous layer. Moreover, we propose a multimodal fusion module for aggregating features between modalities, which could mitigate the impact of noise from low-quality sources. Finally, many results on two recent RGBT tracking datasets show that our method significantly outperforms other state-of-the-art tracking methods.
引用
收藏
页码:7084 / 7093
页数:10
相关论文
共 44 条
[21]   Multi-Adapter RGBT Tracking [J].
Li, Chenglong ;
Lu, Andong ;
Zheng, Aihua ;
Tu, Zhengzheng ;
Tang, Jin .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :2262-2270
[22]   RGB-T object tracking: Benchmark and baseline [J].
Li, Chenglong ;
Liang, Xinyan ;
Lu, Yijuan ;
Zhao, Nan ;
Tang, Jin .
PATTERN RECOGNITION, 2019, 96
[23]   Weighted Sparse Representation Regularized Graph Learning for RGB-T Object Tracking [J].
Li, Chenglong ;
Zhao, Nan ;
Lu, Yijuan ;
Zhu, Chengli ;
Tang, Jin .
PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, :1856-1864
[24]   Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking [J].
Li, Chenglong ;
Zhu, Chengli ;
Huang, Yan ;
Tang, Jin ;
Wang, Liang .
COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :831-847
[25]   Learning Collaborative Sparse Representation for Grayscale-Thermal Tracking [J].
Li, Chenglong ;
Cheng, Hui ;
Hu, Shiyi ;
Liu, Xiaobai ;
Tang, Jin ;
Lin, Liang .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (12) :5743-5756
[26]   Selective Kernel Networks [J].
Li, Xiang ;
Wang, Wenhai ;
Hu, Xiaolin ;
Yang, Jian .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :510-519
[27]  
Lu A., IEEE T IMAGE PROCESS, V30, P2021
[28]   Discriminative Correlation Filter with Channel and Spatial Reliability [J].
Lukezic, Alan ;
Vojir, Tomas ;
Zajc, Luka Cehovin ;
Matas, Jiri ;
Kristan, Matej .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4847-4856
[29]  
Muller N., PROC IEEECVF C COMPU, P6071
[30]   Learning Multi-Domain Convolutional Neural Networks for Visual Tracking [J].
Nam, Hyeonseob ;
Han, Bohyung .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4293-4302