Fusion Tree Network for RGBT Tracking

被引:7
作者
Cheng, Zhiyuan [1 ]
Lu, Andong [1 ]
Zhang, Zhang [4 ,5 ]
Li, Chenglong [2 ,3 ]
Wang, Liang [4 ,5 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei, Peoples R China
[2] Anhui Prov Key Lab Multimodal Cognit Computat, Hefei, Peoples R China
[3] Anhui Univ, Sch Artificial Intelligence, Hefei, Peoples R China
[4] Ctr Res Intelligent Percept & Comp, NLPR, CASIA, Beijing, Peoples R China
[5] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022) | 2022年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/AVSS56176.2022.9959406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RGBT tracking is often affected by complex scenes ( i.e., occlusions, scale changes, noisy background, etc). Existing works usually adopt a single-strategy RGBT tracking fusion scheme to handle modalityfitsion in all scenarios. However, due to the limitation of fusion model capacity, it is difficult to fully integrate the discriminative features between different modalities. 'lb tackle this problem, we propose a Fusion Tree Network (FTNet), which provides a multistrategy fusion model with high capacity to efficiently fuse different modalities. Specifically, we combine three kinds of attention modules ( i.e., channel attention, spatial attention, and location attention) in a tree structure to achieve multi-path hybrid attention in the deeper convolutional stages of the object tracking network Extensive experiments are performed on three RGBT tracking datasets, and the results show that our method achieves superior performance among state-of-the-art RGBT tracking models.
引用
收藏
页数:8
相关论文
共 32 条
[31]  
Zhu Y., 2018, FANET QUALITY AWARE
[32]   Dense Feature Aggregation and Pruning for RGBT Tracking [J].
Zhu, Yabin ;
Li, Chenglong ;
Luo, Bin ;
Tang, Jin ;
Wang, Xiao .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :465-472