Feature enhancement and coarse-to-fine detection for RGB-D tracking

被引：0

作者：

Zhu, Xue-Feng ^{[1
]}

Xu, Tianyang ^{[1
]}

Wu, Xiao-Jun ^{[1
]}

Kittler, Josef ^{[2
]}

机构：

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China

[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, England

来源：

PATTERN RECOGNITION LETTERS | 2024年 / 179卷

基金：

英国工程与自然科学研究理事会; 中国国家自然科学基金;

关键词：

RGB-D tracking; Feature enhancement; Cross-attention; Long-term; Re-detection;

D O I：

10.1016/j.patrec.2024.02.007

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing RGB-D tracking algorithms advance the performance by constructing typical appearance models from the RGB-only tracking frameworks. There is no attempt to exploit any complementary visual information from the multi -modal input. This paper addresses this deficit and presents a novel algorithm to boost the performance of RGB-D tracking by taking advantage of collaborative clues. To guarantee input consistency, depth images are encoded into the three -channel HHA representation to create input of a similar structure to the RGB images, so that the deep CNN features can be extracted from both modalities. To highlight the discriminatory information in multi -modal features, a feature enhancement module using a cross -attention strategy is proposed. With the attention map produced by the proposed cross -attention method, the target area of the features can be enhanced and the negative influence of the background is suppressed. Besides, we address the potential tracking failure by introducing a long-term mechanism. The experimental results obtained on the well-known benchmarking datasets, including PTB, STC, and CTDB, demonstrate the superiority of the proposed RGB-D tracker. On PTB, the proposed method achieves the highest AUC scores against compared trackers across scenarios with five distinct challenging attributes. On STC and CDTB, our FECD obtains an overall AUC of 0.630 and an F -score of 0.630, respectively.

引用

页码：130 / 136

页数：7

共 50 条

[1] Coarse-to-Fine Depth Super-Resolution With Adaptive RGB-D Feature Attention
Zhang, Fan
Liu, Na
Duan, Fuqing
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2621 - 2633
[2] Coarse-to-Fine Depth Super-Resolution With Adaptive RGB-D Feature Attention
Zhang, Fan
Liu, Na
Duan, Fuqing
IEEE Transactions on Multimedia, 2024, 26 : 2621 - 2633
[3] Coarse-to-Fine semantic parsing method for RGB-D indoor scenes
Liu T.
Feng X.
Gu Y.
Dai X.
Luo J.
Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2016, 46 (04): : 681 - 687
[4] Mapping Indoor Spaces by Adaptive Coarse-to-Fine Registration of RGB-D Data
dos Santos, Daniel R.
Basso, Marcos A.
Khoshelham, Kourosh
de Oliveira, Elizeu, Jr.
Pavan, Nadisson L.
Vosselman, George
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (02) : 262 - 266
[5] Regression Forest Based RGB-D Visual Relocalization Using Coarse-to-Fine Strategy
Wang, Jikai
Wang, Peng
Dai, Deyun
Xu, Meng
Chen, Zonghai
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03): : 4431 - 4438
[6] Multi-Scale Guided Mask Refinement for Coarse-to-Fine RGB-D Perception
Chen, Chongyu
Huang, Haoguang
Chen, Chuangrong
Zheng, Zhuoqi
Cheng, Hui
IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (02) : 217 - 221
[7] Fine-To-Coarse Global Registration of RGB-D Scans
Halber, Maciej
Funkhouser, Thomas
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6660 - 6669
[8] Object Detection and Tracking in RGB-D SLAM via Hierarchical Feature Grouping
Ataer-Cansizoglu, Esra
Taguchi, Yuichi
2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4164 - 4171
[9] RGB-D Object Tracking with Occlusion Detection
Xie, Yujun
Lu, Yao
Gu, Shuang
2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 11 - 15
[10] Beyond feature integration: a coarse-to-fine framework for cascade correlation tracking
Dongdong Li
Gongjian Wen
Yangliu Kuai
Fatih Porikli
Machine Vision and Applications, 2019, 30 : 519 - 528

← 1 2 3 4 5 →