Feature enhancement and coarse-to-fine detection for RGB-D tracking

被引:0
|
作者
Zhu, Xue-Feng [1 ]
Xu, Tianyang [1 ]
Wu, Xiao-Jun [1 ]
Kittler, Josef [2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, England
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金;
关键词
RGB-D tracking; Feature enhancement; Cross-attention; Long-term; Re-detection;
D O I
10.1016/j.patrec.2024.02.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing RGB-D tracking algorithms advance the performance by constructing typical appearance models from the RGB-only tracking frameworks. There is no attempt to exploit any complementary visual information from the multi -modal input. This paper addresses this deficit and presents a novel algorithm to boost the performance of RGB-D tracking by taking advantage of collaborative clues. To guarantee input consistency, depth images are encoded into the three -channel HHA representation to create input of a similar structure to the RGB images, so that the deep CNN features can be extracted from both modalities. To highlight the discriminatory information in multi -modal features, a feature enhancement module using a cross -attention strategy is proposed. With the attention map produced by the proposed cross -attention method, the target area of the features can be enhanced and the negative influence of the background is suppressed. Besides, we address the potential tracking failure by introducing a long-term mechanism. The experimental results obtained on the well-known benchmarking datasets, including PTB, STC, and CTDB, demonstrate the superiority of the proposed RGB-D tracker. On PTB, the proposed method achieves the highest AUC scores against compared trackers across scenarios with five distinct challenging attributes. On STC and CDTB, our FECD obtains an overall AUC of 0.630 and an F -score of 0.630, respectively.
引用
收藏
页码:130 / 136
页数:7
相关论文
共 50 条
  • [1] Coarse-to-Fine Depth Super-Resolution With Adaptive RGB-D Feature Attention
    Zhang, Fan
    Liu, Na
    Duan, Fuqing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2621 - 2633
  • [2] Coarse-to-Fine Depth Super-Resolution With Adaptive RGB-D Feature Attention
    Zhang, Fan
    Liu, Na
    Duan, Fuqing
    IEEE Transactions on Multimedia, 2024, 26 : 2621 - 2633
  • [3] Coarse-to-Fine semantic parsing method for RGB-D indoor scenes
    Liu T.
    Feng X.
    Gu Y.
    Dai X.
    Luo J.
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2016, 46 (04): : 681 - 687
  • [4] Mapping Indoor Spaces by Adaptive Coarse-to-Fine Registration of RGB-D Data
    dos Santos, Daniel R.
    Basso, Marcos A.
    Khoshelham, Kourosh
    de Oliveira, Elizeu, Jr.
    Pavan, Nadisson L.
    Vosselman, George
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (02) : 262 - 266
  • [5] Regression Forest Based RGB-D Visual Relocalization Using Coarse-to-Fine Strategy
    Wang, Jikai
    Wang, Peng
    Dai, Deyun
    Xu, Meng
    Chen, Zonghai
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03): : 4431 - 4438
  • [6] Multi-Scale Guided Mask Refinement for Coarse-to-Fine RGB-D Perception
    Chen, Chongyu
    Huang, Haoguang
    Chen, Chuangrong
    Zheng, Zhuoqi
    Cheng, Hui
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (02) : 217 - 221
  • [7] Fine-To-Coarse Global Registration of RGB-D Scans
    Halber, Maciej
    Funkhouser, Thomas
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6660 - 6669
  • [8] Object Detection and Tracking in RGB-D SLAM via Hierarchical Feature Grouping
    Ataer-Cansizoglu, Esra
    Taguchi, Yuichi
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4164 - 4171
  • [9] RGB-D Object Tracking with Occlusion Detection
    Xie, Yujun
    Lu, Yao
    Gu, Shuang
    2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 11 - 15
  • [10] Beyond feature integration: a coarse-to-fine framework for cascade correlation tracking
    Dongdong Li
    Gongjian Wen
    Yangliu Kuai
    Fatih Porikli
    Machine Vision and Applications, 2019, 30 : 519 - 528