Transformer Tracking for Satellite Video: Matching, Propagation, and Prediction

被引:0
|
作者
Zhao, Manqi [1 ,2 ]
Li, Shengyang [1 ,3 ]
Yang, Jian [1 ,3 ]
机构
[1] Chinese Acad Sci, Technol & Engn Ctr Space Utilizat, Key Lab Space Utilizat, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Aeronaut & Astronaut, Beijing 100049, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Target tracking; Satellites; Transformers; Training; Object tracking; Predictive models; Pipelines; Adaptation models; Feature extraction; Accuracy; Satellite video object tracking; sequence prediction; static matching; temporal propagation; transformer; OBJECT TRACKING; CORRELATION FILTER;
D O I
10.1109/TGRS.2024.3501380
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recently, transformer-based trackers have brought overwhelming advantages in general video. However, their performance in satellite video has been hindered by insufficient satellite-specific training and a lack of designs tailored to satellite targets and scene characteristics. To tackle these challenges, we propose a novel transformer-based tracking framework for satellite video object tracking: Transformer Matching, Propagation, and Prediction (TransMPP). TransMPP combines three stages: static matching, dynamic propagation, and prediction, to ensure accurate tracking in satellite videos. Specifically, the Matching model uses a one-stream pipeline for simultaneous feature extraction and relationship modeling across extensive search and template areas, thereby improving foreground and background discrimination capabilities. In addition, the Propagation and Prediction models enhance temporal modeling capabilities through local long-term and short-term feature propagation and global sequence prediction, respectively, boosting tracking robustness. Moreover, to ensure a fair comparison and evaluation, we also developed SatSOT-train, a large-scale training dataset for the SatSOT benchmark. After comprehensive training, TransMPP demonstrates state-of-the-art (SOTA) performance on the SatSOT dataset, achieving an area under the curve (AUC) score of 59.9% and a precision score of 71.5%, bringing improvements of 6.3% and 5.3%, respectively. The code will be available at https://github.com/DonDominic/TransMPP.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Siamese Transformer Network for Real-Time Aerial Object Tracking
    Wang, Haijun
    Zhang, Shengyan
    IEEE ACCESS, 2022, 10 : 105201 - 105213
  • [42] MP2Net: Mask Propagation and Motion Prediction Network for Multiobject Tracking in Satellite Videos
    Zhao, Manqi
    Li, Shengyang
    Wang, Han
    Yang, Jian
    Sun, Yuhan
    Gu, Yanfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [43] Video Tracking Based on Template Matching and Particle Filter
    Lin, Shinfeng D.
    Chen, Ting-Yi
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [44] TFTN: A Transformer-Based Fusion Tracking Framework of Hyperspectral and RGB
    Zhao, Chunhui
    Liu, Hongjiao
    Su, Nan
    Yan, Yiming
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [45] Revisiting Monocular Satellite Pose Estimation With Transformer
    Wang, Zi
    Zhang, Zhuo
    Sun, Xiaoliang
    Li, Zhang
    Yu, Qifeng
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (05) : 4279 - 4294
  • [46] Satellite Video Tracking by Multi-Feature Correlation Filters with Motion Estimation
    Zhang, Yan
    Chen, Deng
    Zheng, Yuhui
    REMOTE SENSING, 2022, 14 (11)
  • [47] A Spatiotemporal Fusion Transformer Model for Chlorophyll-a Concentrations Prediction Over Large Areas With Satellite Time Series Data
    Zhou, Gaoxiang
    Liu, Ming
    Li, Liangzhi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [48] Relation Learning Reasoning Meets Tiny Object Tracking in Satellite Videos
    Yang, Xiaoyan
    Jiao, Licheng
    Li, Yangyang
    Liu, Xu
    Liu, Fang
    Li, Lingling
    Chen, Puhua
    Yang, Shuyuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 16
  • [49] CTsynther: Contrastive Transformer Model for End-to-End Retrosynthesis Prediction
    Lu, Hao
    Wei, Zhiqiang
    Zhang, Kun
    Wang, Xuze
    Ali, Liaqat
    Liu, Hao
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (06) : 2235 - 2245
  • [50] MR-Transformer: Multiresolution Transformer for Multivariate Time Series Prediction
    Zhu, Siying
    Zheng, Jiawei
    Ma, Qianli
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 13