Transformer Tracking for Satellite Video: Matching, Propagation, and Prediction

被引：0

作者：

Zhao, Manqi ^{[1
,2
]}

Li, Shengyang ^{[1
,3
]}

Yang, Jian ^{[1
,3
]}

机构：

[1] Chinese Acad Sci, Technol & Engn Ctr Space Utilizat, Key Lab Space Utilizat, Beijing 100094, Peoples R China

[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China

[3] Univ Chinese Acad Sci, Sch Aeronaut & Astronaut, Beijing 100049, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

关键词：

Target tracking; Satellites; Transformers; Training; Object tracking; Predictive models; Pipelines; Adaptation models; Feature extraction; Accuracy; Satellite video object tracking; sequence prediction; static matching; temporal propagation; transformer; OBJECT TRACKING; CORRELATION FILTER;

D O I：

10.1109/TGRS.2024.3501380

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Recently, transformer-based trackers have brought overwhelming advantages in general video. However, their performance in satellite video has been hindered by insufficient satellite-specific training and a lack of designs tailored to satellite targets and scene characteristics. To tackle these challenges, we propose a novel transformer-based tracking framework for satellite video object tracking: Transformer Matching, Propagation, and Prediction (TransMPP). TransMPP combines three stages: static matching, dynamic propagation, and prediction, to ensure accurate tracking in satellite videos. Specifically, the Matching model uses a one-stream pipeline for simultaneous feature extraction and relationship modeling across extensive search and template areas, thereby improving foreground and background discrimination capabilities. In addition, the Propagation and Prediction models enhance temporal modeling capabilities through local long-term and short-term feature propagation and global sequence prediction, respectively, boosting tracking robustness. Moreover, to ensure a fair comparison and evaluation, we also developed SatSOT-train, a large-scale training dataset for the SatSOT benchmark. After comprehensive training, TransMPP demonstrates state-of-the-art (SOTA) performance on the SatSOT dataset, achieving an area under the curve (AUC) score of 59.9% and a precision score of 71.5%, bringing improvements of 6.3% and 5.3%, respectively. The code will be available at https://github.com/DonDominic/TransMPP.

引用

页数：16

共 50 条

[21] Aircraft Tracking Based on an Antidrift Multifilter Tracker in Satellite Video Data
Pang, Ran
Gao, Fang
Zhang, Peng
Li, Xiangkun
Zhai, Yuwei
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 4439 - 4456
[22] Deep Learning-Based Object Tracking in Satellite Videos: A Comprehensive Survey With a New Dataset
Li, Yuxuan
Jiao, Licheng
Huang, Zhongjian
Zhang, Xin
Zhang, Ruohan
Song, Xue
Tian, Chenxi
Zhang, Zixiao
Liu, Fang
Shuyuan, Yang
Hou, Biao
Ma, Wenping
Liu, Xu
Li, Lingling
IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2022, 10 (04) : 181 - 212
[23] HRSiam: High-Resolution Siamese Network, Towards Space-Borne Satellite Video Tracking
Shao, Jia
Du, Bo
Wu, Chen
Gong, Mingming
Liu, Tongliang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3056 - 3068
[24] Sparse Transformer-Based Sequence Generation for Visual Object Tracking
Tian, Dan
Liu, Dong-Xin
Wang, Xiao
Hao, Ying
IEEE ACCESS, 2024, 12 : 154418 - 154425
[25] Small Target Tracking in Satellite Videos Using Background Compensation
Wang, Yunming
Wang, Taoyang
Zhang, Guo
Cheng, Qian
Wu, Jia-qi
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (10): : 7010 - 7021
[26] MBLT: Learning Motion and Background for Vehicle Tracking in Satellite Videos
Zhang, Wenhua
Jiao, Licheng
Liu, Fang
Li, Lingling
Liu, Xu
Liu, Jia
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[27] AFMtrack: Attention-Based Feature Matching for Multiple Object Tracking
Cuong Bui, Duy
Anh Hoang, Hiep
Yoo, Myungsik
IEEE ACCESS, 2024, 12 : 82897 - 82910
[28] Video tracking using block matching
Hariharakrishnan, K
Schonfeld, D
Raffy, P
Yassa, F
2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 945 - 948
[29] Unsupervised Nighttime Object Tracking Based on Transformer and Domain Adaptation Fusion Network
Wei, Haoran
Fu, Yanyun
Wang, Deyong
Guo, Rui
Zhao, Xueyi
Fang, Jian
IEEE ACCESS, 2024, 12 : 130896 - 130913
[30] VTST: Efficient Visual Tracking With a Stereoscopic Transformer
Gu, Fengwei
Lu, Jun
Cai, Chengtao
Zhu, Qidan
Ju, Zhaojie
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03): : 2401 - 2416

← 1 2 3 4 5 →