Autoregressive Visual Tracking

被引:148
作者
Wei, Xing [1 ]
Bai, Yifan [1 ]
Zheng, Yongchao [1 ]
Shi, Dahu [2 ,3 ]
Gong, Yihong [1 ]
机构
[1] Xi An Jiao Tong Univ, Xian, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
[3] Hikvis Res Inst, Hangzhou, Peoples R China
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.00935
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present ARTrack, an autoregressive framework for visual object tracking. ARTrack tackles tracking as a coordinate sequence interpretation task that estimates object trajectories progressively, where the current estimate is induced by previous states and in turn affects subsequences. This time-autoregressive approach models the sequential evolution of trajectories to keep tracing the object across frames, making it superior to existing template matching based trackers that only consider the per-frame localization accuracy. ARTrack is simple and direct, eliminating customized localization heads and post-processings. Despite its simplicity, ARTrack achieves state-of-the-art performance on prevailing benchmark datasets. Source code is available at https://github.com/MIV-XJTU/ARTrack.
引用
收藏
页码:9697 / 9706
页数:10
相关论文
共 69 条
[11]   Learning Discriminative Model Prediction for Tracking [J].
Bhat, Goutam ;
Danelljan, Martin ;
Van Gool, Luc ;
Timofte, Radu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6181-6190
[12]  
Cao Z, 2021, C COMP VIS PATT REC, P12417
[13]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[14]  
Chen T., 2022, ICLR
[15]  
Chen Ting, 2022, NEURIPS
[16]  
Cui Yutao, 2022, CVIU
[17]  
Cui Yutao, 2022, CVPR
[18]  
Dai Kenan, 2020, CVPR
[19]   ECO: Efficient Convolution Operators for Tracking [J].
Danelljan, Martin ;
Bhat, Goutam ;
Khan, Fahad Shahbaz ;
Felsberg, Michael .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6931-6939
[20]  
Danelljan Martin, 2019, CVPR