Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking

被引:221
作者
Cao, Jinkun [1 ]
Pang, Jiangmiao [2 ]
Weng, Xinshuo [3 ]
Khirodkar, Rawal [1 ]
Kitani, Kris [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Shanghai AI Lab, Shanghai, Peoples R China
[3] Nvidia, Santa Clara, CA USA
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
关键词
FILTER;
D O I
10.1109/CVPR52729.2023.00934
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Kalman filter (KF) based methods for multi-object tracking (MOT) make an assumption that objects move linearly. While this assumption is acceptable for very short periods of occlusion, linear estimates of motion for prolonged time can be highly inaccurate. Moreover, when there is no measurement available to update Kalman filter parameters, the standard convention is to trust the priori state estimations for posteriori update. This leads to the accumulation of errors during a period of occlusion. The error causes significant motion direction variance in practice. In this work, we show that a basic Kalman filter can still obtain state-of-the-art tracking performance if proper care is taken to fix the noise accumulated during occlusion. Instead of relying only on the linear state estimate (i.e., estimation-centric approach), we use object observations (i.e., the measurements by object detector) to compute a virtual trajectory over the occlusion period to fix the error accumulation of filter parameters. This allows more time steps to correct errors accumulated during occlusion. We name our method Observation-Centric SORT (OC-SORT). It remains Simple, Online, and Real-Time but improves robustness during occlusion and non-linear motion. Given off-the-shelf detections as input, OC-SORT runs at 700+ FPS on a single CPU. It achieves state-of-the-art on multiple datasets, including MOT17, MOT20, KITTI, head tracking, and especially DanceTrack where the object motion is highly non-linear. The code and models are available at https://github.com/noahcao/OC_SORT.
引用
收藏
页码:9686 / 9696
页数:11
相关论文
共 75 条
  • [1] [Anonymous], 2020, INT C IM AN REC, DOI DOI 10.1109/ICSMARTGRID49881.2020.9144773
  • [2] The Probabilistic Data Association Filter ESTIMATION IN THE PRESENCE OF MEASUREMENT ORIGIN UNCERTAINTY
    Bar-Shalom, Yaakov
    Daum, Fred
    Huang, Jim
    [J]. IEEE CONTROL SYSTEMS MAGAZINE, 2009, 29 (06): : 82 - 100
  • [3] Basu S., 1996, Proceedings of the 13th International Conference on Pattern Recognition, P611, DOI 10.1109/ICPR.1996.547019
  • [4] Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
  • [5] Learning a Neural Solver for Multiple Object Tracking
    Braso, Guillem
    Leal-Taixe, Laura
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6246 - 6256
  • [6] Cai Jiarui, 2022, CVPR
  • [7] Cross-Domain Adaptation for Animal Pose Estimation
    Cao, Jinkun
    Tang, Hongyang
    Fang, Hao-Shu
    Shen, Xiaoyong
    Lu, Cewu
    Tai, Yu-Wing
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9497 - 9506
  • [8] Cao Jinkun, 2022, ARXIV221009455
  • [9] Cao Jinkun, Object tracking by hierarchical partwhole attention
  • [10] Argoverse: 3D Tracking and Forecasting with Rich Maps
    Chang, Ming-Fang
    Lambert, John
    Sangkloy, Patsorn
    Singh, Jagjeet
    Bak, Slawomir
    Hartnett, Andrew
    Wang, De
    Carr, Peter
    Lucey, Simon
    Ramanan, Deva
    Hays, James
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8740 - 8749