Unifying Short and Long-Term Tracking with Graph Hierarchies

被引:47
作者
Cetintas, Orcun [1 ]
Braso, Guillem [1 ,2 ]
Leal-Taixe, Laura [1 ,3 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] Munich Ctr Machine Lean, Munich, Germany
[3] NVIDIA, Santa Clara, CA USA
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
关键词
D O I
10.1109/CVPR52729.2023.02191
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tracking objects over long videos effectively means solving a spectrum of problems, from short-term association for un-occluded objects to long-term association for objects that are occluded and then reappear in the scene. Methods tackling these two tasks are often disjoint and crafted for specific scenarios, and top-performing approaches are often a mix of techniques, which yields engineering-heavy solutions that lack generality. In this work, we question the need for hybrid approaches and introduce SUSHI, a unified and scalable multi-object tracker. Our approach processes long clips by splitting them into a hierarchy of subclips, which enables high scalability. We leverage graph neural networks to process all levels of the hierarchy, which makes our model unified across temporal scales and highly general. As a result, we obtain significant improvements over state-of-the-art on four diverse datasets. Our code and models are available at bit.ly/sushi-mot.
引用
收藏
页码:22877 / 22887
页数:11
相关论文
共 68 条
[1]  
[Anonymous], 2021, CVPR, DOI DOI 10.1109/CVPR46437.2021.00247
[2]  
[Anonymous], 2016, CVPRW
[3]  
[Anonymous], 2021, CVPR, DOI DOI 10.1109/CVPR46437.2021.00526
[4]  
[Anonymous], 2021, ICCV, DOI DOI 10.1109/ICCV48922.2021.01068
[5]  
[Anonymous], 2021, CVPR, DOI DOI 10.1109/CVPR46437.2021.00023
[6]  
[Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00682
[7]  
[Anonymous], 2017, CVPR, DOI DOI 10.1109/CVPR.2017.394
[8]  
[Anonymous], CVPR
[9]  
[Anonymous], 2017, IEEE INT C MACH LEAR
[10]  
[Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00628