Enhancing Robustness of Multi-Object Trackers With Temporal Feature Mix

Authors
Shim, Kyujin [1]
Byun, Junyoung [2]
Ko, Kangwook [1]
Hwang, Jubi [1]
Kim, Changick [1]
Affiliations
[1] Korea Advanced Institute of Science and Technology (KAIST), School of Electrical Engineering, Daejeon 34141, South Korea
[2] Samsung Advanced Institute of Technology (SAIT), Suwon 16678, South Korea
Funding
National Research Foundation, Singapore
Keywords
Robustness; Target tracking; Training; Feature extraction; Circuits and systems; Noise measurement; Faces; Temporal feature mix; multi-object tracking; corruption robustness; MODEL
DOI
10.1109/TCSVT.2024.3403166
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification code
0808; 0809
Abstract
Despite recent advances, multi-object tracking (MOT), one of the major research areas in video technology, still faces various challenges, including severe occlusion and the diversity of tracking targets. In this paper, we introduce a novel strategy, Temporal Feature Mix (TFM), that improves the overall robustness of multi-object trackers in diverse scenarios. Specifically, our approach simulates new and challenging scenes by blending high-level features from temporally adjacent frames, which trains networks to localize targets better. It builds on two insights: high-level features are mainly activated on salient targets, and targets in adjacent frames are located close to one another. TFM thus offers novel and diversified training experiences to the networks through intensive augmentation of the high-level features of each target. As a result, our approach demonstrates notable performance improvements on three major MOT benchmarks and a newly constructed corruption dataset for MOT, underscoring its potential to enhance the robustness of MOT systems in real-world scenarios. All related source code is released at https://github.com/kamkyu94/Temporal_Feature_Mix.
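The feature-blending idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (see the released repository for that): it assumes a simple convex combination of two feature maps with a random weight, and the function name `temporal_feature_mix`, the parameter `max_ratio`, and the feature-map shape are all hypothetical choices made for illustration.

```python
import numpy as np


def temporal_feature_mix(feat_t, feat_adj, max_ratio=0.5, rng=None):
    """Blend high-level feature maps from two temporally adjacent frames.

    Hypothetical sketch: a convex combination with a random weight.
    The mixing rule used in the paper may differ.
    """
    if feat_t.shape != feat_adj.shape:
        raise ValueError("feature maps must have the same shape")
    if not 0.0 <= max_ratio <= 1.0:
        raise ValueError("max_ratio must lie in [0, 1]")
    rng = np.random.default_rng() if rng is None else rng
    # Weight of the adjacent frame, drawn uniformly from [0, max_ratio).
    lam = rng.uniform(0.0, max_ratio)
    # Convex combination: the current frame keeps weight (1 - lam).
    mixed = (1.0 - lam) * feat_t + lam * feat_adj
    return mixed, lam


# Random stand-ins for backbone outputs of two adjacent frames, shape (C, H, W).
rng = np.random.default_rng(0)
feat_t = rng.standard_normal((256, 20, 36))
feat_adj = rng.standard_normal((256, 20, 36))
mixed, lam = temporal_feature_mix(feat_t, feat_adj, max_ratio=0.5, rng=rng)
print(mixed.shape, round(float(lam), 3))
```

Under these assumptions, capping `max_ratio` at 0.5 keeps the current frame dominant, so the ground-truth boxes of the current frame remain a plausible training target for the mixed features.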
Pages: 9822-9835
Number of pages: 14