Multi-object tracking algorithm based on interactive attention network and adaptive trajectory reconnection

被引:7
作者
Ma, Sugang [1 ,4 ]
Duan, Shuaipeng [1 ]
Hou, Zhiqiang [1 ]
Yu, Wangsheng [2 ]
Pu, Lei [3 ]
Zhao, Xiangmo [4 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian 710121, Peoples R China
[2] Air Force Engn Univ, Sch Informat & Nav, Xian 710077, Peoples R China
[3] Rocket Force Engn Univ, Sch Operat Support, Xian 710025, Peoples R China
[4] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China
关键词
Multi-object tracking; Interactive attention network; Camera motion compensation; Cost matrix; Adaptive trajectory reconnection;
D O I
10.1016/j.eswa.2024.123581
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-object tracking (MOT) detects multiple targets in an image and assigns a unique identifier to each target. However, challenges such as rapid motion, occlusion, and camera motion in the tracking scene may lead to identity switches (IDs) and missing trajectory problems, which degrade the performance of the tracker. To address these issues, this paper presents an MOT algorithm based on an interactive attention network and adaptive trajectory reconnection. First, an interactive attention network was created to learn the features for two different tasks of detection and tracking to alleviate feature conflicts in order to extract sufficient feature information. A new cost matrix was then designed to fuse the motion and feature information, thereby reducing the number of IDs. Meanwhile, the extreme gradient boosting reconnection module was used to achieve adaptive trajectory reconnection and reduce missing trajectories. The proposed algorithm achieved 61.5 % and 55.4 % HOTA using the standard MOT17 and MOT20 datasets, respectively. In comparison to FairMOT, our algorithm showcased notable enhancements of 3% and 1.6% on these datasets. Furthermore, when compared to state-ofthe-art algorithms, the proposed algorithm demonstrated superior tracking performance.
引用
收藏
页数:12
相关论文
共 59 条
  • [1] Aharon N, 2022, Arxiv, DOI [arXiv:2206.14651, 10.48550/arXiv.2206.14651, DOI 10.48550/ARXIV.2206.14651]
  • [2] Tracking without bells and whistles
    Bergmann, Philipp
    Meinhardt, Tim
    Leal-Taixe, Laura
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 941 - 951
  • [3] Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
  • [4] Bochinski E, 2018, 2018 15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), P435
  • [5] MeMOT: Multi-Object Tracking with Memory
    Cai, Jiarui
    Xu, Mingze
    Li, Wei
    Xiong, Yuanjun
    Xia, Wei
    Tu, Zhuowen
    Soatto, Stefano
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8080 - 8090
  • [6] RetinaMOT: rethinking anchor-free YOLOv5 for online multiple object tracking
    Cao, Jie
    Zhang, Jianxun
    Li, Bowen
    Gao, Linfeng
    Zhang, Jie
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (05) : 5115 - 5133
  • [7] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [8] Learning a Proposal Classifier for Multiple Object Tracking
    Dai, Peng
    Weng, Renliang
    Choi, Wongun
    Zhang, Changshui
    He, Zhangping
    Ding, Wei
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2443 - 2452
  • [9] Dendorfer P., 2020, PREPRINT
  • [10] GIAOTracker: A comprehensive framework for MCMOT with global information and optimizing strategies in VisDrone 2021
    Du, Yunhao
    Wan, Junfeng
    Zhao, Yanyun
    Zhang, Binyu
    Tong, Zhihang
    Dong, Junhao
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2809 - 2819