Pedestrian Tracking Algorithm Based on Convolutional Block Attention Module and Anchor-free Detection Network

被引:2
作者
Zhang Hongying [1 ]
He Pengyi [1 ]
机构
[1] Civil Aviat Univ China, Coll Elect Informat & Automat, Tianjin 300300, Peoples R China
基金
国家重点研发计划;
关键词
IDentity switch (IDs); High-resolution feature extraction network; Convolutional Block Attention Module (CBAM); Anchor-free detection netword; Head network; FairMOT;
D O I
10.11999/JEIT210634
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
According to the target identity switch and tracking trajectory interruption, a multi-pedestrian tracking algorithm based on Convolutional Block Attention Module (CBAM) and anchor-free detection network is proposed. Firstly, attention mechanism is introduced to HrnetV2's stem stage to extract more expressive features, thus strengthening the training of re-recognition branch. Secondly, in order to improve the operation speed of algorithm, detection task and recognition one share feature weights and are carried out simultaneously. Meanwhile, the convolutional channel's number and parameter amount are reduced in the head network. Finally, the network is fully trained with proper parameters, and the algorithm is validated by multiple test sets. Experimental results show that compared with FairMOT, the accuracy of the proposed algorithm on 2DMOT15, MOT17 and MOT20 data sets is improved by 1.1%, 1.1%, 0.2% respectively, and the speed is improved by 0.82, 0.88 and 0.41 fps respectively. Compared with other mainstream algorithms, the proposed algorithm has the least number of target identity switching. The proposed algorithm improves effectively realtime performance of network model, which could be better applied to the scenes with severe occlusion.
引用
收藏
页码:3299 / 3307
页数:9
相关论文
共 17 条
[1]   Review of pedestrian tracking: Algorithms and applications [J].
Cao Zi-Qiang ;
Sai Bin ;
Lu Xin .
ACTA PHYSICA SINICA, 2020, 69 (08)
[2]  
Dendorfer P., 2020, arXiv, DOI DOI 10.48550/ARXIV.2003.09003
[3]   CornerNet: Detecting Objects as Paired Keypoints [J].
Law, Hei ;
Deng, Jia .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :765-781
[4]  
Leal-Taix‚ L, 2015, Arxiv, DOI [arXiv:1504.01942, 10.48550/arXiv.1504.01942, DOI 10.48550/ARXIV.1504.01942]
[5]  
Li ZM, 2017, Arxiv, DOI [arXiv:1711.07264, 10.48550/arXiv.1711.07264, DOI 10.48550/ARXIV.1711.07264]
[6]  
Liang C, 2022, Arxiv, DOI arXiv:2010.12138
[7]  
Milan A, 2016, Arxiv, DOI [arXiv:1603.00831, 10.48550/arXiv.1603.00831, DOI 10.48550/ARXIV.1603.00831]
[8]   TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model [J].
Pang, Bo ;
Li, Yizhuo ;
Zhang, Yifan ;
Li, Muchen ;
Lu, Cewu .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6307-6317
[9]  
Sun K, 2019, Arxiv, DOI arXiv:1904.04514
[10]  
Wang ZD, 2020, Arxiv, DOI arXiv:1909.12605