Multi-object pedestrian tracking method based on improved high resolution neural network

被引:0
|
作者
Zhang H. [1 ]
He P. [1 ]
Peng X. [1 ]
机构
[1] College of Electronic Information and Automation, Civil Aviation University of China, Tianjin
关键词
Bottle2neck; Efficient Channel Attention(ECA); FairMOT; high-resolution feature extraction network; ID switch;
D O I
10.37188/OPE.20233106.0860
中图分类号
学科分类号
摘要
This study proposes an improved high-resolution neural network to address the issue of detection and tracking failures caused by target blockage in a multi-target pedestrian tracking process. First,to enhance the initial feature extraction capability of the network for pedestrian targets,a second-generation bottleneck residual block structure was introduced into the backbone of a high-resolution neural network,thus improving the receptive field and feature expression capability. Second,a new residual detection block architecture with a two-layer efficient channel attention module was designed to replace the one at the multi-scale information exchange stage of the original network,thus improving the test performance of the entire network system. Finally,the network was fully trained by selecting appropriate parameters,and subsequently,the algorithm was tested using multiple test sets. The test results indicated that the tracking accuracy of the proposed algorithm was 0. 1%,1. 6%,and 0. 8% higher than that of FairMOT on 2DMOT15,MOT17,and MOT20 datasets,respectively. In conclusion,the proposed algorithm-tracking stability for longer video sequences was greatly improved. Therefore,it can be applied to special scenarios with more targets and occlusion area. © 2023 Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:860 / 871
页数:11
相关论文
共 20 条
  • [1] CAO Z Q, Review of pedestrian tracking: Algorithms and applications[J], Acta Physica Sinica, 69, 8, pp. 41-58, (2020)
  • [2] DENG J., CornerNet:Detecting objects as paired keypoints[J], International Journal of Computer Vision, 128, 3, pp. 642-656, (2020)
  • [3] ZHOU X, W, ANG D,, KRAHENBUHL P., Objects as points [EB/OL], (2019)
  • [4] WANG J D, CHENG T H,, Et al., Deep high-resolution representation learning for visual recognition[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 43, 10, pp. 3349-3364, (2021)
  • [5] ZHAN Y, WANG X, Et al., A simple baseline for multi-object tracking[J]
  • [6] TrackFormer:multi-object tracking with transformers[EB/OL], (2021)
  • [7] PARMAR N,, Et al., Attention is all You need[EB/OL], (2017)
  • [8] ZHANG H Y, HE P Y., Pedestrian tracking algorithm based on convolutional block attention module and anchor-free detection network[J], Journal of Electronics & Information Technology, 44, 9, pp. 3299-3307, (2022)
  • [9] WANG Q L, ZHU P F,, Et al., ECA-Net:Efficient Channel Attention for Deep Convolu-tional Neural Networks[C], 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11531-11539, (2020)
  • [10] GAO S H, CHENG M M,, ZHAO K,, Et al., Res2Net:a new multi-scale backbone architecture [J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 43, 2, pp. 652-662, (2021)