Real-Time Multiple Pedestrian Tracking With Joint Detection and Embedding Deep Learning Model for Embedded Systems

被引:4
作者
Lin, Hung-Wei [1 ]
Shivanna, Vinay Malligere [1 ]
Chang, Hsiu Chi [2 ]
Guo, Jiun-In [1 ,3 ,4 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Inst Elect, Dept Elect Engn, Hsinchu 30010, Taiwan
[2] Chunghua Telecom Co Ltd, Taoyuan 32661, Taiwan
[3] Natl Yang Ming Chiao Tung Univ, Pervas Artificial Intelligence Res PAIR Labs, Hsinchu 30010, Taiwan
[4] Natl Yang Ming Chiao Tung Univ, Wistron NCTU Embedded Artificial Intelligence Res, Hsinchu 30010, Taiwan
关键词
Detectors; Feature extraction; Computational modeling; Object tracking; Object detection; Target tracking; Real-time systems; Multiple object tracking; embedded system; advanced driver assistance system (ADAS); smart transportation;
D O I
10.1109/ACCESS.2022.3173408
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes an improvement to the multi-object tracking system framework based on the image inputs. By analyzing the role and performance of each block in the original multi-objects tracking system, the blocks of the original system are reconstructed to enhance the efficiency and yield a faster processing speed suiting the real-time applications. In the proposed method, the first two parts of the multi-object tracking system are merged into a single neural network designed for object detection and feature extraction. A new object association judgment method and JDE inspired prediction head are included in order to achieve a better and an outstanding association effect resulting in the overall improvement of the original system by 45.2%. The enhanced method is aimed at the application of smart roadside units and uses fixed-viewpoint image input to achieve multi-object tracking on embedded platforms. The proposed method is implemented on the NVIDIA Jetson AGX Xavier embedded platform. The NVIDIA TensorRT software development kit is used to accelerate the neural network. The overall performance of the proposed system yields better efficiency compared to that of the original SDE design and the overall computing performance achieve up to 14-26 images per second, making it ideal for the real-time smart roadside unit applications.
引用
收藏
页码:51458 / 51471
页数:14
相关论文
共 48 条
[1]  
[Anonymous], INTELLIGENT TRANSPOR
[2]   Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics [J].
Bernardin, Keni ;
Stiefelhagen, Rainer .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
[3]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[4]  
Bochkovskiy A., 2020, ARXIV 200410934
[5]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[6]  
Dollár P, 2009, PROC CVPR IEEE, P304, DOI 10.1109/CVPRW.2009.5206631
[7]  
Ess A, 2008, PROC CVPR IEEE, P1857
[8]  
Freeway Bureau Ministry of Transportation and Communications(MOTC) Taiwan, 2008, INTR INT TRANSP SYST
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]