Online Multiple Object Tracking Using Spatial Pyramid Pooling Hashing and Image Retrieval for Autonomous Driving

被引:7
作者
Wei, Hongjian [1 ,2 ]
Huang, Yingping [1 ]
机构
[1] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
[2] Fuyang Normal Univ, Sch Phys & Elect Engn, Fuyang 236037, Peoples R China
关键词
multiple object tracking; spatial pyramid pooling hashing; image retrieval; object representation; data association; DEEP;
D O I
10.3390/machines10080668
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multiple object tracking (MOT) is a fundamental issue and has attracted considerable attention in the autonomous driving community. This paper presents a novel MOT framework for autonomous driving. The framework consists of two stages of object representation and data association. In the stage of object representation, we employ appearance, motion, and position features to characterize objects. We design a spatial pyramidal pooling hash network (SPPHNet) to generate the appearance features. Multiple-level representative features in the SPPHNet are mapped into a similarity-preserving binary space, called hash features. The hash features retain the visual discriminability of high-dimensional features and are beneficial for computational efficiency. For data association, a two-tier data association scheme is designed to address the occlusion issue, consisting of an affinity cost model and a hash-based image retrieval model. The affinity cost model accommodates the hash features, disparity, and optical flow as the first tier of data association. The hash-based image retrieval model exploits the hash features and adopts image retrieval technology to handle reappearing objects as the second tier of data association. Experiments on the KITTI public benchmark dataset and our campus scenario sequences show that our method has superior tracking performance to the state-of-the-art vision-based MOT methods.
引用
收藏
页数:20
相关论文
共 37 条
[1]   Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics [J].
Bernardin, Keni ;
Stiefelhagen, Rainer .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
[2]   YOLOv4-5D: An Effective and Efficient Object Detector for Autonomous Driving [J].
Cai, Yingfeng ;
Luan, Tianyu ;
Gao, Hongbo ;
Wang, Hai ;
Chen, Long ;
Li, Yicheng ;
Sotelo, Miguel Angel ;
Li, Zhixiong .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
[3]  
Cai YJ, 2020, AAAI CONF ARTIF INTE, V34, P10478
[4]   Deep Cauchy Hashing for Hamming Space Retrieval [J].
Cao, Yue ;
Long, Mingsheng ;
Liu, Bin ;
Wang, Jianmin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1229-1237
[5]  
Chaabane M., 2021, ARXIV
[6]   Online Obstacle Trajectory Prediction for Autonomous Buses [J].
Chong, Yue Linn ;
Lee, Christina Dao Wen ;
Chen, Liushifeng ;
Shen, Chongjiang ;
Chan, Ken Kok Hoe ;
Ang, Marcelo H., Jr. .
MACHINES, 2022, 10 (03)
[7]   Vision meets robotics: The KITTI dataset [J].
Geiger, A. ;
Lenz, P. ;
Stiller, C. ;
Urtasun, R. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237
[8]  
Gonzalez Nicolas Franco, 2020, Image Analysis and Recognition. 17th International Conference (ICIAR 2020). Proceedings. Lecture Notes in Computer Science (LNCS 12132), P48, DOI 10.1007/978-3-030-50516-5_5
[9]   An edge-aware based adaptive multi-feature set extraction for stereo matching of binocular images [J].
Haq, Qazi Mazhar ul ;
Lin, Chang Hong ;
Ruan, Shanq-Jang ;
Gregor, Derlis .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (04) :1953-1967
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778