Deep Learning-Based 3D Multi-Object Tracking Using Multimodal Fusion in Smart Cities

被引:1
作者
Li, Hui [1 ,2 ]
Liu, Xiang [1 ]
Jia, Hong [3 ]
Ahanger, Tariq Ahamed [4 ]
Xu, Lingwei [1 ,2 ,3 ]
Alzamil, Zamil [5 ]
Li, Xingwang [6 ]
机构
[1] Qingdao Univ Sci & Technol, Sch Informat Sci & Technol, Qingdao, Peoples R China
[2] Minist Educ, Engn Res Ctr Integrat & Applicat Digital Learning, Beijing, Peoples R China
[3] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen, Peoples R China
[4] Prince Sattam bin Abdulaziz Univ, Coll Comp Engn & Sci, Alkharj, Saudi Arabia
[5] Majmaah Univ, Coll Comp & Informat Sci, Dept Comp Sci, Al Majmaah, Saudi Arabia
[6] Henan Polytech Univ, Sch Phys & Elect Informat Engn, Jiaozuo, Peoples R China
基金
中国国家自然科学基金;
关键词
Smart Cities; Visual Perception; 3D Multi-Object Tracking; Multimodal Feature Fusion; Position Affinity; Matrix; Data Association; OBJECT DETECTION; LIDAR;
D O I
10.22967/HCIS.2024.14.047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The intelligent processing of visual perception information is one of the core technologies of smart cities. Deep learning-based 3D multi-object tracking is important in improving the intelligence and safety of robots in smart cities. However, 3D multi-object tracking still faces many challenges due to the complexity of the environment and uncertainty of the object. In this paper, we make the most of the multimodal information of image and point cloud and propose a multimodal adaptive feature gating fusion module to improve the feature fusion effect. In the object association stage, we designed an orientation-position-aware affinity matrix (EO-IoU) by using Euclidean distance, orientation similarity, and intersection over union, which is more suitable for the association to solve the problem of association failure when there is little or no overlap between the detection box and the prediction box. At the same time, we adopt a more robust two-stage data association method to solve the trajectory fragmentation and identity switching caused by discarding low-scoring detection boxes. The results of extensive experiments on the KITTI and NuScenes benchmark datasets demonstrate that our method outperforms existing state-of-the-art methods with better robustness and accuracy.
引用
收藏
页数:19
相关论文
共 43 条
[1]  
Baser E, 2019, IEEE INT VEH SYM, P1426, DOI [10.1109/ivs.2019.8813779, 10.1109/IVS.2019.8813779]
[2]   Score refinement for confidence-based 3D multi-object tracking [J].
Benbarka, Nuri ;
Schroder, Jona ;
Zell, Andreas .
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, :8083-8090
[3]   Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics [J].
Bernardin, Keni ;
Stiefelhagen, Rainer .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
[4]   3D Multi-Object Tracking Using Graph Neural Networks With Cross-Edge Modality Attention [J].
Buechner, Martin ;
Valada, Abhinav .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) :9707-9714
[5]   nuScenes: A multimodal dataset for autonomous driving [J].
Caesar, Holger ;
Bankiti, Varun ;
Lang, Alex H. ;
Vora, Sourabh ;
Liong, Venice Erin ;
Xu, Qiang ;
Krishnan, Anush ;
Pan, Yu ;
Baldan, Giancarlo ;
Beijbom, Oscar .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628
[6]   Probabilistic 3D Multi-Modal, Multi-Object Tracking for Autonomous Driving [J].
Chiu, Hsu-kuang ;
Lie, Jie ;
Ambrus, Rares ;
Bohg, Jeannette .
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, :14227-14233
[7]   Vision meets robotics: The KITTI dataset [J].
Geiger, A. ;
Lenz, P. ;
Stiller, C. ;
Urtasun, R. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237
[8]   A niching backtracking search algorithm with adaptive local search for multimodal multiobjective optimization [J].
Hu, Zhongbo ;
Zhou, Ting ;
Su, Qinghua ;
Liu, Mianfang .
SWARM AND EVOLUTIONARY COMPUTATION, 2022, 69
[9]   A novel evolutionary algorithm based on even difference grey model [J].
Hu, Zhongbo ;
Gao, Cong ;
Su, Qinghua .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176
[10]   Joint Multi-Object Detection and Tracking with Camera-LiDAR Fusion for Autonomous Driving [J].
Huang, Kemiao ;
Hao, Qi .
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, :6983-6989