Deep Learning-Based 3D Multi-Object Tracking Using Multimodal Fusion in Smart Cities

被引:1
|
作者
Li, Hui [1 ,2 ]
Liu, Xiang [1 ]
Jia, Hong [3 ]
Ahanger, Tariq Ahamed [4 ]
Xu, Lingwei [1 ,2 ,3 ]
Alzamil, Zamil [5 ]
Li, Xingwang [6 ]
机构
[1] Qingdao Univ Sci & Technol, Sch Informat Sci & Technol, Qingdao, Peoples R China
[2] Minist Educ, Engn Res Ctr Integrat & Applicat Digital Learning, Beijing, Peoples R China
[3] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen, Peoples R China
[4] Prince Sattam bin Abdulaziz Univ, Coll Comp Engn & Sci, Alkharj, Saudi Arabia
[5] Majmaah Univ, Coll Comp & Informat Sci, Dept Comp Sci, Al Majmaah, Saudi Arabia
[6] Henan Polytech Univ, Sch Phys & Elect Informat Engn, Jiaozuo, Peoples R China
基金
中国国家自然科学基金;
关键词
Smart Cities; Visual Perception; 3D Multi-Object Tracking; Multimodal Feature Fusion; Position Affinity; Matrix; Data Association; OBJECT DETECTION; LIDAR;
D O I
10.22967/HCIS.2024.14.047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The intelligent processing of visual perception information is one of the core technologies of smart cities. Deep learning-based 3D multi-object tracking is important in improving the intelligence and safety of robots in smart cities. However, 3D multi-object tracking still faces many challenges due to the complexity of the environment and uncertainty of the object. In this paper, we make the most of the multimodal information of image and point cloud and propose a multimodal adaptive feature gating fusion module to improve the feature fusion effect. In the object association stage, we designed an orientation-position-aware affinity matrix (EO-IoU) by using Euclidean distance, orientation similarity, and intersection over union, which is more suitable for the association to solve the problem of association failure when there is little or no overlap between the detection box and the prediction box. At the same time, we adopt a more robust two-stage data association method to solve the trajectory fragmentation and identity switching caused by discarding low-scoring detection boxes. The results of extensive experiments on the KITTI and NuScenes benchmark datasets demonstrate that our method outperforms existing state-of-the-art methods with better robustness and accuracy.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] 3D multi-object tracking based on parallel multimodal data association
    Tan, Shiyu
    Li, Xu
    Xu, Qimin
    Zhu, Jianxiao
    MACHINE VISION AND APPLICATIONS, 2025, 36 (03)
  • [2] 3D LiDAR Multi-Object Tracking Using Multi Positive Contrastive Learning and Deep Reinforcement Learning
    Cho, Minho
    Kim, Euntai
    IEEE ACCESS, 2025, 13 : 12447 - 12457
  • [3] 3D Multi-Object Tracking Based on Radar-Camera Fusion
    Lin, Zihao
    Hu, Jianming
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 2502 - 2507
  • [4] Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors
    Cheng, Lei
    Sengupta, Arindam
    Cao, Siyang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17218 - 17233
  • [5] Fast and Accurate Deep Learning-Based Framework for 3D Multi-Object Detector for Autonomous Vehicles
    Hoang Duy Loc
    Kim, Gon-Woo
    2022 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (IEEE BIGCOMP 2022), 2022, : 320 - 322
  • [6] EagerMOT: 3D Multi-Object Tracking via Sensor Fusion
    Kim, Aleksandr
    Osep, Aljosa
    Leal-Taixe, Laura
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 11315 - 11321
  • [7] DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion With Deep Association
    Wang, Xiyang
    Fu, Chunyun
    Li, Zhankun
    Lai, Ying
    He, Jiawei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03): : 8260 - 8267
  • [8] A systematic survey on recent deep learning-based approaches to multi-object tracking
    Harshit Agrawal
    Agrya Halder
    Pratik Chattopadhyay
    Multimedia Tools and Applications, 2024, 83 : 36203 - 36259
  • [9] Deep Learning for Real-Time 3D Multi-Object Detection, Localisation, and Tracking: Application to Smart Mobility
    Mauri, Antoine
    Khemmar, Redouane
    Decoux, Benoit
    Ragot, Nicolas
    Rossi, Romain
    Trabelsi, Rim
    Boutteau, Remi
    Ertaud, Jean-Yves
    Savatier, Xavier
    SENSORS, 2020, 20 (02)
  • [10] A systematic survey on recent deep learning-based approaches to multi-object tracking
    Agrawal, Harshit
    Halder, Agrya
    Chattopadhyay, Pratik
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36203 - 36259