Deep Learning-Based 3D Multi-Object Tracking Using Multimodal Fusion in Smart Cities

被引:1
|
作者
Li, Hui [1 ,2 ]
Liu, Xiang [1 ]
Jia, Hong [3 ]
Ahanger, Tariq Ahamed [4 ]
Xu, Lingwei [1 ,2 ,3 ]
Alzamil, Zamil [5 ]
Li, Xingwang [6 ]
机构
[1] Qingdao Univ Sci & Technol, Sch Informat Sci & Technol, Qingdao, Peoples R China
[2] Minist Educ, Engn Res Ctr Integrat & Applicat Digital Learning, Beijing, Peoples R China
[3] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen, Peoples R China
[4] Prince Sattam bin Abdulaziz Univ, Coll Comp Engn & Sci, Alkharj, Saudi Arabia
[5] Majmaah Univ, Coll Comp & Informat Sci, Dept Comp Sci, Al Majmaah, Saudi Arabia
[6] Henan Polytech Univ, Sch Phys & Elect Informat Engn, Jiaozuo, Peoples R China
基金
中国国家自然科学基金;
关键词
Smart Cities; Visual Perception; 3D Multi-Object Tracking; Multimodal Feature Fusion; Position Affinity; Matrix; Data Association; OBJECT DETECTION; LIDAR;
D O I
10.22967/HCIS.2024.14.047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The intelligent processing of visual perception information is one of the core technologies of smart cities. Deep learning-based 3D multi-object tracking is important in improving the intelligence and safety of robots in smart cities. However, 3D multi-object tracking still faces many challenges due to the complexity of the environment and uncertainty of the object. In this paper, we make the most of the multimodal information of image and point cloud and propose a multimodal adaptive feature gating fusion module to improve the feature fusion effect. In the object association stage, we designed an orientation-position-aware affinity matrix (EO-IoU) by using Euclidean distance, orientation similarity, and intersection over union, which is more suitable for the association to solve the problem of association failure when there is little or no overlap between the detection box and the prediction box. At the same time, we adopt a more robust two-stage data association method to solve the trajectory fragmentation and identity switching caused by discarding low-scoring detection boxes. The results of extensive experiments on the KITTI and NuScenes benchmark datasets demonstrate that our method outperforms existing state-of-the-art methods with better robustness and accuracy.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] 3D multi-object tracking based on parallel multimodal data association
    Tan, Shiyu
    Li, Xu
    Xu, Qimin
    Zhu, Jianxiao
    MACHINE VISION AND APPLICATIONS, 2025, 36 (03)
  • [2] Review of Multi-Object Tracking Based on Deep Learning
    Li, Jiaxin
    Zhao, Lei
    Zheng, Zhaohuang
    Yong, Ting
    2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 719 - 725
  • [3] Multi-Object tracking of 3D cuboids using aggregated features
    Muresan, Mircea Paul
    Nedevschi, Sergiu
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2019), 2019, : 11 - 18
  • [4] DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion With Deep Association
    Wang, Xiyang
    Fu, Chunyun
    Li, Zhankun
    Lai, Ying
    He, Jiawei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03): : 8260 - 8267
  • [5] Deep Learning for Real-Time 3D Multi-Object Detection, Localisation, and Tracking: Application to Smart Mobility
    Mauri, Antoine
    Khemmar, Redouane
    Decoux, Benoit
    Ragot, Nicolas
    Rossi, Romain
    Trabelsi, Rim
    Boutteau, Remi
    Ertaud, Jean-Yves
    Savatier, Xavier
    SENSORS, 2020, 20 (02)
  • [6] A Multi-Modal Fusion-Based 3D Multi-Object Tracking Framework With Joint Detection
    Wang, Xiyang
    Fu, Chunyun
    He, Jiawei
    Huang, Mingguang
    Meng, Ting
    Zhang, Siyu
    Zhou, Hangning
    Xu, Ziyao
    Zhang, Chi
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 532 - 539
  • [7] Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility
    Mauri, Antoine
    Khemmar, Redouane
    Decoux, Benoit
    Haddad, Madjid
    Boutteau, Remi
    JOURNAL OF IMAGING, 2021, 7 (08)
  • [8] LiDAR-based 3D Multi-object Tracking for Unmanned Vehicles
    Xiong Z.-K.
    Cheng X.-Q.
    Wu Y.-D.
    Zuo Z.-Q.
    Liu J.-S.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (10): : 2073 - 2083
  • [9] 3D Multi-object Tracking Based on Simultaneous Optimization of Object Detection and Scene Flow Estimation
    Wang, Guangming
    Song, Liang
    Shen, Yueling
    Wang, Hesheng
    Jiqiren/Robot, 2024, 46 (05): : 554 - 561
  • [10] Multi-object tracking with discriminant correlation filter based deep learning tracker
    Yang, Tao
    Cappelle, Cindy
    Ruichek, Yassine
    El Bagdouri, Mohammed
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2019, 26 (03) : 273 - 284