3D-DIoU: 3D Distance Intersection over Union for Multi-Object Tracking in Point Cloud

被引:9
作者
Mohammed, Sazan Ali Kamal [1 ,2 ]
Razak, Mohd Zulhakimi Ab [1 ]
Rahman, Abdul Hadi Abd [3 ]
机构
[1] Univ Kebangsaan Malaysia, Inst Microengn & Nanoelect IMEN, Bangi 43600, Malaysia
[2] Erbil Polytech Univ, Erbil Technol Coll, Dept Automot Technol, Erbil 44001, Iraq
[3] Univ Kebangsaan Malaysia, Ctr Artificial Intelligence Technol, Bangi 43600, Malaysia
关键词
multi-object tracking; point cloud; 3D-DIoU; DIoU-NMS; multistage data association; tracklets; motion prediction;
D O I
10.3390/s23073390
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Multi-object tracking (MOT) is a prominent and important study in point cloud processing and computer vision. The main objective of MOT is to predict full tracklets of several objects in point cloud. Occlusion and similar objects are two common problems that reduce the algorithm's performance throughout the tracking phase. The tracking performance of current MOT techniques, which adopt the 'tracking-by-detection' paradigm, is degrading, as evidenced by increasing numbers of identification (ID) switch and tracking drifts because it is difficult to perfectly predict the location of objects in complex scenes that are unable to track. Since the occluded object may have been visible in former frames, we manipulated the speed and location position of the object in the previous frames in order to guess where the occluded object might have been. In this paper, we employed a unique intersection over union (IoU) method in three-dimension (3D) planes, namely a distance IoU non-maximum suppression (DIoU-NMS) to accurately detect objects, and consequently we use 3D-DIoU for an object association process in order to increase tracking robustness and speed. By using a hybrid 3D DIoU-NMS and 3D-DIoU method, the tracking speed improved significantly. Experimental findings on the Waymo Open Dataset and nuScenes dataset, demonstrate that our multistage data association and tracking technique has clear benefits over previously developed algorithms in terms of tracking accuracy. In comparison with other 3D MOT tracking methods, our proposed approach demonstrates significant enhancement in tracking performances.
引用
收藏
页数:15
相关论文
共 43 条
[1]   Score refinement for confidence-based 3D multi-object tracking [J].
Benbarka, Nuri ;
Schroder, Jona ;
Zell, Andreas .
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, :8083-8090
[2]   Tracking without bells and whistles [J].
Bergmann, Philipp ;
Meinhardt, Tim ;
Leal-Taixe, Laura .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :941-951
[3]   Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics [J].
Bernardin, Keni ;
Stiefelhagen, Rainer .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
[4]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[5]   nuScenes: A multimodal dataset for autonomous driving [J].
Caesar, Holger ;
Bankiti, Varun ;
Lang, Alex H. ;
Vora, Sourabh ;
Liong, Venice Erin ;
Xu, Qiang ;
Krishnan, Anush ;
Pan, Yu ;
Baldan, Giancarlo ;
Beijbom, Oscar .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628
[6]   Virtual target guidance-based distributed model predictive control for formation control of multiple UAVs [J].
Cai, Zhihao ;
Wang, Longhong ;
Zhao, Jiang ;
Wu, Kun ;
Wang, Yingxun .
CHINESE JOURNAL OF AERONAUTICS, 2020, 33 (03) :1037-1056
[7]  
Chiu HK, 2020, Arxiv, DOI arXiv:2001.05673
[8]   Probabilistic 3D Multi-Modal, Multi-Object Tracking for Autonomous Driving [J].
Chiu, Hsu-kuang ;
Lie, Jie ;
Ambrus, Rares ;
Bohg, Jeannette .
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, :14227-14233
[9]   Lane detection in intelligent vehicle system using optimal 2-tier deep convolutional neural network [J].
Dewangan, Deepak Kumar ;
Sahu, Satya Prakash .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (05) :7293-7317
[10]   The Way They Move: Tracking Multiple Targets with Similar Appearance [J].
Dicle, Caglayan ;
Camps, Octavia I. ;
Sznaier, Mario .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :2304-2311