GAL: Graph-Induced Adaptive Learning for Weakly Supervised 3D Object Detection

被引:1
作者
Yin, Dongshuo [1 ,2 ,3 ]
Yu, Hongfeng [1 ,2 ,3 ]
Liu, Nayu [1 ,2 ,3 ]
Yao, Fanglong [1 ,2 ,3 ]
He, Qibin [1 ,2 ,3 ]
Li, Jihao [1 ,2 ,3 ]
Yang, Yiran [1 ,2 ,3 ]
Yan, Shiyao [1 ,2 ,3 ]
Sun, Xian [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
关键词
Three-dimensional displays; Point cloud compression; Clustering algorithms; Object detection; Annotations; Laser radar; Adaptive learning; 3D object detection; weakly supervised learning; multimodal alignment; minimum spanning tree; NETWORK;
D O I
10.1109/TITS.2023.3265387
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Weakly Supervised 3D Object Detection (WS3DOD) aims to perform 3D object detection with little reliance on 3D labels, which greatly reduces the cost of 3D annotations. In recent literature, the pseudo-label-based approach brings impressive performance, which generates 3D pseudo-labels from 2D bounding boxes. Despite their success, two key issues remain unresolved that reduce the quality of 3D pseudo-labels: 1) the existing local object locating algorithm can not capture complete clusters of points globally, and 2) the existing algorithm can not capture sparse points caused by the unevenly distributed points obtained by LiDAR cameras. Hence, we propose GAL, a Graph-induced Adaptive Learning algorithm, to generate 3D pseudo-labels. First, we propose the Cluster Locating algorithm based on the Minimum Spanning Tree (MST) to globally locate the objects, which can leverage the characteristic that points inside an object are compact while points between objects are discrete. Second, we propose a density-guided adaptive learning algorithm to optimise the Cluster Locating algorithm, named Cuboid Drift. Cuboid Drift considers the inhomogeneous distribution of reflected points on different reflective surfaces of LiDAR imaging. Finally, 3D pseudo-labels generated by GAL are leveraged to train 3D detectors. Extensive experiments on the challenging KITTI and DAIR-V2X-V dataset demonstrate that GAL without 3D labels can be comparable with strongly supervised approaches and outperforms the previous state-of-the-art WS3DOD methods. Moreover, our method saves 88% of the time spent on pseudo-label generation.
引用
收藏
页码:9684 / 9697
页数:14
相关论文
共 67 条
[41]   CenterNet3D: An Anchor Free Object Detector for Point Cloud [J].
Wang, Guojun ;
Wu, Jian ;
Tian, Bin ;
Teng, Siyu ;
Chen, Long ;
Cao, Dongpu .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) :12953-12965
[42]   3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection [J].
Wang, He ;
Cong, Yezhen ;
Litany, Or ;
Gao, Yue ;
Guibas, Leonidas J. .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14610-14619
[43]   Feature extraction of point clouds based on region clustering segmentation [J].
Wang, XiaoHui ;
Chen, HuaWei ;
Wu, LuShen .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (17-18) :11861-11889
[44]   A Deep Neural Network With Spatial Pooling (DNNSP) for 3-D Point Cloud Classification [J].
Wang, Zhen ;
Zhang, Liqiang ;
Zhang, Liang ;
Li, Roujing ;
Zheng, Yibo ;
Zhu, Zidong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (08) :4594-4604
[45]  
Wang ZX, 2019, IEEE INT C INT ROBOT, P1742, DOI [10.1109/iros40897.2019.8968513, 10.1109/IROS40897.2019.8968513]
[46]   FGR: Frustum-Aware Geometric Reasoning for Weakly Supervised 3D Vehicle Detection [J].
Wei, Yi ;
Su, Shang ;
Lu, Jiwen ;
Zhou, Jie .
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, :4348-4354
[47]   An Extended Minimum Spanning Tree method for characterizing local urban patterns [J].
Wu, Bin ;
Yu, Bailang ;
Wu, Qiusheng ;
Chen, Zuoqi ;
Yao, Shenjun ;
Huang, Yan ;
Wu, Jianping .
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2018, 32 (03) :450-475
[48]   3D Multi-Object Tracking in Point Clouds Based on Prediction Confidence-Guided Data Association [J].
Wu, Hai ;
Han, Wenkai ;
Wen, Chenglu ;
Li, Xin ;
Wang, Cheng .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) :5668-5677
[49]   SECOND: Sparsely Embedded Convolutional Detection [J].
Yan, Yan ;
Mao, Yuxing ;
Li, Bo .
SENSORS, 2018, 18 (10)
[50]   PIXOR: Real-time 3D Object Detection from Point Clouds [J].
Yang, Bin ;
Luo, Wenjie ;
Urtasun, Raquel .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7652-7660