GAL: Graph-Induced Adaptive Learning for Weakly Supervised 3D Object Detection

被引:1
作者
Yin, Dongshuo [1 ,2 ,3 ]
Yu, Hongfeng [1 ,2 ,3 ]
Liu, Nayu [1 ,2 ,3 ]
Yao, Fanglong [1 ,2 ,3 ]
He, Qibin [1 ,2 ,3 ]
Li, Jihao [1 ,2 ,3 ]
Yang, Yiran [1 ,2 ,3 ]
Yan, Shiyao [1 ,2 ,3 ]
Sun, Xian [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
关键词
Three-dimensional displays; Point cloud compression; Clustering algorithms; Object detection; Annotations; Laser radar; Adaptive learning; 3D object detection; weakly supervised learning; multimodal alignment; minimum spanning tree; NETWORK;
D O I
10.1109/TITS.2023.3265387
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Weakly Supervised 3D Object Detection (WS3DOD) aims to perform 3D object detection with little reliance on 3D labels, which greatly reduces the cost of 3D annotations. In recent literature, the pseudo-label-based approach brings impressive performance, which generates 3D pseudo-labels from 2D bounding boxes. Despite their success, two key issues remain unresolved that reduce the quality of 3D pseudo-labels: 1) the existing local object locating algorithm can not capture complete clusters of points globally, and 2) the existing algorithm can not capture sparse points caused by the unevenly distributed points obtained by LiDAR cameras. Hence, we propose GAL, a Graph-induced Adaptive Learning algorithm, to generate 3D pseudo-labels. First, we propose the Cluster Locating algorithm based on the Minimum Spanning Tree (MST) to globally locate the objects, which can leverage the characteristic that points inside an object are compact while points between objects are discrete. Second, we propose a density-guided adaptive learning algorithm to optimise the Cluster Locating algorithm, named Cuboid Drift. Cuboid Drift considers the inhomogeneous distribution of reflected points on different reflective surfaces of LiDAR imaging. Finally, 3D pseudo-labels generated by GAL are leveraged to train 3D detectors. Extensive experiments on the challenging KITTI and DAIR-V2X-V dataset demonstrate that GAL without 3D labels can be comparable with strongly supervised approaches and outperforms the previous state-of-the-art WS3DOD methods. Moreover, our method saves 88% of the time spent on pseudo-label generation.
引用
收藏
页码:9684 / 9697
页数:14
相关论文
共 67 条
[1]  
[Anonymous], 2009, P C HIGH PERFORMANCE
[2]   A Survey on 3D Object Detection Methods for Autonomous Driving Applications [J].
Arnold, Eduardo ;
Al-Jarrah, Omar Y. ;
Dianati, Mehrdad ;
Fallah, Saber ;
Oxtoby, David ;
Mouzakitis, Alex .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (10) :3782-3795
[3]  
Chen X., 2017, PROC CVPR IEEE, V1, P3, DOI [DOI 10.1109/CVPR.2017.691, 10.1109/CVPR.2017.691]
[4]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[5]  
Deng JJ, 2021, AAAI CONF ARTIF INTE, V35, P1201
[6]   RANDOM SAMPLE CONSENSUS - A PARADIGM FOR MODEL-FITTING WITH APPLICATIONS TO IMAGE-ANALYSIS AND AUTOMATED CARTOGRAPHY [J].
FISCHLER, MA ;
BOLLES, RC .
COMMUNICATIONS OF THE ACM, 1981, 24 (06) :381-395
[7]  
Geetha M., 2014, 2014 International Conference on Communications and Signal Processing (ICCSP), P833, DOI 10.1109/ICCSP.2014.6949960
[8]  
Geiger A., 2012, C COMP VIS PATT REC
[9]   Weakly Supervised Learning of Rigid 3D Scene Flow [J].
Gojcic, Zan ;
Litany, Or ;
Wieser, Andreas ;
Guibas, Leonidas J. ;
Birdal, Tolga .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :5688-5699
[10]   A survey of deep learning techniques for autonomous driving [J].
Grigorescu, Sorin ;
Trasnea, Bogdan ;
Cocias, Tiberiu ;
Macesanu, Gigel .
JOURNAL OF FIELD ROBOTICS, 2020, 37 (03) :362-386