RS-Aug: Improve 3D Object Detection on LiDAR With Realistic Simulator Based Data Augmentation

被引:4
作者
An, Pei [1 ]
Liang, Junxiong [2 ]
Ma, Jie [2 ]
Chen, Yanfei [1 ]
Wang, Liheng [1 ]
Yang, You [3 ,4 ]
Liu, Qiong [3 ,4 ]
机构
[1] Wuhan Inst Technol, Sch Elect & Informat Engn, Wuhan 430205, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[4] Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Laser radar; Point cloud compression; Training; Object detection; Rendering (computer graphics); Detectors; Light detection and ranging; data augmentation; semantic segmentation; 3D object detection; autonomous driving; POINT CLOUDS;
D O I
10.1109/TITS.2023.3266727
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Light detection and ranging (LiDAR) is an essential sensor for three dimensional (3D) object detection via generating 3D point cloud of the surroundings, and it has been widely used in the various visual applications, especially autonomous driving. However, limited numbers of labeled LiDAR datasets brutally restrain the development of 3D object detector, and this situation breeds an urgent demand on data augmentation in this field. By far, most of the traditional methods reuse the labeled samples, while those unlabeled are hastily untaken. Motivated by this, we propose a Realistic Simulator based data augmentation (RS-Aug). It aims to construct augmented real scenes to enrich the diversity of training dataset. To train 3D object detector in a supervised learning way, the first step of RS-Aug is auto-annotation. Time-continuous LiDAR frames are used to construct the dense scene, which is beneficial to annotation and the subsequent rendering augmentation. However, 3D points with incorrect semantic labels are naturally gathered during multi-view reconstruction, causing the negative effect on auto-annotation. We propose an algorithm of cluster guided $k$ -nearest neighbor (c- kNN). It emphasizes on de-nosing semantic labels of clustered points using distance and intensity constraints. Then, the next step of RS-Aug is rendering augmentation on the real scene. To enhance the rendering quality using collision and distance constraints with the less computation complexity, we propose a scheme of heuristic search (HS) based object insertion. It estimates the proper position of the inserted object from 2D bird's eye view (BEV). Experiments demonstrate the de-noising accuracy of c- kNN, rendering quality of HS based object insertion, and improvement of RS-Aug on object detection.
引用
收藏
页码:10165 / 10176
页数:12
相关论文
共 40 条
  • [1] Lambertian Model-Based Normal Guided Depth Completion for LiDAR-Camera System
    An, Pei
    Fu, Wenxing
    Gao, Yingshuo
    Ma, Jie
    Zhang, Jun
    Yu, Kun
    Fang, Bin
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [2] Deep structural information fusion for 3D object detection on LiDAR-camera system
    An, Pei
    Liang, Junxiong
    Yu, Kun
    Fang, Bin
    Ma, Jie
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 214
  • [3] Bijelic M, 2020, PROC CVPR IEEE, P11679, DOI 10.1109/CVPR42600.2020.01170
  • [4] 3D Point Cloud Processing and Learning for Autonomous Driving: Impacting Map Creation, Localization, and Perception
    Chen, Siheng
    Liu, Baoan
    Feng, Chen
    Vallespi-Gonzalez, Carlos
    Wellington, Carl
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2021, 38 (01) : 68 - 86
  • [5] Deng JJ, 2021, AAAI CONF ARTIF INTE, V35, P1201
  • [6] TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments
    Dokania, Shubham
    Subramanian, Anbumani
    Chandraker, Manmohan
    Jawahar, C. V.
    [J]. COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 592 - 608
  • [7] Dosovitskiy A., 2017, P 1 ANN C ROB LEARN, P1, DOI DOI 10.48550/ARXIV.1711.03938
  • [8] Multi-level height maps-based registration method for sparse LiDAR point clouds in an urban scene
    Fang, Bin
    Ma, Jie
    An, Pei
    Wang, Zhao
    Zhang, Jun
    Yu, Kun
    [J]. APPLIED OPTICS, 2021, 60 (14) : 4154 - 4164
  • [9] LiDAR-Aug: A General Rendering-based Augmentation Framework for 3D Object Detection
    Fang, Jin
    Zuo, Xinxin
    Zhou, Dingfu
    Jin, Shengze
    Wang, Sen
    Zhang, Liangjun
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4708 - 4718
  • [10] Augmented LiDAR Simulator for Autonomous Driving
    Fang, Jin
    Zhou, Dingfu
    Yan, Feilong
    Zhao, Tongtong
    Zhang, Feihu
    Ma, Yu
    Wang, Liang
    Yang, Ruigang
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) : 1931 - 1938