RS-Aug: Improve 3D Object Detection on LiDAR With Realistic Simulator Based Data Augmentation

被引：9

作者：

An, Pei ^{[1
]}

Liang, Junxiong ^{[2
]}

Ma, Jie ^{[2
]}

Chen, Yanfei ^{[1
]}

Wang, Liheng ^{[1
]}

Yang, You ^{[3
,4
]}

Liu, Qiong ^{[3
,4
]}

机构：

[1] Wuhan Inst Technol, Sch Elect & Informat Engn, Wuhan 430205, Peoples R China

[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China

[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China

[4] Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023年 / 24卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Three-dimensional displays; Laser radar; Point cloud compression; Training; Object detection; Rendering (computer graphics); Detectors; Light detection and ranging; data augmentation; semantic segmentation; 3D object detection; autonomous driving; POINT CLOUDS;

D O I：

10.1109/TITS.2023.3266727

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Light detection and ranging (LiDAR) is an essential sensor for three dimensional (3D) object detection via generating 3D point cloud of the surroundings, and it has been widely used in the various visual applications, especially autonomous driving. However, limited numbers of labeled LiDAR datasets brutally restrain the development of 3D object detector, and this situation breeds an urgent demand on data augmentation in this field. By far, most of the traditional methods reuse the labeled samples, while those unlabeled are hastily untaken. Motivated by this, we propose a Realistic Simulator based data augmentation (RS-Aug). It aims to construct augmented real scenes to enrich the diversity of training dataset. To train 3D object detector in a supervised learning way, the first step of RS-Aug is auto-annotation. Time-continuous LiDAR frames are used to construct the dense scene, which is beneficial to annotation and the subsequent rendering augmentation. However, 3D points with incorrect semantic labels are naturally gathered during multi-view reconstruction, causing the negative effect on auto-annotation. We propose an algorithm of cluster guided $k$ -nearest neighbor (c- kNN). It emphasizes on de-nosing semantic labels of clustered points using distance and intensity constraints. Then, the next step of RS-Aug is rendering augmentation on the real scene. To enhance the rendering quality using collision and distance constraints with the less computation complexity, we propose a scheme of heuristic search (HS) based object insertion. It estimates the proper position of the inserted object from 2D bird's eye view (BEV). Experiments demonstrate the de-noising accuracy of c- kNN, rendering quality of HS based object insertion, and improvement of RS-Aug on object detection.

引用

页码：10165 / 10176

页数：12

共 40 条

[1] Lambertian Model-Based Normal Guided Depth Completion for LiDAR-Camera System [J].

An, Pei ;

Fu, Wenxing ;

Gao, Yingshuo ;

Ma, Jie ;

Zhang, Jun ;

Yu, Kun ;

Fang, Bin .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19

[2] Deep structural information fusion for 3D object detection on LiDAR-camera system [J].

An, Pei ;

Liang, Junxiong ;

Yu, Kun ;

Fang, Bin ;

Ma, Jie .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 214

[3] Seeing Through Fog Without Seeing Fog: Deep Multimodal Sensor Fusion in Unseen Adverse Weather [J].

Bijelic, Mario ;

Gruber, Tobias ;

Mannan, Fahim ;

Kraus, Florian ;

Ritter, Werner ;

Dietmayer, Klaus ;

Heide, Felix .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11679-11689

[4] 3D Point Cloud Processing and Learning for Autonomous Driving: Impacting Map Creation, Localization, and Perception [J].

Chen, Siheng ;

Liu, Baoan ;

Feng, Chen ;

Vallespi-Gonzalez, Carlos ;

Wellington, Carl .

IEEE SIGNAL PROCESSING MAGAZINE, 2021, 38 (01) :68-86

[5]

Deng JJ, 2021, AAAI CONF ARTIF INTE, V35, P1201

[6] TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments [J].

Dokania, Shubham ;

Subramanian, Anbumani ;

Chandraker, Manmohan ;

Jawahar, C. V. .

COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 :592-608

[7]

Dosovitskiy A, 2017, PR MACH LEARN RES, V78

[8] Multi-level height maps-based registration method for sparse LiDAR point clouds in an urban scene [J].

Fang, Bin ;

Ma, Jie ;

An, Pei ;

Wang, Zhao ;

Zhang, Jun ;

Yu, Kun .

APPLIED OPTICS, 2021, 60 (14) :4154-4164

[9] LiDAR-Aug: A General Rendering-based Augmentation Framework for 3D Object Detection [J].

Fang, Jin ;

Zuo, Xinxin ;

Zhou, Dingfu ;

Jin, Shengze ;

Wang, Sen ;

Zhang, Liangjun .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4708-4718

[10] Augmented LiDAR Simulator for Autonomous Driving [J].

Fang, Jin ;

Zhou, Dingfu ;

Yan, Feilong ;

Zhao, Tongtong ;

Zhang, Feihu ;

Ma, Yu ;

Wang, Liang ;

Yang, Ruigang .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :1931-1938

← 1 2 3 4 →