Are We Hungry for 3D LiDAR Data for Semantic Segmentation? A Survey of Datasets and Methods

被引:60
作者
Gao, Biao [1 ,2 ]
Pan, Yancheng [1 ,2 ]
Li, Chengkun [1 ,2 ]
Geng, Sibo [1 ,2 ]
Zhao, Huijing [1 ,2 ]
机构
[1] Peking Univ, Key Lab Machine Percept MOE, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Data hunger; 3D LiDAR; semantic segmentation; deep learning; POINT CLOUDS; NEURAL-NETWORKS; CLASSIFICATION; MARGIN; IMAGE;
D O I
10.1109/TITS.2021.3076844
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
3D semantic segmentation is a fundamental task for robotic and autonomous driving applications. Recent works have been focused on using deep learning techniques, whereas developing tine-annotated 3D LiDAR datasets is extremely labor intensive and requires professional skills. The performance limitation caused by insufficient datasets is called data hunger problem. This research provides a comprehensive survey on the question: are we hungry for 3D LiDAR data for semantic segmentation? The studies are conducted at three levels. First, a broad review to the main 3D LiDAR datasets is conducted, followed by a statistical analysis on three representative datasets to gain an in-depth view on the datasets' size, diversity and quality, which are the critical factors in learning deep models. Second, an organized survey of 3D semantic segmentation methods is given with a focus on the mainstream of the latest research trend using deep learning techniques, followed by a systematic survey to the existing efforts to solve the data hunger problem. Finally, an insightful discussion of the remaining problems on both methodological and datasets' viewpoints, and the open questions on dataset bias, domain and semantic gap are given, leading to potential topics in future works. To the best of our knowledge, this is the first work to study the data hunger problem for 3D semantic segmentation using deep learning techniques, which are addressed in both methodological and dataset review, and we share findings and discussions through a comprehensive dataset analysis.
引用
收藏
页码:6063 / 6081
页数:19
相关论文
共 205 条
[1]  
Abdou M., 2019, ARXIV190610964
[2]  
Aksoy E. Erdal, 2019, ARXIV190908291
[3]  
Alonso I., 2020, ARXIV201012239
[4]   3D-MiniNet: Learning a 2D Representation From Point Clouds for Fast and Efficient 3D LIDAR Semantic Segmentation [J].
Alonso, Inigo ;
Riazuelo, Luis ;
Montesano, Luis ;
Murillo, Ana C. .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) :5432-5439
[5]   Contextually guided semantic labeling and search for three-dimensional point clouds [J].
Anand, Abhishek ;
Koppula, Hema Swetha ;
Joachims, Thorsten ;
Saxena, Ashutosh .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (01) :19-34
[6]  
Anguelov D, 2005, PROC CVPR IEEE, P169
[7]  
Nguyen A, 2013, PROCEEDINGS OF THE 2013 6TH IEEE CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), P225, DOI 10.1109/RAM.2013.6758588
[8]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.482
[9]  
[Anonymous], 2014, ARXIV
[10]  
[Anonymous], 2013, AUSTR C ROB AUT