Deep learning-based semantic segmentation of three-dimensional point cloud: a comprehensive review

被引:7
作者
Singh, Dheerendra Pratap [1 ]
Yadav, Manohar [1 ]
机构
[1] Motilal Nehru Natl Inst Technol Allahabad, Geog Informat Syst GIS Cell, Prayagraj 211004, India
关键词
LiDAR; Point cloud; image; Deep learning; semantic segmentation; 3D OBJECT RECOGNITION; NEURAL-NETWORK; LIDAR DATA; CLASSIFICATION; FUSION; NET; DATASET;
D O I
10.1080/01431161.2023.2297177
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Point cloud has emerged as the most popular three-dimensional (3D) data format in recent years for several scientific and industrial applications. Point cloud semantic segmentation has piqued the researcher's interest, which is a crucial stage in 3D analysis and scene comprehension. Deep learning-based processing is more feasible to increase the availability of point cloud acquisition tools that is LiDAR systems at the user end. The point cloud learning achieves tremendous success in object detection, object categorization, and semantic segmentation. To summarize the recent works with chronological development, comprehensive review of projection-, voxel-, and direct point-based point cloud semantic segmentation methods is performed from various perspectives. The commonly used point cloud benchmark datasets with their characteristics are discussed, and they are used for the performance analysis and comparison of several state-of-the-art segmentation methods. The quantitative performance analysis of these deep learning models summarizes the trend of semantic segmentation of point clouds. In the context of point cloud semantic segmentation, the various methods have specific roles. Based on the review of methods working and their performance analysis, it is concluded that the projection-based methods prioritize efficiency, which is ideal in unavailability of high-performance computing system. Voxel-based methods capture overall context, serving well in 3D object classification. Point-based approaches excel in fine details and efficiency, suited for tasks like 3D semantic segmentation. Choosing the suitable method depends on the task, data, and resources. KPConv and DGCNN are popular choices, especially for precision and adaptability to point density. However, method performance varies, underlining the need for tailored selection. Hybrid approaches, combining method strengths, promise superior results.
引用
收藏
页码:532 / 586
页数:55
相关论文
共 175 条
[1]   Artificial Intelligence in Government: Potentials, Challenges, and the Future [J].
Ahn, Michael J. ;
Chen, Yu-Che .
PROCEEDINGS OF THE 21ST ANNUAL INTERNATIONAL CONFERENCE ON DIGITAL GOVERNMENT RESEARCH, DGO 2020, 2020, :243-252
[2]  
Aksoy EE, 2020, IEEE INT VEH SYM, P926, DOI [10.13140/rg.2.2.22837.83689, 10.1109/IV47402.2020.9304694]
[3]   Voxel Change: Big Data-Based Change Detection for Aerial Urban LiDAR of Unequal Densities [J].
Aljumaily, Harith ;
Laefer, Debra F. ;
Cuadra, Dolores ;
Velasco, Manuel .
JOURNAL OF SURVEYING ENGINEERING, 2021, 147 (04)
[4]   Multi Projection Fusion for Real-time Semantic Segmentation of 3D LiDAR Point Clouds [J].
Alnaggar, Yara Ali ;
Afifi, Mohamed ;
Amer, Karim ;
ElHelw, Mohamed .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :1799-1808
[5]  
[Anonymous], 2019, ISPRS INT J GEO-INF, DOI DOI 10.3390/ijgi8050213
[6]  
[Anonymous], 2015, International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences (ISPRS)
[7]   3D Semantic Parsing of Large-Scale Indoor Spaces [J].
Armeni, Iro ;
Sener, Ozan ;
Zamir, Amir R. ;
Jiang, Helen ;
Brilakis, Ioannis ;
Fischer, Martin ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1534-1543
[8]  
Athitsos V, 2008, PROC CVPR IEEE, P1666
[9]   Machine Learning-Based Supervised Classification of Point Clouds Using Multiscale Geometric Features [J].
Atik, Muhammed Enes ;
Duran, Zaide ;
Seker, Dursun Zafer .
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (03)
[10]   Point Convolutional Neural Networks by Extension Operators [J].
Atzmon, Matan ;
Maron, Haggai ;
Lipman, Yaron .
ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04)