A Review on Deep Learning Techniques for 3D Sensed Data Classification

被引:124
作者
Griffiths, David [1 ]
Boehm, Jan [1 ]
机构
[1] UCL, Dept Civil Environm & Geomat Engn, Gower St, London WC1E 6BT, England
关键词
point cloud; deep learning; classification; semantics; segmentation; machine learning; OBJECT RECOGNITION; POINT CLOUD; CONTEXTUAL CLASSIFICATION; LIDAR DATA; SEGMENTATION; EXTRACTION; MULTIVIEW; FEATURES; IMAGES; TREES;
D O I
10.3390/rs11121499
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Over the past decade deep learning has driven progress in 2D image understanding. Despite these advancements, techniques for automatic 3D sensed data understanding, such as point clouds, is comparatively immature. However, with a range of important applications from indoor robotics navigation to national scale remote sensing there is a high demand for algorithms that can learn to automatically understand and classify 3D sensed data. In this paper we review the current state-of-the-art deep learning architectures for processing unstructured Euclidean data. We begin by addressing the background concepts and traditional methodologies. We review the current main approaches, including RGB-D, multi-view, volumetric and fully end-to-end architecture designs. Datasets for each category are documented and explained. Finally, we give a detailed discussion about the future of deep learning for 3D sensed data, using literature to justify the areas where future research would be most valuable.
引用
收藏
页数:29
相关论文
共 117 条
[81]   Deep fusion of multi-view and multimodal representation of ALS point cloud for 3D terrain scene recognition [J].
Qin, Nannan ;
Hu, Xiangyun ;
Dai, Hengming .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 143 :205-212
[82]  
Quadros A, 2012, IEEE INT CONF ROBOT, P4428, DOI 10.1109/ICRA.2012.6225239
[83]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[84]   OctNet: Learning Deep 3D Representations at High Resolutions [J].
Riegler, Gernot ;
Ulusoy, Ali Osman ;
Geiger, Andreas .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6620-6629
[85]  
Riemenschneider H, 2014, LECT NOTES COMPUT SC, V8693, P516, DOI 10.1007/978-3-319-10602-1_34
[86]   Paris-Lille-3D: A large and high-quality ground-truth urban point cloud dataset for automatic segmentation and classification [J].
Roynard, Xavier ;
Deschaud, Jean-Emmanuel ;
Goulette, Francois .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2018, 37 (06) :545-557
[87]   The Earth Mover's Distance as a metric for image retrieval [J].
Rubner, Y ;
Tomasi, C ;
Guibas, LJ .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2000, 40 (02) :99-121
[88]  
Sauder J., 2019, ARXIV190108396
[89]   The Graph Neural Network Model [J].
Scarselli, Franco ;
Gori, Marco ;
Tsoi, Ah Chung ;
Hagenbuchner, Markus ;
Monfardini, Gabriele .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (01) :61-80
[90]  
Schult J., 2018, P EUR C COMP VIS ECC