RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

被引:1376
作者
Hu, Qingyong [1 ]
Yang, Bo [1 ]
Xie, Linhai [1 ]
Rosa, Stefano [1 ]
Guo, Yulan [2 ,3 ]
Wang, Zhihua [1 ]
Trigoni, Niki [1 ]
Markham, Andrew [1 ]
机构
[1] Univ Oxford, Oxford, England
[2] Sun Yat Sen Univ, Guangzhou, Peoples R China
[3] Natl Univ Def Technol, Changsha, Peoples R China
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年
关键词
D O I
10.1109/CVPR42600.2020.01112
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of efficient semantic segmentation for large-scale 3D point clouds. By relying on expensive sampling techniques or computationally heavy pre/post-processing steps, most existing approaches are only able to be trained and operate over small-scale point clouds. In this paper, we introduce RandLA-Net, an efficient and lightweight neural architecture to directly infer per-point semantics for large-scale point clouds. The key to our approach is to use random point sampling instead of more complex point selection approaches. Although remarkably computation and memory efficient, random sampling can discard key features by chance. To overcome this, we introduce a novel local feature aggregation module to progressively increase the receptive field for each 3D point, thereby effectively preserving geometric details. Extensive experiments show that our RandLA-Net can process 1 million points in a single pass with up to 200x faster than existing approaches. Moreover, our RandLA-Net clearly surpasses state-of-the-art approaches for semantic segmentation on two large-scale benchmarks Semantic3D and SemanticKITTI.
引用
收藏
页码:11105 / 11114
页数:10
相关论文
共 70 条
[1]  
Abid A., 2019, ICML
[2]  
[Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00959
[3]  
[Anonymous], R RES LANDSCAPE ENV
[4]  
[Anonymous], 2017, 3dor@ eurographics
[5]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.01032
[6]  
[Anonymous], 2019, IEEE INT CONF ROBOT, DOI DOI 10.1109/ICRA.2019.8793495
[7]  
Armeni I, 2017, 3D-semantic data for indoor scene understanding
[8]   SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences [J].
Behley, Jens ;
Garbade, Martin ;
Milioto, Andres ;
Quenzel, Jan ;
Behnke, Sven ;
Stachniss, Cyrill ;
Gall, Juergen .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9296-9306
[9]   Progressive Feature Alignment for Unsupervised Domain Adaptation [J].
Chen, Chaoqi ;
Xie, Weiping ;
Huang, Wenbing ;
Rong, Yu ;
Ding, Xinghao ;
Huang, Yue ;
Xu, Tingyang ;
Huang, Junzhou .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :627-636
[10]  
Chen L., 2019, arXiv