Large-scale 3D point-cloud semantic segmentation of urban and rural scenes using data volume decomposition coupled with pipeline parallelism

被引:17
作者
Chew, Alvin Wei Ze [1 ]
Ji, Ankang [2 ,3 ]
Zhang, Limao [3 ]
机构
[1] Bentley Syst Res Off, 1 Harbourfront Pl,HarbourFront Tower One, Singapore 098633, Singapore
[2] Harbin Inst Technol, Sch Management, Harbin 150001, Peoples R China
[3] Nanyang Technol Univ, Sch Civil & Environm Engn, 50 Nanyang Ave, Singapore 639798, Singapore
关键词
Deep learning; Image segmentation analysis; Encoder-decoder; Pipeline parallelism; 3D point-cloud data; EMPIRICAL MODE DECOMPOSITION; CRACK DETECTION; CLASSIFICATION; NETWORK; OBJECTS;
D O I
10.1016/j.autcon.2021.103995
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This study proposes a generic approach which performs a series of systematic analyses by first introducing a data volume decomposition method to generate useful data features for performing semantic segmentation analysis involving 3D point-cloud data. Pipeline parallelism protocol is then implemented to accelerate the deep learning model's training phase. Our proposed approach is verified by decomposing around 2.0 billion point-cloud data points, as extracted from an open-source Semantic3D dataset, into many 3D regular structures with defined numbers of voxels. Each derived 3D structure has imposed normality in their data distribution of the respective label classes. Using the optimal hyperparameters for model training, the resulting trained model achieves average overall accuracy (mOA) and average intersection over union (mIOU) values of 0.984 and 0.752, respectively, on a testing dataset having close to 800 million point-cloud data points. The results are comparable with that of other state-of-the-art models in the literature.
引用
收藏
页数:19
相关论文
共 88 条
  • [1] Automatic Detection and Modeling of Underground Pipes Using a Portable 3D LiDAR System
    Aijazi, Ahmad K.
    Malaterre, Laurent
    Trassoudaine, Laurent
    Chateau, Thierry
    Checchin, Paul
    [J]. SENSORS, 2019, 19 (24)
  • [2] Aksoy EE, 2020, IEEE INT VEH SYM, P926, DOI [10.1109/IV47402.2020.9304694, 10.13140/rg.2.2.22837.83689]
  • [3] [Anonymous], 2018, Hal-01763469
  • [4] 3D Semantic Parsing of Large-Scale Indoor Spaces
    Armeni, Iro
    Sener, Ozan
    Zamir, Amir R.
    Jiang, Helen
    Brilakis, Ioannis
    Fischer, Martin
    Savarese, Silvio
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1534 - 1543
  • [5] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [6] Encoder-decoder network for pixel-level road crack detection in black-box images
    Bang, Seongdeok
    Park, Somin
    Kim, Hongjo
    Kim, Hyoungkwan
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2019, 34 (08) : 713 - 727
  • [7] Classification of sensor independent point cloud data of building objects using random forests
    Bassier, Maarten
    Van Genechten, Bjorn
    Vergauwen, Maarten
    [J]. JOURNAL OF BUILDING ENGINEERING, 2019, 21 : 468 - 477
  • [8] Bayu A, 2019, 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORKS AND SATELLITE (COMNETSAT), P73, DOI [10.1109/comnetsat.2019.8844074, 10.1109/COMNETSAT.2019.8844074]
  • [9] Boulch A., 2017, 3dor@eurographics, P17, DOI [DOI 10.2312/3DOR.20171047, 10.2312/3dor.20171047]
  • [10] Boulch A., 2021, FKACONV FEATURE KERN