A Random Fusion of Mix3D and PolarMix to Improve Semantic Segmentation Performance in 3D Lidar Point Cloud

被引:0
作者
Liu, Bo [1 ,2 ]
Feng, Li [1 ]
Chen, Yufeng [3 ]
机构
[1] Macao Univ Sci & Technol, Sch Comp Sci & Engn, Macau 999078, Peoples R China
[2] Chaohu Univ, Sch Comp Sci & Artificial Intelligence, Chaohu 238000, Peoples R China
[3] Hubei Univ Automot Technol, Inst Vehicle Informat Control & Network Technol, Shiyan 442002, Peoples R China
来源
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES | 2024年 / 140卷 / 01期
关键词
3D lidar point cloud; data augmentation; RandomFusion; semantic segmentation;
D O I
10.32604/cmes.2024.047695
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper focuses on the effective utilization of data augmentation techniques for 3D lidar point clouds to enhance the performance of neural network models. These point clouds, which represent spatial information through a collection of 3D coordinates, have found wide-ranging applications. Data augmentation has emerged as a potent solution to the challenges posed by limited labeled data and the need to enhance model generalization capabilities. Much of the existing research is devoted to crafting novel data augmentation methods specifically for 3D lidar point clouds. However, there has been a lack of focus on making the most of the numerous existing augmentation techniques. Addressing this deficiency, this research investigates the possibility of combining two fundamental data augmentation strategies. The paper introduces PolarMix and Mix3D, two commonly employed augmentation techniques, and presents a new approach, named RandomFusion. Instead of using a fixed or predetermined combination of augmentation methods, RandomFusion randomly chooses one method from a pool of options for each instance or sample. This innovative data augmentation technique randomly augments each point in the point cloud with either PolarMix or Mix3D. The crux of this strategy is the random choice between PolarMix and Mix3D for the augmentation of each point within the point cloud data set. The results of the experiments conducted validate the efficacy of the RandomFusion strategy in enhancing the performance of neural network models for 3D lidar point cloud semantic segmentation tasks. This is achieved without compromising computational efficiency. By examining the potential of merging different augmentation techniques, the research contributes significantly to a more comprehensive understanding of how to utilize existing augmentation methods for 3D lidar point clouds. RandomFusion data augmentation technique offers a simple yet effective method to leverage the diversity of augmentation techniques and boost the robustness of models. The insights gained from this research can pave the way for future work aimed at developing more advanced and efficient data augmentation strategies for 3D lidar point cloud analysis.
引用
收藏
页码:845 / 862
页数:18
相关论文
共 33 条
  • [1] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
    Behley, Jens
    Garbade, Martin
    Milioto, Andres
    Quenzel, Jan
    Behnke, Sven
    Stachniss, Cyrill
    Gall, Juergen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9296 - 9306
  • [2] ADA: Adversarial Data Augmentation for Object Detection
    Behpour, Sima
    Kitani, Kris M.
    Ziebart, Brian D.
    [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1243 - 1252
  • [3] Intrinsic feature extraction for unsupervised domain adaptation
    Cao, Xinzhi
    Guo, Yinsai
    Yang, Wenbin
    Luo, Xiangfeng
    Xie, Shaorong
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2023, 19 (5/6) : 173 - 189
  • [4] A Robust Shape-Aware Rib Fracture Detection and Segmentation Framework With Contrastive Learning
    Cao, Zheng
    Xu, Liming
    Chen, Danny Z.
    Gao, Honghao
    Wu, Jian
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1584 - 1591
  • [5] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [6] Chen PG, 2024, Arxiv, DOI [arXiv:2001.04086, 10.48550/arXiv.2001.04086]
  • [7] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [8] BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation
    Dai, Jifeng
    He, Kaiming
    Sun, Jian
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1635 - 1643
  • [9] DeVries T, 2017, Arxiv, DOI [arXiv:1708.04552, DOI 10.48550/ARXIV.1708.04552]
  • [10] Com-DDPG: Task Offloading Based on Multiagent Reinforcement Learning for Information-Communication-Enhanced Mobile Edge Computing in the Internet of Vehicles
    Gao, Honghao
    Wang, Xuejie
    Wei, Wei
    Al-Dulaimi, Anwer
    Xu, Yueshen
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (01) : 348 - 361