A Random Fusion of Mix3D and PolarMix to Improve Semantic Segmentation Performance in 3D Lidar Point Cloud

被引：0

作者：

Liu, Bo ^{[1
,2
]}

Feng, Li ^{[1
]}

Chen, Yufeng ^{[3
]}

机构：

[1] Macao Univ Sci & Technol, Sch Comp Sci & Engn, Macau 999078, Peoples R China

[2] Chaohu Univ, Sch Comp Sci & Artificial Intelligence, Chaohu 238000, Peoples R China

[3] Hubei Univ Automot Technol, Inst Vehicle Informat Control & Network Technol, Shiyan 442002, Peoples R China

来源：

CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES | 2024年 / 140卷 / 01期

关键词：

3D lidar point cloud; data augmentation; RandomFusion; semantic segmentation;

D O I：

10.32604/cmes.2024.047695

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

This paper focuses on the effective utilization of data augmentation techniques for 3D lidar point clouds to enhance the performance of neural network models. These point clouds, which represent spatial information through a collection of 3D coordinates, have found wide-ranging applications. Data augmentation has emerged as a potent solution to the challenges posed by limited labeled data and the need to enhance model generalization capabilities. Much of the existing research is devoted to crafting novel data augmentation methods specifically for 3D lidar point clouds. However, there has been a lack of focus on making the most of the numerous existing augmentation techniques. Addressing this deficiency, this research investigates the possibility of combining two fundamental data augmentation strategies. The paper introduces PolarMix and Mix3D, two commonly employed augmentation techniques, and presents a new approach, named RandomFusion. Instead of using a fixed or predetermined combination of augmentation methods, RandomFusion randomly chooses one method from a pool of options for each instance or sample. This innovative data augmentation technique randomly augments each point in the point cloud with either PolarMix or Mix3D. The crux of this strategy is the random choice between PolarMix and Mix3D for the augmentation of each point within the point cloud data set. The results of the experiments conducted validate the efficacy of the RandomFusion strategy in enhancing the performance of neural network models for 3D lidar point cloud semantic segmentation tasks. This is achieved without compromising computational efficiency. By examining the potential of merging different augmentation techniques, the research contributes significantly to a more comprehensive understanding of how to utilize existing augmentation methods for 3D lidar point clouds. RandomFusion data augmentation technique offers a simple yet effective method to leverage the diversity of augmentation techniques and boost the robustness of models. The insights gained from this research can pave the way for future work aimed at developing more advanced and efficient data augmentation strategies for 3D lidar point cloud analysis.

引用

页码：845 / 862

页数：18

共 33 条

[1] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
Behley, Jens
Garbade, Martin
Milioto, Andres
Quenzel, Jan
Behnke, Sven
Stachniss, Cyrill
Gall, Juergen
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9296 - 9306
[2] ADA: Adversarial Data Augmentation for Object Detection
Behpour, Sima
Kitani, Kris M.
Ziebart, Brian D.
[J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1243 - 1252
[3] Intrinsic feature extraction for unsupervised domain adaptation
Cao, Xinzhi
Guo, Yinsai
Yang, Wenbin
Luo, Xiangfeng
Xie, Shaorong
[J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2023, 19 (5/6) : 173 - 189
[4] A Robust Shape-Aware Rib Fracture Detection and Segmentation Framework With Contrastive Learning
Cao, Zheng
Xu, Liming
Chen, Danny Z.
Gao, Honghao
Wu, Jian
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1584 - 1591
[5] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Chen, Liang-Chieh
Papandreou, George
Kokkinos, Iasonas
Murphy, Kevin
Yuille, Alan L.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
[6] Chen PG, 2024, Arxiv, DOI [arXiv:2001.04086, 10.48550/arXiv.2001.04086]
[7] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[8] BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation
Dai, Jifeng
He, Kaiming
Sun, Jian
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1635 - 1643
[9] DeVries T, 2017, Arxiv, DOI [arXiv:1708.04552, DOI 10.48550/ARXIV.1708.04552]
[10] Com-DDPG: Task Offloading Based on Multiagent Reinforcement Learning for Information-Communication-Enhanced Mobile Edge Computing in the Internet of Vehicles
Gao, Honghao
Wang, Xuejie
Wei, Wei
Al-Dulaimi, Anwer
Xu, Yueshen
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (01) : 348 - 361

← 1 2 3 4 →