Camera-LiDAR Cross-Modality Fusion Water Segmentation for Unmanned Surface Vehicles

被引：5

作者：

Gao, Jiantao ^{[1
]}

Zhang, Jingting ^{[1
]}

Liu, Chang ^{[1
]}

Li, Xiaomao ^{[1
]}

Peng, Yan ^{[1
]}

机构：

[1] Shanghai Univ, Sch Mechatron Engn & Automat, Res Inst USV Engn, Shanghai 200444, Peoples R China

来源：

JOURNAL OF MARINE SCIENCE AND ENGINEERING | 2022年 / 10卷 / 06期

基金：

中国国家自然科学基金;

关键词：

water segmentation; semantic segmentation; image segmentation; LiDAR point cloud; deep learning; unmanned surface vessel;

D O I：

10.3390/jmse10060744

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

Water segmentation is essential for the autonomous driving system of unmanned surface vehicles (USVs), which provides reliable navigation for making safety decisions. However, existing methods have only used monocular images as input, which often suffer from the changes in illumination and weather. Compared with monocular images, LiDAR point clouds can be collected independently of ambient light and provide sufficient 3D information but lack the color and texture that images own. Thus, in this paper, we propose a novel camera-LiDAR cross-modality fusion water segmentation method, which combines the data characteristics of the 2D image and 3D LiDAR point cloud in water segmentation for the first time. Specifically, the 3D point clouds are first supplemented with 2D color and texture information from the images and then distinguished into water surface points and non-water points by the early 3D cross-modality segmentation module. Subsequently, the 3D segmentation results and features are fed into the late 2D cross-modality segmentation module to perform 2D water segmentation. Finally, the 2D and 3D water segmentation results are fused for the refinement by an uncertainty-aware cross-modality fusion module. We further collect, annotate and present a novel Cross-modality Water Segmentation (CMWS) dataset to validate our proposed method. To the best of our knowledge, this is the first water segmentation dataset for USVs in inland waterways consisting of images and corresponding point clouds. Extensive experiments on the CMWS dataset demonstrate that our proposed method can significantly improve image-only-based methods, achieving improvements in accuracy and MaxF of approximately 2% for all the image-only-based methods.

引用

页数：17

共 50 条

[1] Camera-LiDAR Cross-Modality Gait Recognition
Guo, Wenxuan
Liang, Yingping
Pan, Zhiyu
Xi, Ziheng
Feng, Jianjiang
Zhou, Jie
COMPUTER VISION - ECCV 2024, PT XXXIV, 2025, 15092 : 439 - 455
[2] CMDFusion: Bidirectional Fusion Network With Cross-Modality Knowledge Distillation for LiDAR Semantic Segmentation
Cen, Jun
Zhang, Shiwei
Pei, Yixuan
Li, Kun
Zheng, Hang
Luo, Maochun
Zhang, Yingya
Chen, Qifeng
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 771 - 778
[3] A Camera-LiDAR Fusion Framework for Traffic Monitoring
Sochaniwsky, Adrian
Huangfu, Yixin
Habibi, Saeid
Von Mohrenschildt, Martin
Ahmed, Ryan
Bhuiyan, Mymoon
Wyndham-West, Kyle
Vidal, Carlos
2024 IEEE TRANSPORTATION ELECTRIFICATION CONFERENCE AND EXPO, ITEC 2024, 2024,
[4] Camera-LiDAR Fusion With Latent Correlation for Cross-Scene Place Recognition
Pan, Yan
Xie, Jiapeng
Wu, Jiajie
Zhou, Bo
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2025, 72 (03) : 2801 - 2809
[5] Targetless Lidar-Camera Calibration via Cross-Modality Structure Consistency
Ou, Ni
Cai, Hanyu
Wang, Junzheng
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 2636 - 2648
[6] A spatially enhanced network with camera-lidar fusion for 3D semantic segmentation
Ye, Chao
Pan, Huihui
Yu, Xinghu
Gao, Huijun
NEUROCOMPUTING, 2022, 484 : 59 - 66
[7] Keypoints and Descriptors Based on Cross-Modality Information Fusion for Camera Localization
MA Shuo
GAO Yongbin
TIAN Fangzheng
LU Junxin
HUANG Bo
GU Jia
ZHOU Yilong
Wuhan University Journal of Natural Sciences, 2021, 26 (02) : 128 - 136
[8] Cross-Modality Features Fusion for Synthetic Aperture Radar Image Segmentation
Gao, Fei
Huang, Heqing
Yue, Zhenyu
Li, Dongyu
Ge, Shuzhi Sam
Lee, Tong Heng
Zhou, Huiyu
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[9] Multiple Objects Localization With Camera-LIDAR Sensor Fusion
Hocaoglu, Gokce Sena
Benli, Emrah
IEEE SENSORS JOURNAL, 2025, 25 (07) : 11892 - 11905
[10] Security Analysis of Camera-LiDAR Fusion Against Black-Box Attacks on Autonomous Vehicles
Hallyburton, R. Spencer
Liu, Yupei
Cao, Yulong
Mao, Z. Morley
Pajic, Miroslav
PROCEEDINGS OF THE 31ST USENIX SECURITY SYMPOSIUM, 2022, : 1903 - 1920

← 1 2 3 4 5 →