Camera-LiDAR Cross-Modality Fusion Water Segmentation for Unmanned Surface Vehicles

被引:5
|
作者
Gao, Jiantao [1 ]
Zhang, Jingting [1 ]
Liu, Chang [1 ]
Li, Xiaomao [1 ]
Peng, Yan [1 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, Res Inst USV Engn, Shanghai 200444, Peoples R China
基金
中国国家自然科学基金;
关键词
water segmentation; semantic segmentation; image segmentation; LiDAR point cloud; deep learning; unmanned surface vessel;
D O I
10.3390/jmse10060744
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Water segmentation is essential for the autonomous driving system of unmanned surface vehicles (USVs), which provides reliable navigation for making safety decisions. However, existing methods have only used monocular images as input, which often suffer from the changes in illumination and weather. Compared with monocular images, LiDAR point clouds can be collected independently of ambient light and provide sufficient 3D information but lack the color and texture that images own. Thus, in this paper, we propose a novel camera-LiDAR cross-modality fusion water segmentation method, which combines the data characteristics of the 2D image and 3D LiDAR point cloud in water segmentation for the first time. Specifically, the 3D point clouds are first supplemented with 2D color and texture information from the images and then distinguished into water surface points and non-water points by the early 3D cross-modality segmentation module. Subsequently, the 3D segmentation results and features are fed into the late 2D cross-modality segmentation module to perform 2D water segmentation. Finally, the 2D and 3D water segmentation results are fused for the refinement by an uncertainty-aware cross-modality fusion module. We further collect, annotate and present a novel Cross-modality Water Segmentation (CMWS) dataset to validate our proposed method. To the best of our knowledge, this is the first water segmentation dataset for USVs in inland waterways consisting of images and corresponding point clouds. Extensive experiments on the CMWS dataset demonstrate that our proposed method can significantly improve image-only-based methods, achieving improvements in accuracy and MaxF of approximately 2% for all the image-only-based methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Camera-LiDAR Cross-Modality Gait Recognition
    Guo, Wenxuan
    Liang, Yingping
    Pan, Zhiyu
    Xi, Ziheng
    Feng, Jianjiang
    Zhou, Jie
    COMPUTER VISION - ECCV 2024, PT XXXIV, 2025, 15092 : 439 - 455
  • [2] CMDFusion: Bidirectional Fusion Network With Cross-Modality Knowledge Distillation for LiDAR Semantic Segmentation
    Cen, Jun
    Zhang, Shiwei
    Pei, Yixuan
    Li, Kun
    Zheng, Hang
    Luo, Maochun
    Zhang, Yingya
    Chen, Qifeng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 771 - 778
  • [3] A Camera-LiDAR Fusion Framework for Traffic Monitoring
    Sochaniwsky, Adrian
    Huangfu, Yixin
    Habibi, Saeid
    Von Mohrenschildt, Martin
    Ahmed, Ryan
    Bhuiyan, Mymoon
    Wyndham-West, Kyle
    Vidal, Carlos
    2024 IEEE TRANSPORTATION ELECTRIFICATION CONFERENCE AND EXPO, ITEC 2024, 2024,
  • [4] Camera-LiDAR Fusion With Latent Correlation for Cross-Scene Place Recognition
    Pan, Yan
    Xie, Jiapeng
    Wu, Jiajie
    Zhou, Bo
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2025, 72 (03) : 2801 - 2809
  • [5] Targetless Lidar-Camera Calibration via Cross-Modality Structure Consistency
    Ou, Ni
    Cai, Hanyu
    Wang, Junzheng
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 2636 - 2648
  • [6] A spatially enhanced network with camera-lidar fusion for 3D semantic segmentation
    Ye, Chao
    Pan, Huihui
    Yu, Xinghu
    Gao, Huijun
    NEUROCOMPUTING, 2022, 484 : 59 - 66
  • [7] Keypoints and Descriptors Based on Cross-Modality Information Fusion for Camera Localization
    MA Shuo
    GAO Yongbin
    TIAN Fangzheng
    LU Junxin
    HUANG Bo
    GU Jia
    ZHOU Yilong
    Wuhan University Journal of Natural Sciences, 2021, 26 (02) : 128 - 136
  • [8] Cross-Modality Features Fusion for Synthetic Aperture Radar Image Segmentation
    Gao, Fei
    Huang, Heqing
    Yue, Zhenyu
    Li, Dongyu
    Ge, Shuzhi Sam
    Lee, Tong Heng
    Zhou, Huiyu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [9] Multiple Objects Localization With Camera-LIDAR Sensor Fusion
    Hocaoglu, Gokce Sena
    Benli, Emrah
    IEEE SENSORS JOURNAL, 2025, 25 (07) : 11892 - 11905
  • [10] Security Analysis of Camera-LiDAR Fusion Against Black-Box Attacks on Autonomous Vehicles
    Hallyburton, R. Spencer
    Liu, Yupei
    Cao, Yulong
    Mao, Z. Morley
    Pajic, Miroslav
    PROCEEDINGS OF THE 31ST USENIX SECURITY SYMPOSIUM, 2022, : 1903 - 1920