CSFNet: Cross-Modal Semantic Focus Network for Semantic Segmentation of Large-Scale Point Clouds

被引:1
|
作者
Luo, Yang [1 ]
Han, Ting [2 ]
Liu, Yujun [3 ]
Su, Jinhe [1 ]
Chen, Yiping [2 ]
Li, Jinyuan [1 ]
Wu, Yundong [1 ]
Cai, Guorong [1 ]
机构
[1] Jimei Univ, Sch Comp Engn, Xiamen 361021, Peoples R China
[2] Sun Yat Sen Univ, Sch Geospatial Engn & Sci, Zhuhai 519082, Peoples R China
[3] Shenzhen Univ, Sch Architecture & Urban Planning, Shenzhen 518061, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2025年 / 63卷
基金
中国国家自然科学基金;
关键词
Point cloud compression; Laser radar; Three-dimensional displays; Semantics; Feature extraction; Contrastive learning; Semantic segmentation; Roads; Transformers; Image color analysis; Constrastive learning; point clouds; semantic focus; semantic segmentation; urban scenes;
D O I
10.1109/TGRS.2025.3535800
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Semantic segmentation of large-scale point clouds is an indispensable component of outdoor scene perception, providing essential 3-D semantic insights for applications in scene reconstruction, urban planning, autonomous driving, and more. However, the discriminative capability of point clouds features declines with increasing distance from the sensor, causing current methods to usually perform poorly in segmenting distant objects. To overcome this challenge and improve the differentiation between classes with similar geometric features, we propose the cross-modal semantic focus network (CSFNet). Firstly, we design a multiscale feature dynamic fusion (MDF) module to leverage multiscale image features, thereby enriching the feature representation of point clouds with additional images color and texture information. Then, in order to extract the distinguishing features of distant and different categories of objects more efficiently, we propose a semantic focus module (SFM) that employs a multiclass contrastive learning strategy to enhance feature discrimination. Finally, we introduce cross-modal knowledge distillation (KD) to augment the model's comprehension of point clouds. Extensive experiments conducted on the SemanticKITTI and nuScenes datasets demonstrate the effectiveness of our method. Notably, our method achieves superior segmentation accuracy across multiple classes at various distances compared to current methods.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Cross-modal semantic transfer for point cloud semantic segmentation
    Cao, Zhen
    Mi, Xiaoxin
    Qiu, Bo
    Cao, Zhipeng
    Long, Chen
    Yan, Xinrui
    Zheng, Chao
    Dong, Zhen
    Yang, Bisheng
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 221 : 265 - 279
  • [2] Semantic Guidance Fusion Network for Cross-Modal Semantic Segmentation
    Zhang, Pan
    Chen, Ming
    Gao, Meng
    SENSORS, 2024, 24 (08)
  • [3] Semantic segmentation of large-scale point clouds with neighborhood uncertainty
    Bao, Yong
    Wen, Haibiao
    Zhang, Baoqing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (21) : 60949 - 60964
  • [4] Dense Dual-Branch Cross Attention Network for Semantic Segmentation of Large-Scale Point Clouds
    Luo, Ziwei
    Zeng, Ziyin
    Tang, Wei
    Wan, Jie
    Xie, Zhong
    Xu, Yongyang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 16
  • [5] LessNet: Lightweight and efficient semantic segmentation for large-scale point clouds
    Feng, Guoqiang
    Li, Weilong
    Zhao, Xiaolin
    Yang, Xuemeng
    Kong, Xin
    Huang, TianXin
    Cui, Jinhao
    IET CYBER-SYSTEMS AND ROBOTICS, 2022, 4 (02) : 107 - 115
  • [6] GSIP: Green Semantic Segmentation of Large-Scale Indoor Point Clouds
    Zhang, Min
    Kadam, Pranav
    Liu, Shan
    Kuo, C. -C. Jay
    PATTERN RECOGNITION LETTERS, 2022, 164 : 9 - 15
  • [7] Learning Semantic Segmentation of Large-Scale Point Clouds With Random Sampling
    Hu, Qingyong
    Yang, Bo
    Xie, Linhai
    Rosa, Stefano
    Guo, Yulan
    Wang, Zhihua
    Trigoni, Niki
    Markham, Andrew
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8338 - 8354
  • [8] Continuous Mapping Convolution for Large-Scale Point Clouds Semantic Segmentation
    Yan, Kunping
    Hu, Qingyong
    Wang, Hanyun
    Huang, Xiaohong
    Li, Li
    Ji, Song
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [9] BushNet: Effective semantic segmentation of bush in large-scale point clouds
    Wei, Hejun
    Xu, Enyong
    Zhang, Jinlai
    Meng, Yanmei
    Wei, Jin
    Dong, Zhen
    Li, Zhengqiang
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 193
  • [10] Fast Semantic Preserving Hashing for Large-Scale Cross-Modal Retrieval
    Wang, Xingzhi
    Liu, Xin
    Peng, Shujuan
    Cheung, Yiu-ming
    Hu, Zhikai
    Wang, Nannan
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 1348 - 1353