Diamond-Unet: A Novel Semantic Segmentation Network Based on U-Net Network and Transformer for Deep Space Rock Images

被引:4
|
作者
Li, Guocheng [1 ,2 ]
Xi, Bobo [3 ,4 ]
He, Yufei [1 ,2 ]
Zheng, Tie [1 ]
Li, Yunsong [3 ]
Xue, Changbin [1 ]
Chanussot, Jocelyn [5 ]
机构
[1] Chinese Acad Sci, Natl Space Sci Ctr, Key Lab Elect & Informat Technol Space Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, State Key Lab Integrated Serv Networks, Beijing 100049, Peoples R China
[3] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[4] Chinese Acad Sci, Natl Space Sci Ctr, Key Lab Elect & Informat Technol Space Syst, Beijing, Peoples R China
[5] Univ Grenoble Alpes, Inria, CNRS, Grenoble INP,LJK, F-38000 Grenoble, France
基金
中国博士后科学基金;
关键词
Deep space exploration; feature cross fusion; image segmentation; Transformer;
D O I
10.1109/LGRS.2024.3397870
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Extracting rock objects from the surface of celestial bodies in deep space exploration environments is crucial for self-service path planning, navigation of detectors, and regional information evaluation. Most existing image semantic segmentation frameworks decrease the spatial resolution of the feature maps as networks deepen, resulting in limitations in detecting small targets and the inability to accurately segment boundary regions. In this letter, we propose a novel semantic segmentation network based on U-Net network and Transformer for deep space rock images, referred to as Diamond-Unet. This model integrates overcomplete and undercomplete branches and incorporates a global-local feature extraction (GLFE) module based on Transformer and convolutional neural network (CNN) technologies to effectively capture discriminative information. Furthermore, an innovative feature cross-fusion path (FCFP) is introduced to enhance information exchange between the dual-branch networks, enabling the capture of both fine-grained details and coarse-grained semantics in the full-scale image segmentation architecture. Experimental results demonstrate that the Diamond-Unet achieves the mean intersection over union (MIoU) scores of 79.32% and 93.43% on two public datasets, which are superior to the compared methods.
引用
收藏
页码:1 / 5
页数:5
相关论文
共 50 条
  • [1] RockSeg: A Novel Semantic Segmentation Network Based on a Hybrid Framework Combining a Convolutional Neural Network and Transformer for Deep Space Rock Images
    Fan, Lili
    Yuan, Jiabin
    Niu, Xuewei
    Zha, Keke
    Ma, Weiqi
    REMOTE SENSING, 2023, 15 (16)
  • [2] Improved U-NET Semantic Segmentation Network
    Gao, Xueyan
    Fang, Lijin
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7090 - 7095
  • [3] Semantic segmentation of urban street scene images based on improved U-Net network
    Zhu, Fuzhen
    Cui, Jingyi
    Zhu, Bing
    Li, Huiling
    Liu, Yan
    OPTOELECTRONICS LETTERS, 2023, 19 (03) : 179 - 185
  • [4] Semantic Segmentation of Synthetic Aperture Radar Images Based on U-Net and Capsule Network
    Jing Shaodi
    Yu Lingjuan
    Hu Yuehong
    Yang Zezhou
    Lu Zhongliang
    Xie Xiaochun
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (20)
  • [5] Semantic segmentation of urban street scene images based on improved U-Net network
    ZHU Fuzhen
    CUI Jingyi
    ZHU Bing
    LI Huiling
    LIU Yan
    OptoelectronicsLetters, 2023, 19 (03) : 179 - 185
  • [6] Semantic segmentation of urban street scene images based on improved U-Net network
    Fuzhen Zhu
    Jingyi Cui
    Bing Zhu
    Huiling Li
    Yan Liu
    Optoelectronics Letters, 2023, 19 : 179 - 185
  • [7] Semantic segmentation network of uav image based on improved U-net
    Liu, Ziyi
    Huang, Jin
    2019 INTERNATIONAL CONFERENCE ON ADVANCES IN CIVIL ENGINEERING, ENERGY RESOURCES AND ENVIRONMENT ENGINEERING, 2019, 330
  • [8] CT-UNet: An Improved Neural Network Based on U-Net for Building Segmentation in Remote Sensing Images
    Ye, Huanran
    Liu, Sheng
    Jin, Kun
    Cheng, Haohao
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 166 - 172
  • [9] A novel reformed normaliser free network with U-Net architecture for semantic segmentation
    Kalvapalli, Sai Prabanjan Kumar
    Mala, C.
    Punitha, V.
    INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2023, 43 (02) : 97 - 108
  • [10] GT U-Net: A U-Net Like Group Transformer Network for Tooth Root Segmentation
    Li, Yunxiang
    Wang, Shuai
    Wang, Jun
    Zeng, Guodong
    Liu, Wenjun
    Zhang, Qianni
    Jin, Qun
    Wang, Yaqi
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 386 - 395