Semantic segmentation of underwater images based on the improved SegFormer

被引:0
|
作者
Chen, Bowei [1 ,2 ]
Zhao, Wei [1 ,2 ]
Zhang, Qiusheng [3 ]
Li, Mingliang [3 ]
Qi, Mingyang [3 ]
Tang, You [3 ,4 ,5 ]
机构
[1] Qingdao Innovat & Dev Base, Harbin, Peoples R China
[2] Harbin Engn Univ, Lab Underwater Intelligence, Qingdao, Peoples R China
[3] Jilin Agr Sci & Technol Univ, Elect & Informat Engn Coll, Jilin, Peoples R China
[4] Jilin Agr Univ, Coll Informat Technol, Changchun, Peoples R China
[5] Yanbian Univ, Coll Agr, Yanji, Peoples R China
关键词
underwater images; semantic segmentation; attention mechanism; feature fusion; SegFormer;
D O I
10.3389/fmars.2025.1522160
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Underwater images segmentation is essential for tasks such as underwater exploration, marine environmental monitoring, and resource development. Nevertheless, given the complexity and variability of the underwater environment, improving model accuracy remains a key challenge in underwater image segmentation tasks. To address these issues, this study presents a high-performance semantic segmentation approach for underwater images based on the standard SegFormer model. First, the Mix Transformer backbone in SegFormer is replaced with a Swin Transformer to enhance feature extraction and facilitate efficient acquisition of global context information. Next, the Efficient Multi-scale Attention (EMA) mechanism is introduced in the backbone's downsampling stages and the decoder to better capture multi-scale features, further improving segmentation accuracy. Furthermore, a Feature Pyramid Network (FPN) structure is incorporated into the decoder to combine feature maps at multiple resolutions, allowing the model to integrate contextual information effectively, enhancing robustness in complex underwater environments. Testing on the SUIM underwater image dataset shows that the proposed model achieves high performance across multiple metrics: mean Intersection over Union (MIoU) of 77.00%, mean Recall (mRecall) of 85.04%, mean Precision (mPrecision) of 89.03%, and mean F1score (mF1score) of 86.63%. Compared to the standard SegFormer, it demonstrates improvements of 3.73% in MIoU, 1.98% in mRecall, 3.38% in mPrecision, and 2.44% in mF1score, with an increase of 9.89M parameters. The results demonstrate that the proposed method achieves superior segmentation accuracy with minimal additional computation, showcasing high performance in underwater image segmentation.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Semantic Segmentation of Underwater Images Based on Improved Deeplab
    Liu, Fangfang
    Fang, Ming
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2020, 8 (03)
  • [2] Efficient Semantic Segmentation of Nuclei in Histopathology Images Using Segformer
    Khaled, Marwan
    Hammouda, Mostafa A.
    Ali, Hesham
    Elattar, Mustafa
    Selim, Sahar
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2023, 2024, 14122 : 81 - 95
  • [3] Improving Semantic Segmentation Performance in Underwater Images
    Nunes, Alexandra
    Matos, Anibal
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (12)
  • [4] DYNAMICALLY PRUNING SEGFORMER FOR EFFICIENT SEMANTIC SEGMENTATION
    Bai, Haoli
    Mao, Hongda
    Nair, Dinesh
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3298 - 3302
  • [5] A Semantic Segmentation Method for Road Sensing Images Based on an Improved PIDNet Model
    Tan, Guangxing
    Jin, Yangying
    ELECTRONICS, 2025, 14 (05):
  • [6] Semantic Segmentation of Cucumber Leaf Disease Spots Based on ECA-SegFormer
    Yang, Ruotong
    Guo, Yaojiang
    Hu, Zhiwei
    Gao, Ruibo
    Yang, Hua
    AGRICULTURE-BASEL, 2023, 13 (08):
  • [7] Mobile-SegFormer: A Lightweight Semantic Segmentation Network
    Lin, Zhenyuan
    Li, Weikun
    Gao, Dahua
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024, 2024, 14873 : 294 - 305
  • [8] Semantic Segmentation Method for Remote Sensing Images Based on Improved DeepLabV3+
    Su Zhipeng
    Li Jingwen
    Jiang Jianwu
    Lu Yanling
    Zhu Ming
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (06)
  • [9] Semantic segmentation method of indoor obstacle images based on improved BiSeNet
    Yu M.
    Fan C.
    Li X.
    Li W.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (06): : 133 - 138
  • [10] DEEP LEARNING FOR SEMANTIC SEGMENTATION OF CORAL IMAGES IN UNDERWATER PHOTOGRAMMETRY
    Zhang, Hanqi
    Gruen, Armin
    Li, Ming
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 343 - 350