Semantic segmentation of underwater images based on the improved SegFormer

被引:0
|
作者
Chen, Bowei [1 ,2 ]
Zhao, Wei [1 ,2 ]
Zhang, Qiusheng [3 ]
Li, Mingliang [3 ]
Qi, Mingyang [3 ]
Tang, You [3 ,4 ,5 ]
机构
[1] Qingdao Innovat & Dev Base, Harbin, Peoples R China
[2] Harbin Engn Univ, Lab Underwater Intelligence, Qingdao, Peoples R China
[3] Jilin Agr Sci & Technol Univ, Elect & Informat Engn Coll, Jilin, Peoples R China
[4] Jilin Agr Univ, Coll Informat Technol, Changchun, Peoples R China
[5] Yanbian Univ, Coll Agr, Yanji, Peoples R China
关键词
underwater images; semantic segmentation; attention mechanism; feature fusion; SegFormer;
D O I
10.3389/fmars.2025.1522160
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Underwater images segmentation is essential for tasks such as underwater exploration, marine environmental monitoring, and resource development. Nevertheless, given the complexity and variability of the underwater environment, improving model accuracy remains a key challenge in underwater image segmentation tasks. To address these issues, this study presents a high-performance semantic segmentation approach for underwater images based on the standard SegFormer model. First, the Mix Transformer backbone in SegFormer is replaced with a Swin Transformer to enhance feature extraction and facilitate efficient acquisition of global context information. Next, the Efficient Multi-scale Attention (EMA) mechanism is introduced in the backbone's downsampling stages and the decoder to better capture multi-scale features, further improving segmentation accuracy. Furthermore, a Feature Pyramid Network (FPN) structure is incorporated into the decoder to combine feature maps at multiple resolutions, allowing the model to integrate contextual information effectively, enhancing robustness in complex underwater environments. Testing on the SUIM underwater image dataset shows that the proposed model achieves high performance across multiple metrics: mean Intersection over Union (MIoU) of 77.00%, mean Recall (mRecall) of 85.04%, mean Precision (mPrecision) of 89.03%, and mean F1score (mF1score) of 86.63%. Compared to the standard SegFormer, it demonstrates improvements of 3.73% in MIoU, 1.98% in mRecall, 3.38% in mPrecision, and 2.44% in mF1score, with an increase of 9.89M parameters. The results demonstrate that the proposed method achieves superior segmentation accuracy with minimal additional computation, showcasing high performance in underwater image segmentation.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A Semantic Segmentation Method for Road Sensing Images Based on an Improved PIDNet Model
    Tan, Guangxing
    Jin, Yangying
    ELECTRONICS, 2025, 14 (05):
  • [22] Semantic Segmentation Method for Remote Sensing Images Based on Improved Swin Transformer
    Wang, Yizhong
    Hu, Yaqi
    Wu, Xiaosuo
    Yan, Haowen
    Wang, Xiaocheng
    Computer Engineering and Applications, 2024, 60 (11) : 194 - 203
  • [23] UISS-Net: Underwater Image Semantic Segmentation Network for improving boundary segmentation accuracy of underwater images
    He, Zhiqian
    Cao, Lijie
    Luo, Jialu
    Xu, Xiaoqing
    Tang, Jiayi
    Xu, Jianhao
    Xu, Gengyan
    Chen, Ziwen
    AQUACULTURE INTERNATIONAL, 2024, 32 (05) : 5625 - 5638
  • [24] Epistemic uncertainty estimation with evidential learning on semantic segmentation of underwater images
    Do Nascimento, Gustavo Henrique
    Dias De Oliveira Evald, Paulo Jefferson
    Drews Junior, Paulo Lilles Jorge
    2022 LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS), 2022 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR), AND 2022 WORKSHOP ON ROBOTICS IN EDUCATION (WRE), 2022, : 163 - 168
  • [25] Semantic segmentation using synthetic images of underwater marine-growth
    Mai, Christian
    Liniger, Jesper
    Pedersen, Simon
    FRONTIERS IN ROBOTICS AND AI, 2025, 11
  • [26] Semantic Segmentation Method for Remote Sensing Images Based on Improved DeepLabV3+
    Su Zhipeng
    Li Jingwen
    Jiang Jianwu
    Lu Yanling
    Zhu Ming
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (06)
  • [27] DeepMDSCBA: An Improved Semantic Segmentation Model Based on DeepLabV3+ for Apple Images
    Mo, Lufeng
    Fan, Yishan
    Wang, Guoying
    Yi, Xiaomei
    Wu, Xiaoping
    Wu, Peng
    FOODS, 2022, 11 (24)
  • [28] Farmland Extraction from UAV Remote Sensing Images Based on Improved SegFormer Model
    Chen, Yuqing
    Wang, Xiuxin
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2025, 53 (02) : 421 - 433
  • [29] Using SegFormer for Effective Semantic Cell Segmentation for Fault Detection in Photovoltaic Arrays
    Mahboob, Zaid
    Khan, M. Adil
    Lodhi, Ehtisham
    Nawaz, Tahir
    Khan, Umar S.
    IEEE JOURNAL OF PHOTOVOLTAICS, 2024,
  • [30] Weed identification in broomcorn millet field using segformer semantic segmentation based on multiple loss functions
    Bi, Zhifang
    Li, Yanwen
    Guan, Jiaxiong
    Li, Juxia
    Zhang, Pengpeng
    Zhang, Xiaoying
    Han, Yuanhuai
    Wang, Linjuan
    Guo, Wenfeng
    Engineering in Agriculture, Environment and Food, 2024, 17 (01) : 27 - 36