Semantic segmentation of underwater images based on the improved SegFormer

Cited by: 0

Authors:

Chen, Bowei [1,2]

Zhao, Wei [1,2]

Zhang, Qiusheng [3]

Li, Mingliang [3]

Qi, Mingyang [3]

Tang, You [3,4,5]

Affiliations:

[1] Qingdao Innovat & Dev Base, Harbin, Peoples R China

[2] Harbin Engn Univ, Lab Underwater Intelligence, Qingdao, Peoples R China

[3] Jilin Agr Sci & Technol Univ, Elect & Informat Engn Coll, Jilin, Peoples R China

[4] Jilin Agr Univ, Coll Informat Technol, Changchun, Peoples R China

[5] Yanbian Univ, Coll Agr, Yanji, Peoples R China

Source:

FRONTIERS IN MARINE SCIENCE | 2025 / Vol. 12

Keywords:

underwater images; semantic segmentation; attention mechanism; feature fusion; SegFormer;

DOI:

10.3389/fmars.2025.1522160

Chinese Library Classification (CLC) number:

X [Environmental Science, Safety Science];

Discipline classification code:

08 ; 0830 ;

Abstract:

Underwater image segmentation is essential for tasks such as underwater exploration, marine environmental monitoring, and resource development. Nevertheless, given the complexity and variability of the underwater environment, improving model accuracy remains a key challenge in underwater image segmentation. To address this challenge, this study presents a high-performance semantic segmentation approach for underwater images based on the standard SegFormer model. First, the Mix Transformer backbone in SegFormer is replaced with a Swin Transformer to enhance feature extraction and facilitate efficient acquisition of global context information. Next, the Efficient Multi-scale Attention (EMA) mechanism is introduced in the backbone's downsampling stages and in the decoder to better capture multi-scale features, further improving segmentation accuracy. Furthermore, a Feature Pyramid Network (FPN) structure is incorporated into the decoder to combine feature maps at multiple resolutions, allowing the model to integrate contextual information effectively and enhancing robustness in complex underwater environments. Testing on the SUIM underwater image dataset shows that the proposed model achieves high performance across multiple metrics: mean Intersection over Union (MIoU) of 77.00%, mean Recall (mRecall) of 85.04%, mean Precision (mPrecision) of 89.03%, and mean F1-score (mF1score) of 86.63%. Compared to the standard SegFormer, it demonstrates improvements of 3.73% in MIoU, 1.98% in mRecall, 3.38% in mPrecision, and 2.44% in mF1score, with an increase of 9.89M parameters. The results demonstrate that the proposed method achieves superior segmentation accuracy with minimal additional computation, showcasing high performance in underwater image segmentation.
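The four metrics quoted in the abstract can be illustrated with a minimal sketch. It assumes the usual definitions: each metric is computed per class from a pixel-level confusion matrix and then averaged over classes with equal weight. The abstract does not state the exact averaging used in the paper, so this is an assumption, and the function names `confusion_matrix` and `segmentation_metrics` are hypothetical helpers, not the authors' code.

```python
from typing import Dict, List, Sequence


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """Count pixels; rows index the ground-truth class, columns the prediction."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def segmentation_metrics(cm: List[List[int]]) -> Dict[str, float]:
    """Class-averaged IoU, recall, precision and F1 (assumed definitions)."""
    n = len(cm)
    ious, recalls, precisions, f1s = [], [], [], []
    for c in range(n):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                        # class c pixels predicted as another class
        fp = sum(cm[r][c] for r in range(n)) - tp   # other pixels predicted as class c
        iou = tp / (tp + fp + fn) if (tp + fp + fn) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        ious.append(iou)
        recalls.append(recall)
        precisions.append(precision)
        f1s.append(f1)
    return {
        "mIoU": sum(ious) / n,
        "mRecall": sum(recalls) / n,
        "mPrecision": sum(precisions) / n,
        "mF1": sum(f1s) / n,   # mean of per-class F1, not F1 of the two means
    }


# Toy example: six "pixels", three classes (labels are illustrative only).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
metrics = segmentation_metrics(confusion_matrix(y_true, y_pred, num_classes=3))
```

Averaging per-class F1 scores generally gives a different value from taking the harmonic mean of mPrecision and mRecall, so comparisons between models are only meaningful when the same convention is used throughout.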


Pages: 13
