Semantic segmentation of underwater images based on the improved SegFormer

被引：0

作者：

Chen, Bowei ^{[1
,2
]}

Zhao, Wei ^{[1
,2
]}

Zhang, Qiusheng ^{[3
]}

Li, Mingliang ^{[3
]}

Qi, Mingyang ^{[3
]}

Tang, You ^{[3
,4
,5
]}

机构：

[1] Qingdao Innovat & Dev Base, Harbin, Peoples R China

[2] Harbin Engn Univ, Lab Underwater Intelligence, Qingdao, Peoples R China

[3] Jilin Agr Sci & Technol Univ, Elect & Informat Engn Coll, Jilin, Peoples R China

[4] Jilin Agr Univ, Coll Informat Technol, Changchun, Peoples R China

[5] Yanbian Univ, Coll Agr, Yanji, Peoples R China

来源：

FRONTIERS IN MARINE SCIENCE | 2025年 / 12卷

关键词：

underwater images; semantic segmentation; attention mechanism; feature fusion; SegFormer;

D O I：

10.3389/fmars.2025.1522160

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Underwater images segmentation is essential for tasks such as underwater exploration, marine environmental monitoring, and resource development. Nevertheless, given the complexity and variability of the underwater environment, improving model accuracy remains a key challenge in underwater image segmentation tasks. To address these issues, this study presents a high-performance semantic segmentation approach for underwater images based on the standard SegFormer model. First, the Mix Transformer backbone in SegFormer is replaced with a Swin Transformer to enhance feature extraction and facilitate efficient acquisition of global context information. Next, the Efficient Multi-scale Attention (EMA) mechanism is introduced in the backbone's downsampling stages and the decoder to better capture multi-scale features, further improving segmentation accuracy. Furthermore, a Feature Pyramid Network (FPN) structure is incorporated into the decoder to combine feature maps at multiple resolutions, allowing the model to integrate contextual information effectively, enhancing robustness in complex underwater environments. Testing on the SUIM underwater image dataset shows that the proposed model achieves high performance across multiple metrics: mean Intersection over Union (MIoU) of 77.00%, mean Recall (mRecall) of 85.04%, mean Precision (mPrecision) of 89.03%, and mean F1score (mF1score) of 86.63%. Compared to the standard SegFormer, it demonstrates improvements of 3.73% in MIoU, 1.98% in mRecall, 3.38% in mPrecision, and 2.44% in mF1score, with an increase of 9.89M parameters. The results demonstrate that the proposed method achieves superior segmentation accuracy with minimal additional computation, showcasing high performance in underwater image segmentation.

引用

页数：13

共 50 条

[1] Semantic Segmentation of Underwater Images Based on Improved Deeplab
Liu, Fangfang
Fang, Ming
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2020, 8 (03)
[2] Efficient Semantic Segmentation of Nuclei in Histopathology Images Using Segformer
Khaled, Marwan
Hammouda, Mostafa A.
Ali, Hesham
Elattar, Mustafa
Selim, Sahar
MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2023, 2024, 14122 : 81 - 95
[3] Improving Semantic Segmentation Performance in Underwater Images
Nunes, Alexandra
Matos, Anibal
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (12)
[4] DYNAMICALLY PRUNING SEGFORMER FOR EFFICIENT SEMANTIC SEGMENTATION
Bai, Haoli
Mao, Hongda
Nair, Dinesh
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3298 - 3302
[5] A Semantic Segmentation Method for Road Sensing Images Based on an Improved PIDNet Model
Tan, Guangxing
Jin, Yangying
ELECTRONICS, 2025, 14 (05):
[6] Semantic Segmentation of Cucumber Leaf Disease Spots Based on ECA-SegFormer
Yang, Ruotong
Guo, Yaojiang
Hu, Zhiwei
Gao, Ruibo
Yang, Hua
AGRICULTURE-BASEL, 2023, 13 (08):
[7] Mobile-SegFormer: A Lightweight Semantic Segmentation Network
Lin, Zhenyuan
Li, Weikun
Gao, Dahua
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024, 2024, 14873 : 294 - 305
[8] Semantic Segmentation Method for Remote Sensing Images Based on Improved DeepLabV3+
Su Zhipeng
Li Jingwen
Jiang Jianwu
Lu Yanling
Zhu Ming
LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (06)
[9] Semantic segmentation method of indoor obstacle images based on improved BiSeNet
Yu M.
Fan C.
Li X.
Li W.
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (06): : 133 - 138
[10] DEEP LEARNING FOR SEMANTIC SEGMENTATION OF CORAL IMAGES IN UNDERWATER PHOTOGRAMMETRY
Zhang, Hanqi
Gruen, Armin
Li, Ming
XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 343 - 350

← 1 2 3 4 5 →