Neural Network for Underwater Fish Image Segmentation Using an Enhanced Feature Pyramid Convolutional Architecture

被引:0
作者
Yang, Guang [1 ]
Yang, Junyi [1 ]
Fan, Wenyao [1 ]
Yang, Donghe [2 ]
机构
[1] Hangzhou Dianzi Univ, Sch Mech Engn, Hangzhou 310018, Peoples R China
[2] Zhejiang Sci Tech Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
基金
国家重点研发计划;
关键词
fish segmentation; attention mechanism; pyramid architecture; deep learning;
D O I
10.3390/jmse13020238
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Underwater fish image segmentation is a crucial technique in marine fish monitoring. However, typical underwater fish images often suffer from issues such as color distortion, low contrast, and blurriness, primarily due to the complex and dynamic nature of the marine environment. To enhance the accuracy of underwater fish image segmentation, this paper introduces an innovative neural network model that combines the attention mechanism with a feature pyramid module. After the backbone network processes the input image through convolution, the data pass through the enhanced feature pyramid module, where it is iteratively processed by multiple weighted branches. Unlike conventional methods, the multi-scale feature extraction module that we designed not only improves the extraction of high-level semantic features but also optimizes the distribution of low-level shape feature weights through the synergistic interactions of the branches, all while preserving the inherent properties of the image. This novel architecture significantly boosts segmentation accuracy, offering a new solution for fish image segmentation tasks. To further enhance the model's robustness, the Mix-up and CutMix data augmentation techniques were employed. The model was validated using the Fish4Knowledge dataset, and the experimental results demonstrate that the model achieves a Mean Intersection over Union (MIoU) of 95.1%, with improvements of 1.3%, 1.5%, and 1.7% in the MIoU, Mean Pixel Accuracy (PA), and F1 score, respectively, compared to traditional segmentation methods. Additionally, a real fish image dataset captured in deep-sea environments was constructed to verify the practical applicability of the proposed algorithm.
引用
收藏
页数:17
相关论文
共 34 条
[1]   Deep learning-based segmental analysis of fish for biomass estimation in an occulted environment [J].
Abinaya, N. S. ;
Susan, D. ;
Sidharthan, Rakesh Kumar .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 197
[2]   Aquaculture defects recognition via multi-scale semantic segmentation [J].
Akram, Waseem ;
Hassan, Taimur ;
Toubar, Hamed ;
Ahmed, Muhayyuddin ;
Miskovic, Nikola ;
Seneviratne, Lakmal ;
Hussain, Irfan .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
[3]   Fish species identification using a convolutional neural network trained on synthetic data [J].
Allken, Vaneeda ;
Handegard, Nils Olav ;
Rosen, Shale ;
Schreyeck, Tiffanie ;
Mahiout, Thomas ;
Malde, Ketil .
ICES JOURNAL OF MARINE SCIENCE, 2019, 76 (01) :342-349
[4]   Improved deep learning framework for fish segmentation in underwater videos [J].
Alshdaifat, Nawaf Farhan Funkur ;
Talib, Abdullah Zawawi ;
Osman, Mohd Azam .
ECOLOGICAL INFORMATICS, 2020, 59
[5]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[6]  
Beyer L, 2022, Arxiv, DOI [arXiv:2106.05237, DOI 10.48550/ARXIV.2106.05237]
[7]   Automatic Lung Segmentation Algorithm on Chest X-ray Images Based on Fusion Variational Auto-Encoder and Three-Terminal Attention Mechanism [J].
Cao, Feidao ;
Zhao, Huaici .
SYMMETRY-BASEL, 2021, 13 (05)
[8]  
Chen LC, 2017, Arxiv, DOI [arXiv:1606.00915, DOI 10.1109/TPAMI.2017.2699184]
[9]  
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[10]  
DeVries T, 2017, Arxiv, DOI arXiv:1708.04552