Fusion of Ensembled UNET and Ensembled FPN for Semantic Segmentation

被引:2
作者
Sundarrajan, Kavitha [1 ]
Rajendran, Baskaran Kuttva [2 ]
Balasubramanian, Dhanapriya [1 ]
机构
[1] Kumaraguru Coll Technol, Dept Informat Technol, Coimbatore 641049, India
[2] Kumaraguru Coll Technol, Dept Comp Sci & Engn, Coimbatore 641049, India
关键词
UNET; intersection over union; semantic segmentation; FPN; pre-trained model; F1; score; ensembling;
D O I
10.18280/ts.400129
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image segmentation is an annotation method used to gain a deeper understanding of the images. Semantic segmentation involves constructing a pixel-by-pixel mask of an image by training a neural network. The accuracy of the semantic segmentation algorithms can be improved by eliminating background noise, and computational efficiency can be improved by using the pre-trained networks. This paper proposes a new architecture that ensemble inceptionV3, DenseNet, Resnet34 in the encoder part of UNET and ensemble inceptionV3, Resnet34, and VGG16 in the encoder part of FPN. The ensemble results are fused based on the weighted average and the predictions of the pixels are made on the fused features to perform semantic segmentation. The proposed architecture is implemented on Oxford-IIIT Pet Dataset, created by the visual geometry group, and on the SD saliency 900 dataset. The F1 score, IOU score, and Loss are used to evaluate segmentation model results. The results of the study show that the proposed architecture formed by the fusion of ensembled architectures is more accurate and efficient in segmenting oxford-IIIT pet dataset with the IoU score of 98.68% and segmenting the SD saliency 900 dataset with the IoU score of 66.78%.
引用
收藏
页码:297 / 307
页数:11
相关论文
共 16 条
[1]   Stacking and voting ensemble methods fusion to evaluate instructor performance in higher education [J].
Ahuja R. ;
Sharma S.C. .
International Journal of Information Technology, 2021, 13 (5) :1721-1731
[2]   Modified U-Net for liver cancer segmentation from computed tomography images with a new class balancing method [J].
Yodit Abebe Ayalew ;
Kinde Anlay Fante ;
Mohammed Aliy Mohammed .
BMC Biomedical Engineering, 3 (1)
[3]  
Guohua Zhu, 2020, 2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), P070, DOI 10.1109/ICAIIC48513.2020.9065216
[4]   Joint Attention-Guided Feature Fusion Network for Saliency Detection of Surface Defects [J].
Jiang, Xiaoheng ;
Yan, Feng ;
Lu, Yang ;
Wang, Ke ;
Guo, Shuai ;
Zhang, Tianzhu ;
Pang, Yanwei ;
Niu, Jianwei ;
Xu, Mingliang .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[5]  
kaggle, SD SAL 900
[6]   Portrait Segmentation Using Ensemble of Heterogeneous Deep-Learning Models [J].
Kim, Yong-Woon ;
Byun, Yung-Cheol ;
Krishna, Addapalli V. N. .
ENTROPY, 2021, 23 (02) :1-20
[7]  
Lozej J., 2018, 2018 IEEE international work conference on bioinspiredintelligence (IWOBI), P1, DOI [DOI 10.1109/IWOBI.2018.8464213, 10.1109/IWOBI.2018.8464213, 10.1109/IWOBI.2018]
[8]   A Fast Panoptic Segmentation Network for Self-Driving Scene Understanding [J].
Majid, Abdul ;
Kausar, Sumaira ;
Tehsin, Samabia ;
Jameel, Amina .
COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 43 (01) :27-43
[9]  
monkeydoodle@gmail.com, 2022, The Oxford-IIIT Pet Dataset
[10]   A Multi-Branch U-Net for Steel Surface Defect Type and Severity Segmentation [J].
Neven, Robby ;
Goedeme, Toon .
METALS, 2021, 11 (06)