Improved U-Net with gray channel attention for image segmentation

被引：0

作者：

Pan, Feng ^{[1
]}

Geng, Lujing ^{[1
]}

Zhang, Ning ^{[2
]}

Chen, Zuhao ^{[1
]}

机构：

[1] China Mobile Grp Design Inst Co Ltd, Network Super Prod Dept, Beijing, Peoples R China

[2] China Mobile Grp Co Ltd, Network Business Dept, Beijing, Peoples R China

来源：

2024 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMMUNICATIONS AND COMPUTING, ICICC 2024 | 2024年

关键词：

image segment; U-Net; attention mechanism; deep learning;

D O I：

10.1109/ICICC63565.2024.10780506

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image segmentation base on deep learning methods is an important direction in computer vision field. However, these models over-rely on color features in image segmentation tasks, which leads to poor segmentation effect in scenes with the interference of similar background colors. To solve this problem, this paper successfully improves the U-Net model by introducing the technical means of combining gray channel and attention mechanism. The experimental results show that compared with the original U-Net model, the average accuracy of the improved U-Net with gray channel attention has increased from 81.69% to 82.61%. At the same time, we apply this method mechanism to improved models of U-Net such as Attention U-Net and R2U-net, and similar effect is verified. These results verify that the combination of gray channel and attention mechanism can effectively improve the robustness and accuracy of deep learning model when processing color-similar background in image segmentation tasks. This work has important practical application value and provides a new solution for image segmentation tasks in complex scenes.

引用

页码：70 / 73

页数：4

共 18 条

[1]

Chen LC, 2017, Arxiv, DOI arXiv:1706.05587

[2] Visual fire detection using deep learning: A survey [J].

Cheng, Guangtao ;

Chen, Xue ;

Wang, Chenyi ;

Li, Xiaobo ;

Xian, Baoyi ;

Yu, Hao .

NEUROCOMPUTING, 2024, 596

[3] Learning what and where to segment: A new perspective on medical image few-shot segmentation [J].

Feng, Yong ;

Wang, Yonghuai ;

Li, Honghe ;

Qu, Mingjun ;

Yang, Jinzhu .

MEDICAL IMAGE ANALYSIS, 2023, 87

[4] CE-Net: Context Encoder Network for 2D Medical Image Segmentation [J].

Gu, Zaiwang ;

Cheng, Jun ;

Fu, Huazhu ;

Zhou, Kang ;

Hao, Huaying ;

Zhao, Yitian ;

Zhang, Tianyang ;

Gao, Shenghua ;

Liu, Jiang .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (10) :2281-2292

[5]

Habiba A A, 2020, J. journal of cardiovascular disease research, V11, P83

[6] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[7]

Huang G, 2018, Arxiv, DOI arXiv:1608.06993

[8]

Huang H, 2020, arXiv, DOI [10.1109/ICASSP40776.2020.9053405, DOI 10.1109/ICASSP40776.2020.9053405]

[9] A survey of deep learning for MRI brain tumor segmentation methods: Trends, challenges, and future directions [J].

Krishnapriya, Srigiri ;

Karuna, Yepuganti .

HEALTH AND TECHNOLOGY, 2023, 13 (02) :181-201

[10] Chicken Image Segmentation via Multi-Scale Attention-Based Deep Convolutional Neural Network [J].

Li, Wei ;

Xiao, Yang ;

Song, Xibin ;

Lv, Na ;

Jiang, Xinbo ;

Huang, Yan ;

Peng, Jingliang .

IEEE ACCESS, 2021, 9 :61398-61407

← 1 2 →