Image semantic segmentation approach based on DeepLabV3 plus network with an attention mechanism

被引：25

作者：

Liu, Yanyan ^{[1
]}

Bai, Xiaotian ^{[2
]}

Wang, Jiafei ^{[1
]}

Li, Guoning ^{[2
]}

Li, Jin ^{[3
]}

Lv, Zengming ^{[2
]}

机构：

[1] Changchun Univ Sci & Technol, Dept Elect & Informat Engn, Changchun 130022, Peoples R China

[2] Chinese Acad Sci CIOMP, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China

[3] Beihang Univ, Sch Instrumentat & Optoelect Engn, Beijing, Peoples R China

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2024年 / 127卷

关键词：

Image semantic segmentation is a technique that distinguishes different kinds of things in an image by assigning a label to each point in a target category based on its semantics. The Deeplabv3+ image semantic segmentation method currently in use has high computational complexity and large memory consumption; making it difficult to deploy on embedded platforms with limited computational power. When extracting image feature information; Deeplabv3+ struggles to fully utilize multiscale information. This can result in a loss of detailed information and damage to segmentation accuracy. An improved image semantic segmentation method based on the DeepLabv3+ network is proposed; with the lightweight MobileNetv2 serving as the model's backbone. The ECAnet channel attention mechanism is applied to low-level features; reducing computational complexity and improving target boundary clarity. The polarized self-attention mechanism is introduced after the ASPP module to improve the spatial feature representation of the feature map. Validated on the VOC2012 dataset; the experimental results indicate that the improved model achieved an MloU of 69.29% and a mAP of 80.41%; which can predict finer semantic segmentation results and effectively optimize the model complexity and segmentation accuracy. © 2023;

D O I：

10.1016/j.engappai.2023.107260

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Image semantic segmentation is a technique that distinguishes different kinds of things in an image by assigning a label to each point in a target category based on its "semantics". The Deeplabv3+ image semantic segmentation method currently in use has high computational complexity and large memory consumption, making it difficult to deploy on embedded platforms with limited computational power. When extracting image feature information, Deeplabv3+ struggles to fully utilize multiscale information. This can result in a loss of detailed information and damage to segmentation accuracy. An improved image semantic segmentation method based on the DeepLabv3+ network is proposed, with the lightweight MobileNetv2 serving as the model's backbone. The ECAnet channel attention mechanism is applied to low-level features, reducing computational complexity and improving target boundary clarity. The polarized self-attention mechanism is introduced after the ASPP module to improve the spatial feature representation of the feature map. Validated on the VOC2012 dataset, the experimental results indicate that the improved model achieved an MloU of 69.29% and a mAP of 80.41%, which can predict finer semantic segmentation results and effectively optimize the model complexity and segmentation accuracy.

引用

页数：8

共 50 条

[1] Image Semantic Segmentation Method Based on Improved DeepLabv3
Cong, Xu
Li, Wang
LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (16)
[2] DPNet: Dual-Pyramid Semantic Segmentation Network Based on Improved Deeplabv3 Plus
Wang, Jun
Zhang, Xiaolin
Yan, Tianhong
Tan, Aihong
ELECTRONICS, 2023, 12 (14)
[3] Image Semantic Segmentation Based on Combination of DeepLabV3+ and Attention Mechanism
Qiu Yunfei
Wen Jinyan
LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (04)
[4] Semantic Segmentation with Extended DeepLabv3 Architecture
Yurtkulu, Salih Can
Sahin, Yusuf Huseyin
Unal, Gozde
2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
[5] The Effect of Resnet Model as Feature Extractor Network to Performance of DeepLabV3 Model for Semantic Satellite Image Segmentation
Heryadi, Yaya
Irwansyah, Edy
Miranda, Eka
Soeparno, Haryono
Herlawati
Hashimoto, Kiyota
2020 IEEE ASIA-PACIFIC CONFERENCE ON GEOSCIENCE, ELECTRONICS AND REMOTE SENSING TECHNOLOGY (AGERS 2020): UNDERSTANDING THE INTERCTION OF LAND, OCEAN AND ATMOSPHERE: DISASTER MITIGATION AND REGIONAL RESILLIENCE, 2020, : 74 - 77
[6] Multi-scale dense and attention mechanism for image semantic segmentation based on improved DeepLabv3+
Wang, Zuoshuai
Zhang, Hongyi
Huang, Zhiquan
Lin, Zhibin
Wu, Hangxing
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (05)
[7] Road Extraction Based On Improved Deeplabv3 Plus In Remote Sensing Image
Wang, Hanxiang
Yu, Fan
Xie, Junwei
Wang, Haonan
Zheng, Haotian
URBAN GEOINFORMATICS 2022, 2022, : 67 - 72
[8] Algorithm for segmentation of smoke using the improved DeeplabV3 network
Wang Z.
Su Y.
Liu Y.
Zhang W.
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (06): : 52 - 59
[9] Fast Segmentation Algorithm of USV Accessible Area Based on Attention Fast Deeplabv3
Cheng, Liang
Xiong, Rui
Wu, Jiarong
Yan, Xuemei
Yang, Chunli
Zhang, Yongjie
He, Yunze
IEEE SENSORS JOURNAL, 2024, 24 (15) : 24168 - 24177
[10] RTC_TongueNet: An improved tongue image segmentation model based on DeepLabV3
Tang, Yan
Tan, Daiqing
Li, Huixia
Zhu, Muhua
Li, Xiaohui
Wang, Xuan
Wang, Jiaqi
Wang, Zaijian
Gao, Chenxi
Wang, Ji
Han, Aiqing
DIGITAL HEALTH, 2024, 10

← 1 2 3 4 5 →