Image semantic segmentation approach based on DeepLabV3 plus network with an attention mechanism

被引:25
|
作者
Liu, Yanyan [1 ]
Bai, Xiaotian [2 ]
Wang, Jiafei [1 ]
Li, Guoning [2 ]
Li, Jin [3 ]
Lv, Zengming [2 ]
机构
[1] Changchun Univ Sci & Technol, Dept Elect & Informat Engn, Changchun 130022, Peoples R China
[2] Chinese Acad Sci CIOMP, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China
[3] Beihang Univ, Sch Instrumentat & Optoelect Engn, Beijing, Peoples R China
关键词
Image semantic segmentation is a technique that distinguishes different kinds of things in an image by assigning a label to each point in a target category based on its semantics. The Deeplabv3+ image semantic segmentation method currently in use has high computational complexity and large memory consumption; making it difficult to deploy on embedded platforms with limited computational power. When extracting image feature information; Deeplabv3+ struggles to fully utilize multiscale information. This can result in a loss of detailed information and damage to segmentation accuracy. An improved image semantic segmentation method based on the DeepLabv3+ network is proposed; with the lightweight MobileNetv2 serving as the model's backbone. The ECAnet channel attention mechanism is applied to low-level features; reducing computational complexity and improving target boundary clarity. The polarized self-attention mechanism is introduced after the ASPP module to improve the spatial feature representation of the feature map. Validated on the VOC2012 dataset; the experimental results indicate that the improved model achieved an MloU of 69.29% and a mAP of 80.41%; which can predict finer semantic segmentation results and effectively optimize the model complexity and segmentation accuracy. © 2023;
D O I
10.1016/j.engappai.2023.107260
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image semantic segmentation is a technique that distinguishes different kinds of things in an image by assigning a label to each point in a target category based on its "semantics". The Deeplabv3+ image semantic segmentation method currently in use has high computational complexity and large memory consumption, making it difficult to deploy on embedded platforms with limited computational power. When extracting image feature information, Deeplabv3+ struggles to fully utilize multiscale information. This can result in a loss of detailed information and damage to segmentation accuracy. An improved image semantic segmentation method based on the DeepLabv3+ network is proposed, with the lightweight MobileNetv2 serving as the model's backbone. The ECAnet channel attention mechanism is applied to low-level features, reducing computational complexity and improving target boundary clarity. The polarized self-attention mechanism is introduced after the ASPP module to improve the spatial feature representation of the feature map. Validated on the VOC2012 dataset, the experimental results indicate that the improved model achieved an MloU of 69.29% and a mAP of 80.41%, which can predict finer semantic segmentation results and effectively optimize the model complexity and segmentation accuracy.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Image Semantic Segmentation Method Based on Improved DeepLabv3
    Cong, Xu
    Li, Wang
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (16)
  • [2] DPNet: Dual-Pyramid Semantic Segmentation Network Based on Improved Deeplabv3 Plus
    Wang, Jun
    Zhang, Xiaolin
    Yan, Tianhong
    Tan, Aihong
    ELECTRONICS, 2023, 12 (14)
  • [3] Image Semantic Segmentation Based on Combination of DeepLabV3+ and Attention Mechanism
    Qiu Yunfei
    Wen Jinyan
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (04)
  • [4] Semantic Segmentation with Extended DeepLabv3 Architecture
    Yurtkulu, Salih Can
    Sahin, Yusuf Huseyin
    Unal, Gozde
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [5] The Effect of Resnet Model as Feature Extractor Network to Performance of DeepLabV3 Model for Semantic Satellite Image Segmentation
    Heryadi, Yaya
    Irwansyah, Edy
    Miranda, Eka
    Soeparno, Haryono
    Herlawati
    Hashimoto, Kiyota
    2020 IEEE ASIA-PACIFIC CONFERENCE ON GEOSCIENCE, ELECTRONICS AND REMOTE SENSING TECHNOLOGY (AGERS 2020): UNDERSTANDING THE INTERCTION OF LAND, OCEAN AND ATMOSPHERE: DISASTER MITIGATION AND REGIONAL RESILLIENCE, 2020, : 74 - 77
  • [6] Multi-scale dense and attention mechanism for image semantic segmentation based on improved DeepLabv3+
    Wang, Zuoshuai
    Zhang, Hongyi
    Huang, Zhiquan
    Lin, Zhibin
    Wu, Hangxing
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (05)
  • [7] Road Extraction Based On Improved Deeplabv3 Plus In Remote Sensing Image
    Wang, Hanxiang
    Yu, Fan
    Xie, Junwei
    Wang, Haonan
    Zheng, Haotian
    URBAN GEOINFORMATICS 2022, 2022, : 67 - 72
  • [8] Algorithm for segmentation of smoke using the improved DeeplabV3 network
    Wang Z.
    Su Y.
    Liu Y.
    Zhang W.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (06): : 52 - 59
  • [9] Fast Segmentation Algorithm of USV Accessible Area Based on Attention Fast Deeplabv3
    Cheng, Liang
    Xiong, Rui
    Wu, Jiarong
    Yan, Xuemei
    Yang, Chunli
    Zhang, Yongjie
    He, Yunze
    IEEE SENSORS JOURNAL, 2024, 24 (15) : 24168 - 24177
  • [10] RTC_TongueNet: An improved tongue image segmentation model based on DeepLabV3
    Tang, Yan
    Tan, Daiqing
    Li, Huixia
    Zhu, Muhua
    Li, Xiaohui
    Wang, Xuan
    Wang, Jiaqi
    Wang, Zaijian
    Gao, Chenxi
    Wang, Ji
    Han, Aiqing
    DIGITAL HEALTH, 2024, 10