Efficient Attention Pyramid Network for Semantic Segmentation

被引:8
|
作者
Yang, Qirui [1 ,2 ,3 ]
Ku, Tao [1 ,2 ]
Hu, Kunyuan [1 ,2 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China
[3] Univ Chinese Acad Sci, Sch Comp & Control, Beijing 100049, Peoples R China
关键词
Semantics; Convolution; Feature extraction; Task analysis; Image segmentation; Decoding; Computer vision; Semantic segmentation; attention mechanism; spatial pyramid; PASCAL VOC 2012; Cityscapes;
D O I
10.1109/ACCESS.2021.3053316
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation is a task that covers most of the perception needs of intelligent vehicles in an unified way. Recent studies witnessed that attention mechanisms achieve impressive performance in computer vision task. Current attention mechanisms based segmentation methods differ with each other in position and form of the attention mechanism, and perform differently in practice. This paper firstly introduces the effectiveness of multi-scale context features and attention mechanisms in segmentation tasks. We find that multi-scale and channel attention can play a vital role in constructing effective context features. Based on this analysis, this paper proposes an efficient attention pyramid network (EAPNet) for semantic segmentation. Specifically, to efficient handle the problem of segmenting objects at multiple scales, we design efficient channel attention pyramid (ECAP) which employ atrous convolution with channel attention in cascade or in parallel to capture multi-scale context by using multiple atrous rates. Furthermore, we propose a residual attention fusion block (RAFB), whose purpose is to simultaneously focus on meaningful low-level feature maps and spatial location information. At the same time, we will explore different channel attention modules and spatial attention modules, and describe their impact on network performance. We empirically evaluate our EAPNet on two semantic segmentation datasets, including PASCAL VOC 2012 and Cityscapes datasets. Experimental results show that without MS COCO pre-training and any post-processing, EAPNet achieved 81.7% mIoU on the PASCAL VOC 2012 validation set. With deeplabv3+ as the benchmark, EAPNet improve the model performance of more than 1.50% mIoU.
引用
收藏
页码:18867 / 18875
页数:9
相关论文
共 50 条
  • [1] Global Attention Pyramid Network for Semantic Segmentation
    Zhang, Na
    Li, Jun
    Li, Yongrui
    Du, Yang
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8728 - 8732
  • [2] PCANet: Pyramid convolutional attention network for semantic segmentation
    Sang, Haiwei
    Zhou, Qiuhao
    Zhao, Yong
    IMAGE AND VISION COMPUTING, 2020, 103
  • [3] An Efficient Sampling-Based Attention Network for Semantic Segmentation
    He, Xingjian
    Liu, Jing
    Wang, Weining
    Lu, Hanqing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2850 - 2863
  • [4] Embedded Attention Network for Semantic Segmentation
    Lv, Qingxuan
    Feng, Mingzhe
    Sun, Xin
    Dong, Junyu
    Chen, Changrui
    Zhang, Yu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01): : 326 - 333
  • [5] Semantic Segmentation With Attention Mechanism for Remote Sensing Images
    Zhao, Qi
    Liu, Jiahui
    Li, Yuewen
    Zhang, Hong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [6] Pyramid Fusion Transformer for Semantic Segmentation
    Qin, Zipeng
    Liu, Jianbo
    Zhang, Xiaolin
    Tian, Maoqing
    Zhou, Aojun
    Yi, Shuai
    Li, Hongsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9630 - 9643
  • [7] SPANet: Successive Pooling Attention Network for Semantic Segmentation of Remote Sensing Images
    Sun, Le
    Cheng, Shiwei
    Zheng, Yuhui
    Wu, Zebin
    Zhang, Jianwei
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 4045 - 4057
  • [8] Hybrid Multiple Attention Network for Semantic Segmentation in Aerial Images
    Niu, Ruigang
    Sun, Xian
    Tian, Yu
    Diao, Wenhui
    Chen, Kaiqiang
    Fu, Kun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [9] Satellite Component Semantic Segmentation: Video Dataset and Real-Time Pyramid Attention and Decoupled Attention Network
    Shao, Yadong
    Wu, Aodi
    Li, Shengyang
    Shu, Leizheng
    Wan, Xue
    Shao, Yuanbin
    Huo, Junyan
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (06) : 7315 - 7333
  • [10] Attention Guided Global Enhancement and Local Refinement Network for Semantic Segmentation
    Li, Jiangyun
    Zha, Sen
    Chen, Chen
    Ding, Meng
    Zhang, Tianxiang
    Yu, Hong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3211 - 3223