Efficient Attention Pyramid Network for Semantic Segmentation

被引:8
|
作者
Yang, Qirui [1 ,2 ,3 ]
Ku, Tao [1 ,2 ]
Hu, Kunyuan [1 ,2 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China
[3] Univ Chinese Acad Sci, Sch Comp & Control, Beijing 100049, Peoples R China
关键词
Semantics; Convolution; Feature extraction; Task analysis; Image segmentation; Decoding; Computer vision; Semantic segmentation; attention mechanism; spatial pyramid; PASCAL VOC 2012; Cityscapes;
D O I
10.1109/ACCESS.2021.3053316
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation is a task that covers most of the perception needs of intelligent vehicles in an unified way. Recent studies witnessed that attention mechanisms achieve impressive performance in computer vision task. Current attention mechanisms based segmentation methods differ with each other in position and form of the attention mechanism, and perform differently in practice. This paper firstly introduces the effectiveness of multi-scale context features and attention mechanisms in segmentation tasks. We find that multi-scale and channel attention can play a vital role in constructing effective context features. Based on this analysis, this paper proposes an efficient attention pyramid network (EAPNet) for semantic segmentation. Specifically, to efficient handle the problem of segmenting objects at multiple scales, we design efficient channel attention pyramid (ECAP) which employ atrous convolution with channel attention in cascade or in parallel to capture multi-scale context by using multiple atrous rates. Furthermore, we propose a residual attention fusion block (RAFB), whose purpose is to simultaneously focus on meaningful low-level feature maps and spatial location information. At the same time, we will explore different channel attention modules and spatial attention modules, and describe their impact on network performance. We empirically evaluate our EAPNet on two semantic segmentation datasets, including PASCAL VOC 2012 and Cityscapes datasets. Experimental results show that without MS COCO pre-training and any post-processing, EAPNet achieved 81.7% mIoU on the PASCAL VOC 2012 validation set. With deeplabv3+ as the benchmark, EAPNet improve the model performance of more than 1.50% mIoU.
引用
收藏
页码:18867 / 18875
页数:9
相关论文
共 50 条
  • [21] Attention-Guided Pyramid Context Network for Polyp Segmentation in Colonoscopy Images
    Yue, Guanghui
    Li, Siying
    Cong, Runmin
    Zhou, Tianwei
    Lei, Baiying
    Wang, Tianfu
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [22] MsanlfNet: Semantic Segmentation Network With Multiscale Attention and Nonlocal Filters for High-Resolution Remote Sensing Images
    Bai, Lin
    Lin, Xiangyuan
    Ye, Zhen
    Xue, Dongling
    Yao, Cheng
    Hui, Meng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [23] Residual Pyramid Learning for Single-Shot Semantic Segmentation
    Chen, Xiaoyu
    Lou, Xiaotian
    Bai, Lianfa
    Han, Jing
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (07) : 2990 - 3000
  • [24] Convolutional Neural Network-Based Pavement Crack Segmentation Using Pyramid Attention Network
    Wang, Wenjun
    Su, Chao
    IEEE ACCESS, 2020, 8 : 206548 - 206558
  • [25] MFAFNet: A Multiscale Fully Attention Fusion Network for Remote Sensing Image Semantic Segmentation
    Dang, Yuanyuan
    Gao, Yu
    Liu, Bing
    IEEE ACCESS, 2024, 12 : 123388 - 123400
  • [26] Pyramid Self-attention for Semantic Segmentation
    Qi, Jiyang
    Wang, Xinggang
    Hu, Yao
    Tang, Xu
    Liu, Wenyu
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 480 - 492
  • [27] Light-Weight Semantic Segmentation Network for UAV Remote Sensing Images
    Liu, Siyu
    Cheng, Jian
    Liang, Leikun
    Bai, Haiwei
    Dang, Wanli
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 8287 - 8296
  • [28] BANet: Boundary-Assistant Encoder-Decoder Network for Semantic Segmentation
    Zhou, Quan
    Qiang, Yong
    Mo, Yuwei
    Wu, Xiaofu
    Latecki, Longin Jan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25259 - 25270
  • [29] LDPNet: A Lightweight Densely Connected Pyramid Network for Real-Time Semantic Segmentation
    Hu, Xuegang
    Jing, Liyuan
    IEEE ACCESS, 2020, 8 : 212647 - 212658
  • [30] ESMS-Net: Enhancing Semantic-Mask Segmentation Network With Pyramid Atrousformer for Remote Sensing Image
    Liu, Jiamin
    Wang, Ziyi
    Luo, Fulin
    Guo, Tan
    Yang, Feng
    Gao, Xinbo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62