Efficient Attention Pyramid Network for Semantic Segmentation

Cited by: 8

Authors:

Yang, Qirui [1,2,3]

Ku, Tao [1,2]

Hu, Kunyuan [1,2]

Affiliations:

[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China

[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China

[3] Univ Chinese Acad Sci, Sch Comp & Control, Beijing 100049, Peoples R China

Source:

IEEE ACCESS | 2021 / Volume 9

Keywords:

Semantics; Convolution; Feature extraction; Task analysis; Image segmentation; Decoding; Computer vision; Semantic segmentation; attention mechanism; spatial pyramid; PASCAL VOC 2012; Cityscapes;

DOI:

10.1109/ACCESS.2021.3053316

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Semantic segmentation is a task that covers most of the perception needs of intelligent vehicles in a unified way. Recent studies have shown that attention mechanisms achieve impressive performance in computer vision tasks. Current attention-based segmentation methods differ from each other in the position and form of the attention mechanism, and perform differently in practice. This paper first analyzes the effectiveness of multi-scale context features and attention mechanisms in segmentation tasks. We find that multi-scale context and channel attention can play a vital role in constructing effective context features. Based on this analysis, this paper proposes an efficient attention pyramid network (EAPNet) for semantic segmentation. Specifically, to efficiently handle the problem of segmenting objects at multiple scales, we design an efficient channel attention pyramid (ECAP), which employs atrous convolution with channel attention, in cascade or in parallel, to capture multi-scale context by using multiple atrous rates. Furthermore, we propose a residual attention fusion block (RAFB), whose purpose is to focus simultaneously on meaningful low-level feature maps and spatial location information. We also explore different channel attention modules and spatial attention modules, and describe their impact on network performance. We empirically evaluate EAPNet on two semantic segmentation datasets, PASCAL VOC 2012 and Cityscapes. Experimental results show that, without MS COCO pre-training or any post-processing, EAPNet achieves 81.7% mIoU on the PASCAL VOC 2012 validation set. With DeepLabv3+ as the baseline, EAPNet improves model performance by more than 1.50% mIoU.
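
The results above are reported in mIoU (mean intersection over union). The following is a minimal sketch of how that metric can be computed for a single toy prediction; it is not the authors' evaluation code. The function name `mean_iou` and the flat label lists are assumptions made for illustration, and benchmark evaluations typically accumulate the counts over a whole dataset rather than per image.

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes that occur in pred or target.

    pred and target are flat sequences of integer class labels,
    one entry per pixel (an illustrative assumption).
    """
    ious = []
    for c in range(num_classes):
        # Pixels where both prediction and ground truth are class c.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels where either prediction or ground truth is class c.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # skip classes absent from both maps
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: six pixels, three classes.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.3f}")
```

In this toy case the per-class IoU values are 1/3, 2/3, and 1/2, so their mean is 0.5.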

Pages: 18867-18875

Page count: 9
