FANet: Feature attention network for semantic segmentation

被引：0

作者：

Zhu, Lin ^{[1
]}

Li, Linxi ^{[1
]}

Tang, Mingwei ^{[1
]}

Niu, Wenrui ^{[1
]}

Xie, Jianhua ^{[1
]}

Mao, Hongyun ^{[1
]}

机构：

[1] Xihua Univ, Sch Comp & Software Engn, Chengdu 610039, Sichuan, Peoples R China

来源：

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2025年 / 138卷

基金：

中国国家自然科学基金;

关键词：

Semantic segmentation; Adjustment algorithm; Attention mechanism; Hybrid extraction module; Adaptive hierarchical fusion;

D O I：

10.1016/j.image.2025.117330

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Semantic segmentation based on scene parsing specifies a category label for each pixel in the image. Existing neural network models are useful tools for understanding the objects in the scene. However, they ignore the heterogeneity of information carried by individual features, leading to pixel classification confusion and unclear boundaries. Therefore, this paper proposes a novel Feature Attention Network (FANet). Firstly, the adjustment algorithm is presented to capture attention feature matrices that can effectively cherry-pick feature dependencies. Secondly, the hybrid extraction module (HEM) is constructed to aggregate long-term dependencies based on proposed adjustment algorithm. Finally, the proposed adaptive hierarchical fusion module (AHFM) is employed to aggregated multi-scale features by learning spatially filtering conflictive information, which improves the scale invariance of features. Experimental results on popular Benchmarks (such as PASCAL VOC 2012, Cityscapes and ADE20K) indicate that our algorithm achieves better performance than other algorithms.

引用

页数：10

共 51 条

[1] Cervical Cancer Detection Using Segmentation on Pap smear Images
Arya, Mithlesh
Mittal, Namita
Singh, Girdhari
[J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
[2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[3] Bai S., 2020, Adv. Neural Inf. Process. Syst. (NeurIPS)
[4] Chen L. -C., 2014, ARXIV
[5] CaMap: Camera-based Map Manipulation on Mobile Devices
Chen, Liang
Chen, Dongyi
[J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
[6] Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, 10.48550/arXiv.1706.05587,1706.05587]
[7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Chen, Liang-Chieh
Papandreou, George
Kokkinos, Iasonas
Murphy, Kevin
Yuille, Alan L.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
[8] Chen W.-Y., 2019, INT C LEARN REPR
[9] Xception: Deep Learning with Depthwise Separable Convolutions
Chollet, Francois
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
[10] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223

← 1 2 3 4 5 6 →