FANet: Feature attention network for semantic segmentation

被引:0
作者
Zhu, Lin [1 ]
Li, Linxi [1 ]
Tang, Mingwei [1 ]
Niu, Wenrui [1 ]
Xie, Jianhua [1 ]
Mao, Hongyun [1 ]
机构
[1] Xihua Univ, Sch Comp & Software Engn, Chengdu 610039, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic segmentation; Adjustment algorithm; Attention mechanism; Hybrid extraction module; Adaptive hierarchical fusion;
D O I
10.1016/j.image.2025.117330
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic segmentation based on scene parsing specifies a category label for each pixel in the image. Existing neural network models are useful tools for understanding the objects in the scene. However, they ignore the heterogeneity of information carried by individual features, leading to pixel classification confusion and unclear boundaries. Therefore, this paper proposes a novel Feature Attention Network (FANet). Firstly, the adjustment algorithm is presented to capture attention feature matrices that can effectively cherry-pick feature dependencies. Secondly, the hybrid extraction module (HEM) is constructed to aggregate long-term dependencies based on proposed adjustment algorithm. Finally, the proposed adaptive hierarchical fusion module (AHFM) is employed to aggregated multi-scale features by learning spatially filtering conflictive information, which improves the scale invariance of features. Experimental results on popular Benchmarks (such as PASCAL VOC 2012, Cityscapes and ADE20K) indicate that our algorithm achieves better performance than other algorithms.
引用
收藏
页数:10
相关论文
共 51 条
  • [1] Cervical Cancer Detection Using Segmentation on Pap smear Images
    Arya, Mithlesh
    Mittal, Namita
    Singh, Girdhari
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
  • [2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [3] Bai S., 2020, Adv. Neural Inf. Process. Syst. (NeurIPS)
  • [4] Chen L. -C., 2014, ARXIV
  • [5] CaMap: Camera-based Map Manipulation on Mobile Devices
    Chen, Liang
    Chen, Dongyi
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
  • [6] Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, 10.48550/arXiv.1706.05587,1706.05587]
  • [7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [8] Chen W.-Y., 2019, INT C LEARN REPR
  • [9] Xception: Deep Learning with Depthwise Separable Convolutions
    Chollet, Francois
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
  • [10] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223