FPANet: Feature pyramid aggregation network for real-time semantic segmentation

Cited by: 46
Authors
Wu, Yun [1]
Jiang, Jianyong [1]
Huang, Zimeng [1]
Tian, Youliang [1]
Affiliations
[1] Guizhou Univ, Coll Comp Sci & Technol, Guiyang 550025, Guizhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Real-time; Feature pyramid network; Atrous spatial pyramid pooling; Feature fusion; Border refinement
DOI
10.1007/s10489-021-02603-z
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semantic segmentation is used in many fields, most of which require not only high-quality predictions but also real-time speed during forward inference. To perform high-quality real-time semantic segmentation, we propose the feature pyramid aggregation network (FPANet), which can be regarded as an encoder-decoder model. In the encoder stage, we use ResNet and atrous spatial pyramid pooling (ASPP) to extract more high-level semantic information. In the decoder stage, to obtain the semantic and spatial information of the image simultaneously, we propose a bilateral directional feature pyramid network for semantic segmentation, named SeBiFPN, which fuses features at different levels. In SeBiFPN, we design a lightweight feature pyramid fusion module (FPFM) to fuse features from two different levels. In addition, because most real-time semantic segmentation models perform poorly when predicting the border regions of an image, we propose a border refinement module (BRM) to reduce inaccurate border segmentation. To reduce the computational complexity of the model, we redesign the ASPP module and reduce the number of feature channels during feature fusion. Our method achieves a better balance of speed and accuracy than state-of-the-art methods on the Cityscapes and CamVid datasets.
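As a rough illustration of the atrous (dilated) convolution that ASPP builds on, the following pure-Python sketch applies one kernel at several dilation rates in parallel and merges the results. It is a 1-D toy under stated assumptions, not the redesigned ASPP module of FPANet: the function names `atrous_conv1d` and `aspp_like` are hypothetical, and the position-wise average is only a stand-in for the concatenation and learned projection that ASPP implementations typically use.

```python
def atrous_conv1d(signal, kernel, rate):
    """1-D atrous (dilated) convolution with zero padding and same-length output.

    Kernel taps are spaced `rate` samples apart, so a larger rate widens the
    receptive field without adding weights. As in most deep learning code,
    the kernel is not flipped (cross-correlation).
    """
    half = (len(kernel) // 2) * rate  # reach of the dilated kernel on each side
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = i - half + k * rate  # input index sampled by tap k
            if 0 <= j < len(signal):  # positions outside the signal count as zero
                acc += w * signal[j]
        out.append(acc)
    return out


def aspp_like(signal, kernel, rates):
    """Run one atrous branch per rate and average them position-wise.

    Assumption: averaging replaces the learned fusion of a real ASPP block.
    """
    branches = [atrous_conv1d(signal, kernel, r) for r in rates]
    return [sum(vals) / len(branches) for vals in zip(*branches)]


signal = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0]  # a single impulse
kernel = [1.0, 1.0, 1.0]

print(atrous_conv1d(signal, kernel, rate=1))  # taps at adjacent samples
print(atrous_conv1d(signal, kernel, rate=2))  # taps two samples apart
print(aspp_like(signal, kernel, rates=[1, 2]))
```

With rate 1 the impulse spreads to its immediate neighbours; with rate 2 the same three weights reach two samples away on each side, which is how the parallel branches gather context at several scales at the same parameter cost.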
Pages: 3319-3336
Page count: 18
Related papers
50 in total
  • [1] FPANet: Feature pyramid aggregation network for real-time semantic segmentation
    Yun Wu
    Jianyong Jiang
    Zimeng Huang
    Youliang Tian
    Applied Intelligence, 2022, 52 : 3319 - 3336
  • [2] DFPNet: Dislocation Double Feature Pyramid Real-time Semantic Segmentation Network
    Fang, Qin
    Qiu, Jun
    Wu, Hao
    Yang, Jie
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2587 - 2592
  • [3] Real-time Semantic Segmentation with Context Aggregation Network
    Yang, Michael Ying
    Kumaar, Saumya
    Lyu, Ye
    Nex, Francesco
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 178 : 124 - 134
  • [4] DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation
    Li, Hanchao
    Xiong, Pengfei
    Fan, Haoqiang
    Sun, Jian
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9514 - 9523
  • [5] Real-Time Semantic Edge Segmentation Using Modified Channelwise Feature Pyramid
    Harish H.
    Murthy A.S.
    SN Computer Science, 5 (1)
  • [6] CFPNET: CHANNEL-WISE FEATURE PYRAMID FOR REAL-TIME SEMANTIC SEGMENTATION
    Lou, Ange
    Loew, Murray
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1894 - 1898
  • [7] DSMRSeg: Dual-Stage Feature Pyramid and Multi-Range Context Aggregation for Real-Time Semantic Segmentation
    Yang, Mingdong
    Shi, Ying
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 265 - 273
  • [8] Joint pyramid attention network for real-time semantic segmentation of urban scenes
    Hu, Xuegang
    Jing, Liyuan
    Sehar, Uroosa
    APPLIED INTELLIGENCE, 2022, 52 (01) : 580 - 594
  • [9] Joint pyramid attention network for real-time semantic segmentation of urban scenes
    Xuegang Hu
    Liyuan Jing
    Uroosa Sehar
    Applied Intelligence, 2022, 52 : 580 - 594
  • [10] LDPNet: A Lightweight Densely Connected Pyramid Network for Real-Time Semantic Segmentation
    Hu, Xuegang
    Jing, Liyuan
    IEEE ACCESS, 2020, 8 : 212647 - 212658