DFPNet: Dislocation Double Feature Pyramid Real-time Semantic Segmentation Network

Cited by: 1
Authors
Fang, Qin [1]
Qiu, Jun [2]
Wu, Hao [2]
Yang, Jie [3]
Affiliations
[1] Zhejiang Univ, Coll Elect Engn, Hangzhou, Peoples R China
[2] NingboTech Univ, Ningbo, Peoples R China
[3] Zhejiang Univ, Ind Technol Res Inst, Hangzhou, Peoples R China
Source
2020 Chinese Automation Congress (CAC 2020) | 2020
Keywords
semantic segmentation; real time; computer vision; deep learning
DOI
10.1109/CAC51589.2020.9327332
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The feature pyramid structure has become an almost standard component of object detection networks. This paper introduces a dislocation double feature pyramid structure and integrates it into a lightweight segmentation network. We also use a classic segmentation module, atrous spatial pyramid pooling (ASPP), to extract rich contextual information. We call the resulting network DFPNet. To verify the performance gain contributed by the dislocation double feature pyramid structure, we run extensive experiments on two datasets (Cityscapes and CamVid) and show that DFPNet obtains competitive results with our novel feature pyramid module. In particular, DFPNet achieves 73.1% mean IoU (mIoU) on the CamVid validation set with only 5.5M parameters and a runtime of 117 milliseconds per image on a single RTX 2080Ti. Our code and model are open source at https://github.com/Fang789/pytorch_seg.
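For readers unfamiliar with the metric quoted in the abstract, the following is a minimal sketch of how mean IoU can be computed over flat lists of class labels. It is an illustration only, not the authors' evaluation code: the function name `mean_iou`, the toy labels, and the choice to skip classes absent from both prediction and ground truth are assumptions, and the paper's protocol (for example, ignored labels or accumulation over a whole dataset) may differ.

```python
def mean_iou(pred, target, num_classes):
    """Average the per-class intersection-over-union.

    pred and target are equal-length sequences of integer class labels
    (for an image, the flattened per-pixel labels). Classes that appear
    in neither sequence are skipped; this is an assumption of the sketch.
    """
    ious = []
    for c in range(num_classes):
        # Pixels where both prediction and ground truth are class c.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels where either prediction or ground truth is class c.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three classes and six "pixels" (hypothetical data).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.3f}")
```

In the toy example the per-class IoUs are 1/3, 2/3, and 1/2, so the mean is 0.5.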
Pages: 2587-2592
Page count: 6