Dual context prior and refined prediction for semantic segmentation

Cited by: 9
Authors
Chen, Long [1]
Liu, Jiajie [1]
Li, Han [1]
Zhan, Wujing [1]
Zhou, Baoding [2, 3, 4]
Li, Qingquan [2, 3]
Affiliations
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
[2] Shenzhen Univ, Guangdong Key Lab Urban Informat, Shenzhen, Peoples R China
[3] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen, Peoples R China
[4] Shenzhen Univ, Civil & Transportat Engn, Shenzhen, Peoples R China
Keywords
Deep learning; semantic segmentation; linear spatial propagation; context information
DOI
10.1080/10095020.2020.1785957
Chinese Library Classification (CLC) number
TP7 [Remote sensing technology]
Discipline classification codes
081102; 0816; 081602; 083002; 1404
Abstract
Recently, the focus of semantic segmentation research has shifted to aggregating context priors and refining boundaries. A typical network adopts context aggregation modules to extract rich semantic features, and uses top-down connections and skip connections to refine boundary details. However, this design still has drawbacks: false segmentation occurs when an object contains very different textures, and fusing weak semantic features with low-level features degrades the context prior. To tackle these issues, we propose a simple yet effective network, dubbed DSPNet, which integrates a dual context prior with spatial propagation. It extends two mainstream directions of current segmentation research: (1) a dual context prior module revisits the context prior through a shortcut connection; (2) the network inherently learns semantic-aware affinity values for each pixel and uses them to refine the segmentation. We present detailed comparisons on PASCAL VOC 2012 and Cityscapes. The results demonstrate the validity of our approach.
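The abstract does not state the propagation rule, so the following is only a minimal sketch of what "linear spatial propagation" with per-pixel affinities can look like. It assumes a single left-to-right scan in which each pixel blends its own score with the already refined score of its left neighbour. The function name `propagate_left_to_right`, the one-neighbour recurrence, and the requirement that affinities lie in [0, 1] are illustrative assumptions, not the authors' DSPNet implementation.

```python
def propagate_left_to_right(scores, affinity):
    """Refine a 2D score map with a one-way linear recurrence (illustrative).

    For every row, with x = coarse score and w = learned affinity:
        h[0] = x[0]
        h[t] = (1 - w[t]) * x[t] + w[t] * h[t - 1]
    A high affinity copies the neighbour's refined score (same object);
    a low affinity keeps the pixel's own score (likely a boundary).
    """
    if len(scores) != len(affinity):
        raise ValueError("scores and affinity must have the same number of rows")

    refined = []
    for row_x, row_w in zip(scores, affinity):
        if len(row_x) != len(row_w):
            raise ValueError("each scores row must match its affinity row in length")
        row_h = []
        for t, (x, w) in enumerate(zip(row_x, row_w)):
            if not 0.0 <= w <= 1.0:
                raise ValueError("affinity values are assumed to lie in [0, 1]")
            if t == 0:
                # The first pixel in the scan has no left neighbour.
                row_h.append(x)
            else:
                row_h.append((1.0 - w) * x + w * row_h[-1])
        refined.append(row_h)
    return refined


if __name__ == "__main__":
    coarse = [[1.0, 0.0, 0.0]]    # hypothetical per-pixel class scores
    weights = [[0.0, 0.5, 0.5]]   # hypothetical learned affinities
    print(propagate_left_to_right(coarse, weights))
```

In this sketch a confident score on the left spreads to the right with a strength set by the affinities, which is one way a network could smooth labels inside an object while leaving boundaries (low affinity) untouched. A practical module would typically scan in several directions and operate on feature tensors rather than Python lists.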
Pages: 228-240
Page count: 13