Real-time Semantic Segmentation with Context Aggregation Network

被引:67
作者
Yang, Michael Ying [1 ]
Kumaar, Saumya [1 ]
Lyu, Ye [1 ]
Nex, Francesco [1 ]
机构
[1] Univ Twente, Fac Geoinformat Sci & Earth Observat ITC, Enschede, Netherlands
关键词
Semantic segmentation; Real-time; Convolutional neural network; Context aggregation network;
D O I
10.1016/j.isprsjprs.2021.06.006
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
With the increasing demand of autonomous systems, pixelwise semantic segmentation for visual scene understanding needs to be not only accurate but also efficient for potential real-time applications. In this paper, we propose Context Aggregation Network, a dual branch convolutional neural network, with significantly lower computational costs as compared to the state-of-the-art, while maintaining a competitive prediction accuracy. Building upon the existing dual branch architectures for high-speed semantic segmentation, we design a high resolution branch for effective spatial detailing and a context branch with light-weight versions of global aggregation and local distribution blocks, potent to capture both long-range and local contextual dependencies required for accurate semantic segmentation, with low computational overheads. We evaluate our method on two semantic segmentation datasets, namely Cityscapes dataset and UAVid dataset. For Cityscapes test set, our model achieves state-of-the-art results with mIOU of 75.9%, at 76 FPS on an NVIDIA RTX 2080Ti and 8 FPS on a Jetson Xavier NX. With regards to UAVid dataset, our proposed network achieves mIOU score of 63.5% with high execution speed (15 FPS).
引用
收藏
页码:124 / 134
页数:11
相关论文
共 70 条
[1]  
Agrawal P, 2014, LECT NOTES COMPUT SC, V8695, P329, DOI 10.1007/978-3-319-10584-0_22
[2]  
[Anonymous], INT C MACH LEARN ICM
[3]  
[Anonymous], 2015, ICLR
[4]  
[Anonymous], 2018, BRIT MACH VIS C
[5]  
[Anonymous], BRIT MACH VIS C
[6]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[7]  
Chen L.-C., 2017, arXiv preprint arXiv:2310.0182
[8]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[9]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[10]   NODIS: Neural Ordinary Differential Scene Understanding [J].
Cong, Yuren ;
Ackermann, Hanno ;
Liao, Wentong ;
Yang, Michael Ying ;
Rosenhahn, Bodo .
COMPUTER VISION - ECCV 2020, PT XX, 2020, 12365 :636-653