S-Net: A Lightweight Real-Time Semantic Segmentation Network for Autonomous Driving

Cited by: 0
Authors
Mazhar, Saquib [1]
Atif, Nadeem [1]
Bhuyan, M. K. [1]
Ahamed, Shaik Rafi [1]
Affiliations
[1] Indian Inst Technol, Gauhati, Assam, India
Source
COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II | 2024 / Vol. 2010
Keywords
Computer vision; Autonomous driving; Semantic segmentation; Deep learning;
DOI
10.1007/978-3-031-58174-8_14
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Semantic segmentation of road-scene images for autonomous driving is a dense, pixel-level prediction task that must run in real time. Much deep learning research aims to improve segmentation accuracy, and network architecture design is central to that effort. On edge devices the task is harder because computing power is limited. Very deep encoder-decoder networks are fairly accurate, but their slow inference and large parameter counts make them unsuitable for small devices. Decoder-less models are fast but lose accuracy. To address this trade-off, we propose a novel architecture with a shallow decoder. Its building block leverages a multi-scale feature pyramid and efficiently learns semantic and contextual features. The network also benefits from uniquely placed encoder skip connections, which retain low-level features and thereby preserve boundary information that is often lost in deep networks. Experiments on the highly competitive Cityscapes and CamVid datasets demonstrate the efficiency of the proposed architecture. Our model achieves mean intersection-over-union scores of 72.5% and 67.5% on the Cityscapes and CamVid test sets, respectively, with only 0.6 million parameters, while running in real time.
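For readers unfamiliar with the reported metric, the following is a minimal sketch of how a mean intersection-over-union score can be computed from predicted and ground-truth class labels. It is not the authors' evaluation code. The function name `mean_iou`, the flat-list input format, and the `ignore_index=255` default (a common convention for unlabeled pixels) are assumptions made for illustration.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over two equal-length sequences of integer class ids.

    Hypothetical helper for illustration only; ignore_index=255 is an
    assumed convention for pixels excluded from evaluation.
    """
    inter = [0] * num_classes  # true positives per class
    union = [0] * num_classes  # TP + FP + FN per class
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # unlabeled pixel: skip entirely
        if p == t:
            inter[t] += 1
            union[t] += 1
        else:
            union[t] += 1  # false negative for the true class
            if 0 <= p < num_classes:
                union[p] += 1  # false positive for the predicted class
    # Average only over classes that appear in the prediction or the target.
    ious = [i / u for i, u in zip(inter, union) if u > 0]
    return sum(ious) / len(ious) if ious else float("nan")


if __name__ == "__main__":
    pred = [0, 0, 1, 1, 2, 2]
    target = [0, 1, 1, 1, 2, 255]
    print(f"mIoU = {mean_iou(pred, target, num_classes=3):.4f}")
```

In this toy input the per-class IoU values are 1/2, 2/3, and 1, so the mean is 13/18. Benchmark servers usually accumulate the per-class counts over the whole test set before averaging, rather than averaging per-image scores.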
Pages: 147-159
Number of pages: 13