DRMNet: more efficient bilateral networks for real-time semantic segmentation of road scenes

被引:1
|
作者
Zhang, Wenming [1 ]
Zhang, Shaotong [1 ]
Li, Yaqian [1 ]
Li, Haibin [1 ]
Song, Tao [2 ]
机构
[1] Yanshan Univ, Key Lab Ind Comp Control Engn Hebei Prov, Qinhuangdao 066004, Peoples R China
[2] Yanshan Univ, Sch Elect Engn, Hebei Prov Key Lab Test Measurement Technol & Inst, Qinhuangdao 066004, Peoples R China
基金
中国国家自然科学基金;
关键词
Real-time; Lightweight network; Semantic segmentation; Feature fusion; Attention mechanism;
D O I
10.1007/s11554-024-01579-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation is crucial in autonomous driving because of its accurate identification and segmentation of objects and regions. However, there is a conflict between segmentation accuracy and real-time performance on embedded devices. We propose an efficient lightweight semantic segmentation network (DRMNet) to solve these problems. Employing a streamlined bilateral structure, the model encodes semantic and spatial paths, cross-fusing features during encoding, and incorporates unique skip connections to coordinate upsampling within the semantic pathway. We design a new self-calibrated aggregate pyramid pooling module (SAPPM) at the end of the semantic branch to capture more comprehensive multi-scale semantic information and balance its extraction and inference speed. Furthermore, we designed a new feature fusion module, which guides the fusion of detail features and semantic features through attention perception, alleviating the problem of semantic information quickly covering spatial detail information. Experimental results on the CityScapes, CamVid, and NightCity datasets demonstrate the effectiveness of DRMNet. On a 2080Ti GPU, DRMNet achieves 78.6% mIoU at 88.3 FPS on the CityScapes dataset, 78.9% mIoU at 149 FPS on the CamVid dataset, and 53.5% mIoU at 160.4 FPS on the NightCity dataset. These results highlight the model's ability to balance accuracy and real-time performance better, making it suitable for embedded devices in autonomous driving applications.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Research on Efficient Asymmetric Attention Module for Real-Time Semantic Segmentation Networks in Urban Scenes
    Su, Xu
    Li, Lihong
    Xiao, Jiejie
    Wang, Pengtao
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (03) : 562 - 572
  • [2] MFNet: Multi-Feature Fusion Network for Real-Time Semantic Segmentation in Road Scenes
    Lu, Mengxu
    Chen, Zhenxue
    Liu, Chengyun
    Ma, Sile
    Cai, Lei
    Qin, Hao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 20991 - 21003
  • [3] Rethinking DABNet: Light-Weight Network for Real-Time Semantic Segmentation of Road Scenes
    Mazhar S.
    Atif N.
    Bhuyan M.K.
    Ahamed S.R.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (06): : 3098 - 3108
  • [4] RTSNet: Real-Time Semantic Segmentation Network For Outdoor Scenes
    Ma, Mingyu
    Zou, Fengshan
    Xu, Fang
    Song, Jilai
    2019 9TH IEEE ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2019), 2019, : 659 - 664
  • [5] Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation
    Peng, Chengli
    Tian, Tian
    Chen, Chen
    Guo, Xiaojie
    Ma, Jiayi
    NEURAL NETWORKS, 2021, 137 : 188 - 199
  • [6] Bilateral network with rich semantic extractor for real-time semantic segmentation
    Zhao, Shan
    Wu, Xuan
    Tian, Kaiwen
    Yuan, Yang
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 1899 - 1916
  • [7] Bilateral network with rich semantic extractor for real-time semantic segmentation
    Shan Zhao
    Xuan Wu
    Kaiwen Tian
    Yang Yuan
    Complex & Intelligent Systems, 2024, 10 : 1899 - 1916
  • [8] BSSNet: A Real-Time Semantic Segmentation Network for Road Scenes Inspired From AutoEncoder
    Shi, Xiaoqiang
    Yin, Zhenyu
    Han, Guangjie
    Liu, Wenzhuo
    Qin, Li
    Bi, Yuanguo
    Li, Shurui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3424 - 3438
  • [9] LFFNet: lightweight feature-enhanced fusion network for real-time semantic segmentation of road scenes
    Xuegang Hu
    Jing Feng
    Juelin Gong
    Pattern Analysis and Applications, 2024, 27
  • [10] Bilateral network with dual-guided attention for real-time semantic segmentation of road scene
    Liao, Liang
    Wan, Liang
    Liu, Mingsheng
    Li, Shusheng
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)