LDMSNet: Lightweight Dual-Branch Multi-Scale Network for Real-Time Semantic Segmentation of Autonomous Driving

被引:0
作者
Yang, Haoran [1 ]
Zhang, Dan [1 ]
Liu, Jiazai [1 ]
Cao, Zekun [2 ]
Wang, Na [3 ]
机构
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol & Artificial Intellige, Nanjing 210037, Peoples R China
[2] Taiyuan Univ Technol, Coll Safety & Emergency Management Engn, Taiyuan 30024, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai Ctr Syst Biomed, Key Lab Syst Biomed, Minist Educ, Shanghai 200240, Peoples R China
关键词
Semantic segmentation; Autonomous driving; Attention mechanism; Multi-scale feature; Feature fusion; FUSION;
D O I
10.1007/s12239-024-00179-4
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Semantic segmentation plays a crucial role in autonomous driving systems, serving as a key technology for understanding and interpreting the road environment. Most existing semantic segmentation networks strive for high accuracy, but achieving true real-time performance while maintaining high accuracy remains a challenge. However, autonomous driving systems require extremely high reaction speed and real-time processing capabilities, and any processing delay may lead to safety risks. To solve this problem, this paper proposes a lightweight dual-branch multi-scale network (LDMSNet) to achieve real-time semantic segmentation. First, the effective dilated bottleneck (EDB) is proposed to efficiently extract semantic information and spatial information using complementary dual-branch structure and depth-wise dilated convolution. Second, the multi-scale pyramid pooling module (MSPPM) is proposed, which uses a hierarchical residual structure and combines with dilated convolution to extract detailed information from low-resolution branches. Third, the polarized self-attention mechanism (PSA) is introduced to further enhance the interaction and correlation between features and improve the ability to perceive global information. The experimental results show that LDMSNet achieves 74.46% MIoU at 113FPS on the Cityscapes dataset, 71.51% MloU at 153FPS on the CamVid dataset and 77.41% MIoU at 170FPS on the StreetView dataset, effectively balancing speed and accuracy compared to state-of-the-art models.
引用
收藏
页码:577 / 591
页数:15
相关论文
共 39 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] Segmentation and Recognition Using Structure from Motion Point Clouds
    Brostow, Gabriel J.
    Shotton, Jamie
    Fauqueur, Julien
    Cipolla, Roberto
    [J]. COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 : 44 - +
  • [3] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [4] Improved 3D Semantic Segmentation Model Based on RGB Image and LiDAR Point Cloud Fusion for Automantic Driving
    Du, Jiahao
    Huang, Xiaoci
    Xing, Mengyang
    Zhang, Tao
    [J]. INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2023, 24 (03) : 787 - 797
  • [5] MLFNet: Multi-Level Fusion Network for Real-Time Semantic Segmentation of Autonomous Driving
    Fan, Jiaqi
    Wang, Fei
    Chu, Hongqing
    Hu, Xiao
    Cheng, Yifan
    Gao, Bingzhao
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 756 - 767
  • [6] MSCFNet: A Lightweight Network With Multi-Scale Context Fusion for Real-Time Semantic Segmentation
    Gao, Guangwei
    Xu, Guoan
    Yu, Yi
    Xie, Jin
    Yang, Jian
    Yue, Dong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25489 - 25499
  • [7] Multi-Dimensional Pruning: A Unified Framework for Model Compression
    Guo, Jinyang
    Ouyang, Wanli
    Xu, Dong
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1505 - 1514
  • [8] GhostNet: More Features from Cheap Operations
    Han, Kai
    Wang, Yunhe
    Tian, Qi
    Guo, Jianyuan
    Xu, Chunjing
    Xu, Chang
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1577 - 1586
  • [9] MGSeg: Multiple Granularity-Based Real-Time Semantic Segmentation Network
    He, Jun-Yan
    Liang, Shi-Hua
    Wu, Xiao
    Zhao, Bo
    Zhang, Lei
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 7200 - 7214
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778