LDMSNet: Lightweight Dual-Branch Multi-Scale Network for Real-Time Semantic Segmentation of Autonomous Driving

Cited by: 0

Authors:

Yang, Haoran [1]

Zhang, Dan [1]

Liu, Jiazai [1]

Cao, Zekun [2]

Wang, Na [3]

Affiliations:

[1] Nanjing Forestry Univ, Coll Informat Sci & Technol & Artificial Intellige, Nanjing 210037, Peoples R China

[2] Taiyuan Univ Technol, Coll Safety & Emergency Management Engn, Taiyuan 30024, Peoples R China

[3] Shanghai Jiao Tong Univ, Shanghai Ctr Syst Biomed, Key Lab Syst Biomed, Minist Educ, Shanghai 200240, Peoples R China

Source:

INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY | 2024

Keywords:

Semantic segmentation; Autonomous driving; Attention mechanism; Multi-scale feature; Feature fusion; FUSION;

DOI:

10.1007/s12239-024-00179-4

Chinese Library Classification (CLC) number:

TH [Machinery and instrument industry];

Discipline classification code:

0802;

Abstract:

Semantic segmentation plays a crucial role in autonomous driving systems, serving as a key technology for understanding and interpreting the road environment. Most existing semantic segmentation networks strive for high accuracy, but achieving true real-time performance while maintaining that accuracy remains a challenge. Autonomous driving systems, however, require extremely fast reaction and real-time processing, and any processing delay may create safety risks. To address this problem, this paper proposes a lightweight dual-branch multi-scale network (LDMSNet) for real-time semantic segmentation. First, the effective dilated bottleneck (EDB) is proposed to efficiently extract semantic and spatial information using a complementary dual-branch structure and depth-wise dilated convolution. Second, the multi-scale pyramid pooling module (MSPPM) is proposed, which uses a hierarchical residual structure combined with dilated convolution to extract detailed information from low-resolution branches. Third, the polarized self-attention mechanism (PSA) is introduced to further enhance the interaction and correlation between features and to improve the perception of global information. The experimental results show that LDMSNet achieves 74.46% mIoU at 113 FPS on the Cityscapes dataset, 71.51% mIoU at 153 FPS on the CamVid dataset, and 77.41% mIoU at 170 FPS on the StreetView dataset, effectively balancing speed and accuracy compared with state-of-the-art models.
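The accuracy figures in the abstract are reported as mIoU (mean intersection over union). As a rough illustration of how this metric is commonly computed, the following is a minimal sketch that derives it from a per-class confusion matrix. The function name `mean_iou`, the convention of skipping classes with an empty union, and the toy counts are assumptions for illustration; they are not taken from the paper or its evaluation code.

```python
def mean_iou(confusion):
    """Mean intersection over union from a square confusion matrix.

    confusion[i][j] counts pixels whose true class is i and whose
    predicted class is j. Classes that appear in neither the ground
    truth nor the prediction (empty union) are skipped; this is an
    assumed convention, and benchmarks may handle it differently.
    """
    num_classes = len(confusion)
    ious = []
    for c in range(num_classes):
        tp = confusion[c][c]                      # correctly labelled pixels
        fn = sum(confusion[c]) - tp               # true class c, predicted otherwise
        fp = sum(confusion[r][c] for r in range(num_classes)) - tp
        union = tp + fp + fn
        if union > 0:
            ious.append(tp / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy 3-class example with made-up counts (not results from the paper).
toy = [
    [50, 2, 3],
    [4, 40, 1],
    [1, 5, 30],
]
print(f"mIoU = {mean_iou(toy):.4f}")
```

In practice the confusion matrix is accumulated over every pixel of the evaluation set before the per-class ratios are averaged, so a single score summarizes all classes with equal weight.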


Pages: 577-591

Page count: 15

References (39 in total):

[1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[2] Segmentation and Recognition Using Structure from Motion Point Clouds
Brostow, Gabriel J.
Shotton, Jamie
Fauqueur, Julien
Cipolla, Roberto
[J]. COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 : 44 - +
[3] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016 : 3213 - 3223
[4] Improved 3D Semantic Segmentation Model Based on RGB Image and LiDAR Point Cloud Fusion for Automantic Driving
Du, Jiahao
Huang, Xiaoci
Xing, Mengyang
Zhang, Tao
[J]. INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2023, 24 (03) : 787 - 797
[5] MLFNet: Multi-Level Fusion Network for Real-Time Semantic Segmentation of Autonomous Driving
Fan, Jiaqi
Wang, Fei
Chu, Hongqing
Hu, Xiao
Cheng, Yifan
Gao, Bingzhao
[J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01) : 756 - 767
[6] MSCFNet: A Lightweight Network With Multi-Scale Context Fusion for Real-Time Semantic Segmentation
Gao, Guangwei
Xu, Guoan
Yu, Yi
Xie, Jin
Yang, Jian
Yue, Dong
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25489 - 25499
[7] Multi-Dimensional Pruning: A Unified Framework for Model Compression
Guo, Jinyang
Ouyang, Wanli
Xu, Dong
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020 : 1505 - 1514
[8] GhostNet: More Features from Cheap Operations
Han, Kai
Wang, Yunhe
Tian, Qi
Guo, Jianyuan
Xu, Chunjing
Xu, Chang
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020 : 1577 - 1586
[9] MGSeg: Multiple Granularity-Based Real-Time Semantic Segmentation Network
He, Jun-Yan
Liang, Shi-Hua
Wu, Xiao
Zhao, Bo
Zhang, Lei
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 7200 - 7214
[10] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016 : 770 - 778
