Real-Time Semantic Segmentation Algorithm for Street Scenes Based on Attention Mechanism and Feature Fusion

被引:1
|
作者
Wu, Bao [1 ]
Xiong, Xingzhong [2 ]
Wang, Yong [1 ]
机构
[1] Sichuan Univ Sci & Engn, Sch Automat & Informat Engn, Yibin 644000, Peoples R China
[2] Sichuan Univ Sci & Engn, Artificial Intelligence Key Lab Sichuan Prov, Yibin 644000, Peoples R China
关键词
semantic segmentation; feature fusion; feature extraction; pyramid pooling; complex street scenes; NETWORK;
D O I
10.3390/electronics13183699
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In computer vision, the task of semantic segmentation is crucial for applications such as autonomous driving and intelligent surveillance. However, achieving a balance between real-time performance and segmentation accuracy remains a significant challenge. Although Fast-SCNN is favored for its efficiency and low computational complexity, it still faces difficulties when handling complex street scene images. To address this issue, this paper presents an improved Fast-SCNN, aiming to enhance the accuracy and efficiency of semantic segmentation by incorporating a novel attention mechanism and an enhanced feature extraction module. Firstly, the integrated SimAM (Simple, Parameter-Free Attention Module) increases the network's sensitivity to critical regions of the image and effectively adjusts the feature space weights across channels. Additionally, the refined pyramid pooling module in the global feature extraction module captures a broader range of contextual information through refined pooling levels. During the feature fusion stage, the introduction of an enhanced DAB (Depthwise Asymmetric Bottleneck) block and SE (Squeeze-and-Excitation) attention optimizes the network's ability to process multi-scale information. Furthermore, the classifier module is extended by incorporating deeper convolutions and more complex convolutional structures, leading to a further improvement in model performance. These enhancements significantly improve the model's ability to capture details and overall segmentation performance. Experimental results demonstrate that the proposed method excels in processing complex street scene images, achieving a mean Intersection over Union (mIoU) of 71.7% and 69.4% on the Cityscapes and CamVid datasets, respectively, while maintaining inference speeds of 81.4 fps and 113.6 fps. These results indicate that the proposed model effectively improves segmentation quality in complex street scenes while ensuring real-time processing capabilities.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] A Semantic Segmentation Algorithm Based on Improved Attention Mechanism
    Chen, Chunyu
    Wu, Xinsheng
    Chen, An
    2020 INTERNATIONAL SYMPOSIUM ON AUTONOMOUS SYSTEMS (ISAS), 2020, : 244 - 248
  • [42] Research on Image Semantic Segmentation Based on Hybrid Cascade Feature Fusion and Detailed Attention Mechanism
    Du, Zuoqiang
    Liang, Yuan
    IEEE ACCESS, 2024, 12 : 62365 - 62377
  • [43] Real-Time Semantic Segmentation Network Based on Regional Self-Attention
    Bao Hailong
    Wan Min
    Liu Zhongxian
    Qin Mian
    Cui Haoyu
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (08)
  • [44] Parallel Complement Network for Real-Time Semantic Segmentation of Road Scenes
    Lv, Qingxuan
    Sun, Xin
    Chen, Changrui
    Dong, Junyu
    Zhou, Huiyu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (05) : 4432 - 4444
  • [45] DRMNet: more efficient bilateral networks for real-time semantic segmentation of road scenes
    Zhang, Wenming
    Zhang, Shaotong
    Li, Yaqian
    Li, Haibin
    Song, Tao
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (06)
  • [46] MFAFNet: A Lightweight and Efficient Network with Multi-Level Feature Adaptive Fusion for Real-Time Semantic Segmentation
    Lu, Kai
    Cheng, Jieren
    Li, Hua
    Ouyang, Tianyu
    SENSORS, 2023, 23 (14)
  • [47] Spatial-Semantic Fusion Network for Semantic Segmentation in Real-time
    Fang Yu
    Zhang Xuehe
    Zhang He
    Liu Gangfeng
    Li Changle
    Zhao Jie
    2019 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2019, : 30 - 35
  • [48] Exploring New Backbone and Attention Module for Semantic Segmentation in Street Scenes
    Fan, Lei
    Wang, Wei-Chien
    Zha, Fuyuan
    Yan, Jiapeng
    IEEE ACCESS, 2018, 6 : 71566 - 71580
  • [49] RELAXNet: Residual efficient learning and attention expected fusion network for real-time semantic segmentation
    Liu, Jin
    Xu, Xiaoqing
    Shi, Yiqing
    Deng, Cheng
    Shi, Miaohua
    NEUROCOMPUTING, 2022, 474 : 115 - 127
  • [50] A lightweight network with attention decoder for real-time semantic segmentation
    Wang, Kang
    Yang, Jinfu
    Yuan, Shuai
    Li, Mingai
    VISUAL COMPUTER, 2022, 38 (07) : 2329 - 2339