Real-Time Semantic Segmentation Algorithm for Street Scenes Based on Attention Mechanism and Feature Fusion

被引:1
|
作者
Wu, Bao [1 ]
Xiong, Xingzhong [2 ]
Wang, Yong [1 ]
机构
[1] Sichuan Univ Sci & Engn, Sch Automat & Informat Engn, Yibin 644000, Peoples R China
[2] Sichuan Univ Sci & Engn, Artificial Intelligence Key Lab Sichuan Prov, Yibin 644000, Peoples R China
关键词
semantic segmentation; feature fusion; feature extraction; pyramid pooling; complex street scenes; NETWORK;
D O I
10.3390/electronics13183699
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In computer vision, the task of semantic segmentation is crucial for applications such as autonomous driving and intelligent surveillance. However, achieving a balance between real-time performance and segmentation accuracy remains a significant challenge. Although Fast-SCNN is favored for its efficiency and low computational complexity, it still faces difficulties when handling complex street scene images. To address this issue, this paper presents an improved Fast-SCNN, aiming to enhance the accuracy and efficiency of semantic segmentation by incorporating a novel attention mechanism and an enhanced feature extraction module. Firstly, the integrated SimAM (Simple, Parameter-Free Attention Module) increases the network's sensitivity to critical regions of the image and effectively adjusts the feature space weights across channels. Additionally, the refined pyramid pooling module in the global feature extraction module captures a broader range of contextual information through refined pooling levels. During the feature fusion stage, the introduction of an enhanced DAB (Depthwise Asymmetric Bottleneck) block and SE (Squeeze-and-Excitation) attention optimizes the network's ability to process multi-scale information. Furthermore, the classifier module is extended by incorporating deeper convolutions and more complex convolutional structures, leading to a further improvement in model performance. These enhancements significantly improve the model's ability to capture details and overall segmentation performance. Experimental results demonstrate that the proposed method excels in processing complex street scene images, achieving a mean Intersection over Union (mIoU) of 71.7% and 69.4% on the Cityscapes and CamVid datasets, respectively, while maintaining inference speeds of 81.4 fps and 113.6 fps. These results indicate that the proposed model effectively improves segmentation quality in complex street scenes while ensuring real-time processing capabilities.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Real-time efficient semantic segmentation network based on improved ASPP and parallel fusion module in complex scenes
    Ding, Peng
    Qian, Huaming
    Zhou, Yipeng
    Yan, Shuya
    Feng, Shibao
    Yu, Shuang
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (03)
  • [32] A hybrid attention multi-scale fusion network for real-time semantic segmentation
    Ye, Baofeng
    Xue, Renzheng
    Wu, Qianlong
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [33] LBCNet: A lightweight bilateral cascaded feature fusion network for real-time semantic segmentation
    Song, Yuqin
    Shang, Chunliang
    Zhao, Jitao
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (06) : 7293 - 7315
  • [34] DFFNet: An IoT-perceptive dual feature fusion network for general real-time semantic segmentation
    Tang, Xiangyan
    Tu, Wenxuan
    Li, Keqiu
    Cheng, Jieren
    INFORMATION SCIENCES, 2021, 565 : 326 - 343
  • [35] LBCNet: A lightweight bilateral cascaded feature fusion network for real-time semantic segmentation
    Yuqin Song
    Chunliang Shang
    Jitao Zhao
    The Journal of Supercomputing, 2024, 80 (6) : 7293 - 7315
  • [36] Feature Fusion Network Based on Hybrid Attention for Semantic Segmentation
    Xie Xinchen
    Li, Chen
    Tian, Lihua
    2022 IEEE WORLD AI IOT CONGRESS (AIIOT), 2022, : 9 - 14
  • [37] Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation
    Peng, Chengli
    Tian, Tian
    Chen, Chen
    Guo, Xiaojie
    Ma, Jiayi
    NEURAL NETWORKS, 2021, 137 : 188 - 199
  • [38] Semantic Segmentation of Remote-Sensing Images Based on Multiscale Feature Fusion and Attention Refinement
    He, Xin
    Zhou, Yong
    Zhao, Jiaqi
    Zhang, Man
    Yao, Rui
    Liu, Bing
    Li, Haichao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [39] Real-time power line segmentation detection based on multi-attention with strong semantic feature extractor
    Zhao, Qian
    Ji, Tangyu
    Liang, Shuang
    Yu, Wentao
    Yan, Chao
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (06)
  • [40] Real-time power line segmentation detection based on multi-attention with strong semantic feature extractor
    Qian Zhao
    Tangyu Ji
    Shuang Liang
    WenTao Yu
    Chao Yan
    Journal of Real-Time Image Processing, 2023, 20