Real-Time Semantic Segmentation Algorithm for Street Scenes Based on Attention Mechanism and Feature Fusion

Cited by: 1
Authors
Wu, Bao [1]
Xiong, Xingzhong [2]
Wang, Yong [1]
Affiliations
[1] Sichuan Univ Sci & Engn, Sch Automat & Informat Engn, Yibin 644000, Peoples R China
[2] Sichuan Univ Sci & Engn, Artificial Intelligence Key Lab Sichuan Prov, Yibin 644000, Peoples R China
Keywords
semantic segmentation; feature fusion; feature extraction; pyramid pooling; complex street scenes; network
DOI
10.3390/electronics13183699
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In computer vision, the task of semantic segmentation is crucial for applications such as autonomous driving and intelligent surveillance. However, achieving a balance between real-time performance and segmentation accuracy remains a significant challenge. Although Fast-SCNN is favored for its efficiency and low computational complexity, it still faces difficulties when handling complex street scene images. To address this issue, this paper presents an improved Fast-SCNN, aiming to enhance the accuracy and efficiency of semantic segmentation by incorporating a novel attention mechanism and an enhanced feature extraction module. Firstly, the integrated SimAM (Simple, Parameter-Free Attention Module) increases the network's sensitivity to critical regions of the image and effectively adjusts the feature space weights across channels. Additionally, the refined pyramid pooling module in the global feature extraction module captures a broader range of contextual information through refined pooling levels. During the feature fusion stage, the introduction of an enhanced DAB (Depthwise Asymmetric Bottleneck) block and SE (Squeeze-and-Excitation) attention optimizes the network's ability to process multi-scale information. Furthermore, the classifier module is extended by incorporating deeper convolutions and more complex convolutional structures, leading to a further improvement in model performance. These enhancements significantly improve the model's ability to capture details and overall segmentation performance. Experimental results demonstrate that the proposed method excels in processing complex street scene images, achieving a mean Intersection over Union (mIoU) of 71.7% and 69.4% on the Cityscapes and CamVid datasets, respectively, while maintaining inference speeds of 81.4 fps and 113.6 fps. These results indicate that the proposed model effectively improves segmentation quality in complex street scenes while ensuring real-time processing capabilities.
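The abstract reports accuracy as mean Intersection over Union (mIoU). As a minimal sketch of how that metric is commonly computed, and not the authors' evaluation code, the example below builds a confusion matrix from flattened label lists and averages the per-class IoU. The function name `mean_iou`, the toy labels, and the `ignore_index` value of 255 (a common convention for unlabeled pixels) are assumptions made for illustration.

```python
from typing import List, Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int,
             ignore_index: int = 255) -> float:
    """Mean IoU over the classes that occur in the prediction or ground truth.

    `ignore_index` is an assumed convention for unlabeled pixels.
    """
    # confusion[t][p] counts pixels with ground-truth class t predicted as p.
    confusion = [[0] * num_classes for _ in range(num_classes)]
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # unlabeled pixels do not contribute to the score
        confusion[t][p] += 1

    ious: List[float] = []
    for c in range(num_classes):
        tp = confusion[c][c]                      # correctly labeled as c
        fn = sum(confusion[c]) - tp               # class c pixels missed
        fp = sum(confusion[t][c] for t in range(num_classes)) - tp
        union = tp + fp + fn
        if union > 0:                             # skip classes that never occur
            ious.append(tp / union)
    return sum(ious) / len(ious) if ious else 0.0


# Hypothetical toy labels: six pixels, three classes, one ignored pixel.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 255]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.4f}")
```

On a full benchmark the confusion matrix would normally be accumulated over every image in the validation set before the per-class IoU values are averaged, rather than averaged image by image.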
Pages: 19
Related papers
50 in total
  • [41] Real-time semantic segmentation network based on parallel atrous convolution for short-term dense concatenate and attention feature fusion
    Wu, Lijun
    Qiu, Shangdong
    Chen, Zhicong
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (03)
  • [42] RTSNet: Real-Time Semantic Segmentation Network For Outdoor Scenes
    Ma, Mingyu
    Zou, Fengshan
    Xu, Fang
    Song, Jilai
    2019 9TH IEEE ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2019), 2019: 659-664
  • [43] A Real-Time Semantic Segmentation Approach for Autonomous Driving Scenes
    Qin, Feiwei
    Shen, Xiyue
    Peng, Yong
    Shao, Yanli
    Yuan, Wenqiang
    Ji, Zhongping
    Bai, Jing
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (07): 1026-1037
  • [44] Attention based lightweight asymmetric network for real-time semantic segmentation
    Liu, Qian
    Wang, Cunbao
    Li, Zhensheng
    Qi, Youwei
    Fang, Jiongtao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130
  • [45] A Semantic Segmentation Method of Remote Sensing Image Based on Feature Fusion and Attention Mechanism
    Wang, Yiqin
    Dong, Yunyun
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2024, 20 (05): 640-653
  • [46] LCFNet: Loss Compensation Fusion Network for Real-time Semantic Segmentation of Urban Road Scenes
    Yang, Lu
    Bai, Yiwen
    Ren, Fenglei
    Zhang, Shiyu
    Bi, Chongke
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023: 347-354
  • [47] ESNET: EDGE-BASED SEGMENTATION NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION IN TRAFFIC SCENES
    Lyu, Haoran
    Fu, Huiyuan
    Hu, Xiaojun
    Liu, Liang
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019: 1855-1859
  • [48] ENTROPY-BASED FEATURE EXTRACTION FOR REAL-TIME SEMANTIC SEGMENTATION
    Abrahamyan, Lusine
    Deligiannis, Nikos
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022: 591-595
  • [49] Stripe Pooling Attention for Real-Time Semantic Segmentation
    Lyu J.
    Sun Y.
    Xu P.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (09): 1395-1404
  • [50] A hybrid attention multi-scale fusion network for real-time semantic segmentation
    Ye, Baofeng
    Xue, Renzheng
    Wu, Qianlong
    SCIENTIFIC REPORTS, 2025, 15 (01)