A Real-Time Road Scene Semantic Segmentation Model Based on Spatial Context Learning

被引:0
作者
Xiao, Xiaomei [1 ]
Tang, Jialiang [1 ]
Lu, Xiaoyan [1 ]
Feng, Zhengyong [1 ]
Li, Yi [2 ]
机构
[1] China West Normal Univ, Elect Informat Proc Engn Technol Res Ctr, Sch Elect Informat Engn, Nanchong 637009, Peoples R China
[2] Chengdu Normal Univ, Coll Phys & Engn Technol, Chengdu 611130, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Semantics; Semantic segmentation; Accuracy; Computational modeling; Real-time systems; Training; Context modeling; Attention mechanisms; Encoding; Real-time semantic segmentation; spatial context guidance; feature attention; feature alignment; NETWORK;
D O I
10.1109/ACCESS.2024.3503676
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To address the issues of high computational complexity and insufficient aggregation of global and local information in existing image segmentation methods, this paper proposes an efficient segmentation model based on Spatial Context Learning, named SCLSeg. The main idea is to aggregate local regions into higher-level semantic regions in a learnable manner. The proposed Spatial Context Guided Feature Alignment module (SC-FA) learns aligned features from image-level to local regions, exploring and integrating contextual information. During training, a multi-scale strategy is used to group semantic regions, and a Channel Aggregation Block (CAB) is designed to dynamically capture semantic groups through a mechanism of feature separation and fusion, thereby aggregating multi-level pixel features to generate the final segmentation results. We further introduce a boundary loss to optimize the accuracy of segmentation edges. To meet real-time processing requirements, a series of lightweight strategies and simplified structures are adopted to reduce computational costs, including lightweight encoding, channel compression, and simplified neck. Our method achieves good performance on the Cityscapes and Camvid datasets, specifically achieving 76.45% mIoU & 237 FPS on the Cityscapes test set, and 73.95% mIoU & 300.4 FPS on the CamVid test set.
引用
收藏
页码:178495 / 178506
页数:12
相关论文
共 50 条
  • [21] HoloParser: Holistic Visual Parsing for Real-Time Semantic Segmentation in Autonomous Driving
    Li, Shu
    Yan, Qingqing
    Shi, Wenbo
    Wang, Liuyi
    Liu, Chengju
    Chen, Qijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [22] Tripartite real-time semantic segmentation network with scene commonality
    Wang, Chenyang
    Wang, Chuanxu
    Liu, Peng
    Zhang, Zhe
    Lin, Guocheng
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (02)
  • [23] Lightweight Asymmetric Dilation Network for Real-Time Semantic Segmentation
    Hu, Xuegang
    Gong, Yu
    IEEE ACCESS, 2021, 9 : 55630 - 55643
  • [24] Dual Context Network for real-time semantic segmentation
    Hong Yin
    Wenbin Xie
    Jingjing Zhang
    Yuanfa Zhang
    Weixing Zhu
    Jie Gao
    Yan Shao
    Yajun Li
    Machine Vision and Applications, 2023, 34
  • [25] Real-time Semantic Segmentation with Context Aggregation Network
    Yang, Michael Ying
    Kumaar, Saumya
    Lyu, Ye
    Nex, Francesco
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 178 : 124 - 134
  • [26] RoboSeg: Real-Time Semantic Segmentation on Computationally Constrained Robots
    Yan, Qingqing
    Li, Shu
    Liu, Chengju
    Liu, Ming
    Chen, Qijun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (03): : 1567 - 1577
  • [27] Dual Context Network for real-time semantic segmentation
    Yin, Hong
    Xie, Wenbin
    Zhang, Jingjing
    Zhang, Yuanfa
    Zhu, Weixing
    Gao, Jie
    Shao, Yan
    Li, Yajun
    MACHINE VISION AND APPLICATIONS, 2023, 34 (02)
  • [28] Region-Enhanced Feature Learning for Scene Semantic Segmentation
    Kang, Xin
    Wang, Chaoqun
    Chen, Xuejin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 954 - 964
  • [29] BiAttnNet: Bilateral Attention for Improving Real-Time Semantic Segmentation
    Li, Genling
    Li, Liang
    Zhang, Jiawan
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 46 - 50
  • [30] MSSINet: Real-Time Segmentation Based on Multi-Scale Strip Integration
    Wang, Lin
    Zhu, Fenghua
    Zhang, Hui
    Xiong, Gang
    Huang, Yunhu
    Chen, Dewang
    IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2024, 8 : 241 - 251