A Real-Time Road Scene Semantic Segmentation Model Based on Spatial Context Learning

被引:0
|
作者
Xiao, Xiaomei [1 ]
Tang, Jialiang [1 ]
Lu, Xiaoyan [1 ]
Feng, Zhengyong [1 ]
Li, Yi [2 ]
机构
[1] China West Normal Univ, Elect Informat Proc Engn Technol Res Ctr, Sch Elect Informat Engn, Nanchong 637009, Peoples R China
[2] Chengdu Normal Univ, Coll Phys & Engn Technol, Chengdu 611130, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Semantics; Semantic segmentation; Accuracy; Computational modeling; Real-time systems; Training; Context modeling; Attention mechanisms; Encoding; Real-time semantic segmentation; spatial context guidance; feature attention; feature alignment; NETWORK;
D O I
10.1109/ACCESS.2024.3503676
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To address the issues of high computational complexity and insufficient aggregation of global and local information in existing image segmentation methods, this paper proposes an efficient segmentation model based on Spatial Context Learning, named SCLSeg. The main idea is to aggregate local regions into higher-level semantic regions in a learnable manner. The proposed Spatial Context Guided Feature Alignment module (SC-FA) learns aligned features from image-level to local regions, exploring and integrating contextual information. During training, a multi-scale strategy is used to group semantic regions, and a Channel Aggregation Block (CAB) is designed to dynamically capture semantic groups through a mechanism of feature separation and fusion, thereby aggregating multi-level pixel features to generate the final segmentation results. We further introduce a boundary loss to optimize the accuracy of segmentation edges. To meet real-time processing requirements, a series of lightweight strategies and simplified structures are adopted to reduce computational costs, including lightweight encoding, channel compression, and simplified neck. Our method achieves good performance on the Cityscapes and Camvid datasets, specifically achieving 76.45% mIoU & 237 FPS on the Cityscapes test set, and 73.95% mIoU & 300.4 FPS on the CamVid test set.
引用
收藏
页码:178495 / 178506
页数:12
相关论文
共 50 条
  • [11] A Survey on Real-Time Semantic Segmentation Based on Deep Learning
    Li, Binbin
    Tang, Xiangyan
    Ruan, Chengchun
    Fu, Cebin
    Tao, Zhicong
    Yang, Yue
    BIG DATA AND SECURITY, ICBDS 2023, PT I, 2024, 2099 : 51 - 62
  • [12] EACNet: Enhanced Asymmetric Convolution for Real-Time Semantic Segmentation
    Li, Yaqian
    Li, Xiaokun
    Xiao, Cunjun
    Li, Haibin
    Zhang, Wenming
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 234 - 238
  • [13] Real-time semantic segmentation based on BiSeNetV2 for wild road
    Chen, Honghuan
    Lan, Xiaoke
    JOURNAL OF INTELLIGENT SYSTEMS, 2024, 33 (01)
  • [14] Exploring Scale-Aware Features for Real-Time Semantic Segmentation of Street Scenes
    Li, Kaige
    Geng, Qichuan
    Zhou, Zhong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) : 3575 - 3587
  • [15] Real-time semantic segmentation via mutual optimization of spatial details and semantic information
    Ma, Mengyuan
    Huang, Huiling
    Han, Jun
    Feng, Yanbing
    Yang, Yi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (03) : 6821 - 6834
  • [16] MSCFNet: A Lightweight Network With Multi-Scale Context Fusion for Real-Time Semantic Segmentation
    Gao, Guangwei
    Xu, Guoan
    Yu, Yi
    Xie, Jin
    Yang, Jian
    Yue, Dong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25489 - 25499
  • [17] Real-Time Semantic Segmentation for Road Scene Based on Data Enhancement and Dual-Path Fusion Network
    Zhang Z.-W.
    Liu T.-G.
    Nie P.-J.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (07): : 1609 - 1620
  • [18] MGSeg: Multiple Granularity-Based Real-Time Semantic Segmentation Network
    He, Jun-Yan
    Liang, Shi-Hua
    Wu, Xiao
    Zhao, Bo
    Zhang, Lei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 7200 - 7214
  • [19] Real-time semantic segmentation with local spatial pixel adjustment
    Xiao, Cunjun
    Hao, Xingjun
    Li, Haibin
    Li, Yaqian
    Zhang, Wenming
    IMAGE AND VISION COMPUTING, 2022, 123
  • [20] LDPNet: A Lightweight Densely Connected Pyramid Network for Real-Time Semantic Segmentation
    Hu, Xuegang
    Jing, Liyuan
    IEEE ACCESS, 2020, 8 : 212647 - 212658