A Real-Time Road Scene Semantic Segmentation Model Based on Spatial Context Learning

Citations: 0
Authors
Xiao, Xiaomei [1]
Tang, Jialiang [1]
Lu, Xiaoyan [1]
Feng, Zhengyong [1]
Li, Yi [2]
Affiliations
[1] China West Normal Univ, Elect Informat Proc Engn Technol Res Ctr, Sch Elect Informat Engn, Nanchong 637009, Peoples R China
[2] Chengdu Normal Univ, Coll Phys & Engn Technol, Chengdu 611130, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Feature extraction; Semantics; Semantic segmentation; Accuracy; Computational modeling; Real-time systems; Training; Context modeling; Attention mechanisms; Encoding; Real-time semantic segmentation; spatial context guidance; feature attention; feature alignment; NETWORK;
DOI
10.1109/ACCESS.2024.3503676
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
To address the high computational complexity and insufficient aggregation of global and local information in existing image segmentation methods, this paper proposes an efficient segmentation model based on Spatial Context Learning, named SCLSeg. The main idea is to aggregate local regions into higher-level semantic regions in a learnable manner. The proposed Spatial Context Guided Feature Alignment module (SC-FA) learns aligned features from the image level down to local regions, exploring and integrating contextual information. During training, a multi-scale strategy is used to group semantic regions, and a Channel Aggregation Block (CAB) is designed to dynamically capture semantic groups through a mechanism of feature separation and fusion, thereby aggregating multi-level pixel features to generate the final segmentation results. We further introduce a boundary loss to improve the accuracy of segmentation edges. To meet real-time processing requirements, a series of lightweight strategies and simplified structures are adopted to reduce computational costs, including lightweight encoding, channel compression, and a simplified neck. Our method achieves good performance on the Cityscapes and CamVid datasets: 76.45% mIoU at 237 FPS on the Cityscapes test set, and 73.95% mIoU at 300.4 FPS on the CamVid test set.
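The abstract reports accuracy as mIoU (mean intersection over union). As a rough illustration of what that figure measures, the following is a minimal sketch in plain Python, not the authors' evaluation code. The function name `mean_iou`, the flat label lists, and the `ignore_index=255` default (a common convention for unlabeled pixels in Cityscapes-style annotations) are assumptions made for this example.

```python
from collections import Counter


def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over the classes that occur in pred or target.

    pred and target are flat sequences of integer class labels, one per pixel.
    Pixels whose target label equals ignore_index (an assumed convention)
    are skipped entirely.
    """
    intersection = Counter()   # pixels where prediction and label agree, per class
    pred_count = Counter()     # pixels predicted as each class
    target_count = Counter()   # pixels labeled as each class

    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            intersection[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_count[c] + target_count[c] - intersection[c]
        if union > 0:  # skip classes absent from both prediction and label
            ious.append(intersection[c] / union)

    return sum(ious) / len(ious) if ious else 0.0


# Toy example: six pixels, three classes, the last pixel is unlabeled.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 255]
score = mean_iou(pred, target, num_classes=3)
```

In this toy case the per-class IoUs are 1/2, 2/3, and 1, so `score` is their average, 13/18. Benchmark evaluations typically accumulate the counts over the whole test set before dividing, rather than averaging per-image scores.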
Pages: 178495 - 178506
Page count: 12
Related papers
50 in total
  • [31] Spatial Information Guided Convolution for Real-Time RGBD Semantic Segmentation
    Chen, Lin-Zhuo
    Lin, Zheng
    Wang, Ziqin
    Yang, Yong-Liang
    Cheng, Ming-Ming
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2313 - 2324
  • [32] LightSeg: Local Spatial Perception Convolution for Real-Time Semantic Segmentation
    Lei, Xiaochun
    Liang, Jiaming
    Gong, Zhaoting
    Jiang, Zetao
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (14):
  • [33] Spatial-Semantic Fusion Network for Semantic Segmentation in Real-time
    Fang Yu
    Zhang Xuehe
    Zhang He
    Liu Gangfeng
    Li Changle
    Zhao Jie
    [J]. 2019 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2019 : 30 - 35
  • [34] Adjacent Feature Propagation Network (AFPNet) for Real-Time Semantic Segmentation
    Hyun, Junhyuk
    Seong, Hongje
    Kim, Sangki
    Kim, Euntai
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (09) : 5877 - 5888
  • [35] Lightweight Real-Time Semantic Segmentation Network With Efficient Transformer and CNN
    Xu, Guoan
    Li, Juncheng
    Gao, Guangwei
    Lu, Huimin
    Yang, Jian
    Yue, Dong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 15897 - 15906
  • [36] RTONet: Real-Time Occupancy Network for Semantic Scene Completion
    Lai, Quan
    Zheng, Haifeng
    Feng, Xinxin
    Zheng, Mingkui
    Chen, Huacong
    Chen, Wenqiang
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (10) : 8370 - 8377
  • [37] Road Scene Segmentation Based on Deep Learning
    Zheng, Ke
    Naji, Hasan Abdullah Hasan
    [J]. IEEE ACCESS, 2020, 8 : 140964 - 140971
  • [38] BiDNet: A Real-Time Semantic Segmentation Network With Antifeature Interference and Detail Recovery for Industrial Defects
    Pan, Jiawei
    Zeng, Deyu
    Wu, Zongze
    Xie, Shengli
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [39] Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
    Ye, Baofeng
    Xue, Renzheng
    [J]. IEEE ACCESS, 2025, 13 : 588 - 595
  • [40] FRNet: Factorized and Regular Blocks Network for Semantic Segmentation in Road Scene
    Lu, Mengxu
    Chen, Zhenxue
    Wu, Q. M. Jonathan
    Wang, Nannan
    Rong, Xuewen
    Yan, Xinghe
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (04) : 3522 - 3530