Real-Time High-Performance Semantic Image Segmentation of Urban Street Scenes

被引:84
|
作者
Dong, Genshun [1 ]
Yan, Yan [1 ]
Shen, Chunhua [2 ]
Wang, Hanzi [1 ]
机构
[1] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart City, Xiamen 361005, Peoples R China
[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Semantics; Real-time systems; Image segmentation; Convolution; Intelligent transportation systems; Task analysis; Computational modeling; Intelligent vehicles; street scene understanding; deep learning; real-time semantic image segmentation; light-weight convolutional neural networks; OBJECT RECOGNITION;
D O I
10.1109/TITS.2020.2980426
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Deep Convolutional Neural Networks (DCNNs) have recently shown outstanding performance in semantic image segmentation. However, state-of-the-art DCNN-based semantic segmentation methods usually suffer from high computational complexity due to the use of complex network architectures. This greatly limits their applications in the real-world scenarios that require real-time processing. In this paper, we propose a real-time high-performance DCNN-based method for robust semantic segmentation of urban street scenes, which achieves a good trade-off between accuracy and speed. Specifically, a Lightweight Baseline Network with Atrous convolution and Attention (LBN-AA) is firstly used as our baseline network to efficiently obtain dense feature maps. Then, the Distinctive Atrous Spatial Pyramid Pooling (DASPP), which exploits the different sizes of pooling operations to encode the rich and distinctive semantic information, is developed to detect objects at multiple scales. Meanwhile, a Spatial detail-Preserving Network (SPN) with shallow convolutional layers is designed to generate high-resolution feature maps preserving the detailed spatial information. Finally, a simple but practical Feature Fusion Network (FFN) is used to effectively combine both deep and shallow features from the semantic branch (DASPP) and the spatial branch (SPN), respectively. Extensive experimental results show that the proposed method respectively achieves the accuracy of 73.6% and 68.0% mean Intersection over Union (mIoU) at the inference speeds of 51.0 fps and 39.3 fps on the challenging Cityscapes and CamVid test datasets (by only using a single NVIDIA TITAN X card). This demonstrates that the proposed method offers excellent performance at the real-time speed for semantic segmentation of urban street scenes.
引用
收藏
页码:3258 / 3274
页数:17
相关论文
共 50 条
  • [31] DRMNet: more efficient bilateral networks for real-time semantic segmentation of road scenes
    Zhang, Wenming
    Zhang, Shaotong
    Li, Yaqian
    Li, Haibin
    Song, Tao
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (06)
  • [32] EFRNet: Edge feature refinement network for real-time semantic segmentation of driving scenes
    Hou, Zhiqiang
    Qu, Minjie
    Cheng, Minjie
    Ma, Sugang
    Wang, Yunchen
    Yang, Xiaobao
    DIGITAL SIGNAL PROCESSING, 2025, 156
  • [33] Triple-Branch Asymmetric Network for Real-time Semantic Segmentation of Road Scenes
    Yazhi Zhang
    Xuguang Zhang
    Hui Yu
    Instrumentation, 2024, 11 (02) : 72 - 82
  • [34] BSSNet: A Real-Time Semantic Segmentation Network for Road Scenes Inspired From AutoEncoder
    Shi, Xiaoqiang
    Yin, Zhenyu
    Han, Guangjie
    Liu, Wenzhuo
    Qin, Li
    Bi, Yuanguo
    Li, Shurui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3424 - 3438
  • [35] An open-source project for real-time image semantic segmentation
    Quan ZHOU
    Yu WANG
    Jia LIU
    Xin JIN
    Longin Jan LATECKI
    Science China(Information Sciences), 2019, 62 (12) : 246 - 247
  • [36] Real-Time Semantic Clothing Segmentation
    Cushen, George. A.
    Nixon, Mark. S.
    ADVANCES IN VISUAL COMPUTING, ISVC 2012, PT I, 2012, 7431 : 272 - 281
  • [37] An open-source project for real-time image semantic segmentation
    Quan Zhou
    Yu Wang
    Jia Liu
    Xin Jin
    Longin Jan Latecki
    Science China Information Sciences, 2019, 62
  • [38] A Real-Time Image Semantic Segmentation Method Based on Multilabel Classification
    Jin, Ran
    Han, Xiaozhen
    Yu, Tongrui
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [39] DESIGN of a spaceborne high-performance and real-time image processing platform
    Pan Zheng
    Feng Xingtai
    Peng Chengxiang
    INTERNATIONAL CONFERENCE ON OPTICAL AND PHOTONIC ENGINEERING, ICOPEN 2022, 2022, 12550
  • [40] An open-source project for real-time image semantic segmentation
    Zhou, Quan
    Wang, Yu
    Liu, Jia
    Jin, Xin
    Latecki, Longin Jan
    SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (12)