Real-Time High-Performance Semantic Image Segmentation of Urban Street Scenes

被引：84

作者：

Dong, Genshun ^{[1
]}

Yan, Yan ^{[1
]}

Shen, Chunhua ^{[2
]}

Wang, Hanzi ^{[1
]}

机构：

[1] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart City, Xiamen 361005, Peoples R China

[2] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2021年 / 22卷 / 06期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Semantics; Real-time systems; Image segmentation; Convolution; Intelligent transportation systems; Task analysis; Computational modeling; Intelligent vehicles; street scene understanding; deep learning; real-time semantic image segmentation; light-weight convolutional neural networks; OBJECT RECOGNITION;

D O I：

10.1109/TITS.2020.2980426

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Deep Convolutional Neural Networks (DCNNs) have recently shown outstanding performance in semantic image segmentation. However, state-of-the-art DCNN-based semantic segmentation methods usually suffer from high computational complexity due to the use of complex network architectures. This greatly limits their applications in the real-world scenarios that require real-time processing. In this paper, we propose a real-time high-performance DCNN-based method for robust semantic segmentation of urban street scenes, which achieves a good trade-off between accuracy and speed. Specifically, a Lightweight Baseline Network with Atrous convolution and Attention (LBN-AA) is firstly used as our baseline network to efficiently obtain dense feature maps. Then, the Distinctive Atrous Spatial Pyramid Pooling (DASPP), which exploits the different sizes of pooling operations to encode the rich and distinctive semantic information, is developed to detect objects at multiple scales. Meanwhile, a Spatial detail-Preserving Network (SPN) with shallow convolutional layers is designed to generate high-resolution feature maps preserving the detailed spatial information. Finally, a simple but practical Feature Fusion Network (FFN) is used to effectively combine both deep and shallow features from the semantic branch (DASPP) and the spatial branch (SPN), respectively. Extensive experimental results show that the proposed method respectively achieves the accuracy of 73.6% and 68.0% mean Intersection over Union (mIoU) at the inference speeds of 51.0 fps and 39.3 fps on the challenging Cityscapes and CamVid test datasets (by only using a single NVIDIA TITAN X card). This demonstrates that the proposed method offers excellent performance at the real-time speed for semantic segmentation of urban street scenes.

引用

页码：3258 / 3274

页数：17

共 50 条

[31] DRMNet: more efficient bilateral networks for real-time semantic segmentation of road scenes
Zhang, Wenming
Zhang, Shaotong
Li, Yaqian
Li, Haibin
Song, Tao
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (06)
[32] EFRNet: Edge feature refinement network for real-time semantic segmentation of driving scenes
Hou, Zhiqiang
Qu, Minjie
Cheng, Minjie
Ma, Sugang
Wang, Yunchen
Yang, Xiaobao
DIGITAL SIGNAL PROCESSING, 2025, 156
[33] Triple-Branch Asymmetric Network for Real-time Semantic Segmentation of Road Scenes
Yazhi Zhang
Xuguang Zhang
Hui Yu
Instrumentation, 2024, 11 (02) : 72 - 82
[34] BSSNet: A Real-Time Semantic Segmentation Network for Road Scenes Inspired From AutoEncoder
Shi, Xiaoqiang
Yin, Zhenyu
Han, Guangjie
Liu, Wenzhuo
Qin, Li
Bi, Yuanguo
Li, Shurui
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3424 - 3438
[35] An open-source project for real-time image semantic segmentation
Quan ZHOU
Yu WANG
Jia LIU
Xin JIN
Longin Jan LATECKI
Science China(Information Sciences), 2019, 62 (12) : 246 - 247
[36] Real-Time Semantic Clothing Segmentation
Cushen, George. A.
Nixon, Mark. S.
ADVANCES IN VISUAL COMPUTING, ISVC 2012, PT I, 2012, 7431 : 272 - 281
[37] An open-source project for real-time image semantic segmentation
Quan Zhou
Yu Wang
Jia Liu
Xin Jin
Longin Jan Latecki
Science China Information Sciences, 2019, 62
[38] A Real-Time Image Semantic Segmentation Method Based on Multilabel Classification
Jin, Ran
Han, Xiaozhen
Yu, Tongrui
MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
[39] DESIGN of a spaceborne high-performance and real-time image processing platform
Pan Zheng
Feng Xingtai
Peng Chengxiang
INTERNATIONAL CONFERENCE ON OPTICAL AND PHOTONIC ENGINEERING, ICOPEN 2022, 2022, 12550
[40] An open-source project for real-time image semantic segmentation
Zhou, Quan
Wang, Yu
Liu, Jia
Jin, Xin
Latecki, Longin Jan
SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (12)

← 1 2 3 4 5 →