Small Object Augmentation of Urban Scenes for Real-Time Semantic Segmentation

被引：54

作者：

Yang, Zhengeng ^{[1
,2
,3
]}

Yu, Hongshan ^{[1
,2
]}

Feng, Mingtao ^{[1
,2
]}

Sun, Wei ^{[1
,2
]}

Lin, Xuefei ^{[4
]}

Sun, Mingui ^{[3
,5
,6
]}

Mao, Zhi-Hong ^{[5
,6
]}

Mian, Ajmal ^{[7
]}

机构：

[1] Hunan Univ, Natl Engn Lab Robot Visual Percept & Control Tech, Coll Elect & Informat Engn, Changsha 410082, Hunan, Peoples R China

[2] Hunan Univ, Shenzhen Inst, Shenzhen 518057, Peoples R China

[3] Univ Pittsburgh, Dept Neurol Surg, Pittsburgh, PA 15260 USA

[4] Hunan Agr Univ, Dept Art, Changsha 410128, Peoples R China

[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA

[6] Univ Pittsburgh, Dept Bioengn, Pittsburgh, PA 15260 USA

[7] Univ Western Australia, Dept Comp Sci, Perth, WA 6009, Australia

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2020年 / 29卷

基金：

中国国家自然科学基金; 湖南省自然科学基金; 美国国家卫生研究院;

关键词：

Semantic segmentation; scene understanding; autonomous driving; synthetic dataset; FEATURES; NETWORK;

D O I：

10.1109/TIP.2020.2976856

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semantic segmentation is a key step in scene understanding for autonomous driving. Although deep learning has significantly improved the segmentation accuracy, current high-quality models such as PSPNet and DeepLabV3 are inefficient given their complex architectures and reliance on multi-scale inputs. Thus, it is difficult to apply them to real-time or practical applications. On the other hand, existing real-time methods cannot yet produce satisfactory results on small objects such as traffic lights, which are imperative to safe autonomous driving. In this paper, we improve the performance of real-time semantic segmentation from two perspectives, methodology and data. Specifically, we propose a real-time segmentation model coined Narrow Deep Network (NDNet) and build a synthetic dataset by inserting additional small objects into the training images. The proposed method achieves 65.7% mean intersection over union (mIoU) on the Cityscapes test set with only 8.4G floating-point operations (FLOPs) on $1024\times 2048$ inputs. Furthermore, by re-training the existing PSPNet and DeepLabV3 models on our synthetic dataset, we obtained an average 2% mIoU improvement on small objects.

引用

页码：5175 / 5190

页数：16

共 50 条

[31] Block attention network: A lightweight deep network for real-time semantic segmentation of road scenes in resource-constrained devices
Mazhar, Saquib
Atif, Nadeem
Bhuyan, M. K.
Ahamed, Shaik Rafi
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
[32] Paddy field object detection for robotic combine based on real-time semantic segmentation algorithm
Zhu, Jiajun
Iida, Michihisa
Chen, Sikai
Cheng, Shijing
Suguri, Masahiko
Masuda, Ryohei
JOURNAL OF FIELD ROBOTICS, 2024, 41 (02) : 273 - 287
[33] AUGMENTED-TRAINING-AWARE BISENET FOR REAL-TIME SEMANTIC SEGMENTATION
Hsu, Chih-Chung
Lee, Cheih
Tai, Shen-Chieh
Jiang, Yun-Zhong
2022 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (IEEE ICMEW 2022), 2022,
[34] Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation
Peng, Chengli
Tian, Tian
Chen, Chen
Guo, Xiaojie
Ma, Jiayi
NEURAL NETWORKS, 2021, 137 : 188 - 199
[35] LightSeg: Local Spatial Perception Convolution for Real-Time Semantic Segmentation
Lei, Xiaochun
Liang, Jiaming
Gong, Zhaoting
Jiang, Zetao
APPLIED SCIENCES-BASEL, 2023, 13 (14):
[36] MiniNet: An Efficient Semantic Segmentation ConvNet for Real-Time Robotic Applications
Alonso, Inigo
Riazuelo, Luis
Murillo, Ana C.
IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (04) : 1340 - 1347
[37] ERFNet: Efficient Residual Factorized ConvNet for Real-Time Semantic Segmentation
Romera, Eduardo
Alvarez, Jose M.
Bergasa, Luis M.
Arroyo, Roberto
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (01) : 263 - 272
[38] Real-time semantic segmentation with weighted factorized-depthwise convolution
Hao, Xiaochen
Hao, Xingjun
Zhang, Yaru
Li, Yuanyuan
Wu, Chao
IMAGE AND VISION COMPUTING, 2021, 114
[39] ADSCNet: asymmetric depthwise separable convolution for semantic segmentation in real-time
Wang, Jiawei
Xiong, Hongyun
Wang, Haibo
Nian, Xiaohong
APPLIED INTELLIGENCE, 2020, 50 (04) : 1045 - 1056
[40] NDNet: Narrow While Deep Network for Real-Time Semantic Segmentation
Yang, Zhengeng
Yu, Hongshan
Fu, Qiang
Sun, Wei
Jia, Wenyan
Sun, Mingui
Mao, Zhi-Hong
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (09) : 5508 - 5519

← 1 2 3 4 5 →