Small Object Augmentation of Urban Scenes for Real-Time Semantic Segmentation

被引：54

作者：

Yang, Zhengeng ^{[1
,2
,3
]}

Yu, Hongshan ^{[1
,2
]}

Feng, Mingtao ^{[1
,2
]}

Sun, Wei ^{[1
,2
]}

Lin, Xuefei ^{[4
]}

Sun, Mingui ^{[3
,5
,6
]}

Mao, Zhi-Hong ^{[5
,6
]}

Mian, Ajmal ^{[7
]}

机构：

[1] Hunan Univ, Natl Engn Lab Robot Visual Percept & Control Tech, Coll Elect & Informat Engn, Changsha 410082, Hunan, Peoples R China

[2] Hunan Univ, Shenzhen Inst, Shenzhen 518057, Peoples R China

[3] Univ Pittsburgh, Dept Neurol Surg, Pittsburgh, PA 15260 USA

[4] Hunan Agr Univ, Dept Art, Changsha 410128, Peoples R China

[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA

[6] Univ Pittsburgh, Dept Bioengn, Pittsburgh, PA 15260 USA

[7] Univ Western Australia, Dept Comp Sci, Perth, WA 6009, Australia

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2020年 / 29卷

基金：

中国国家自然科学基金; 湖南省自然科学基金; 美国国家卫生研究院;

关键词：

Semantic segmentation; scene understanding; autonomous driving; synthetic dataset; FEATURES; NETWORK;

D O I：

10.1109/TIP.2020.2976856

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semantic segmentation is a key step in scene understanding for autonomous driving. Although deep learning has significantly improved the segmentation accuracy, current high-quality models such as PSPNet and DeepLabV3 are inefficient given their complex architectures and reliance on multi-scale inputs. Thus, it is difficult to apply them to real-time or practical applications. On the other hand, existing real-time methods cannot yet produce satisfactory results on small objects such as traffic lights, which are imperative to safe autonomous driving. In this paper, we improve the performance of real-time semantic segmentation from two perspectives, methodology and data. Specifically, we propose a real-time segmentation model coined Narrow Deep Network (NDNet) and build a synthetic dataset by inserting additional small objects into the training images. The proposed method achieves 65.7% mean intersection over union (mIoU) on the Cityscapes test set with only 8.4G floating-point operations (FLOPs) on $1024\times 2048$ inputs. Furthermore, by re-training the existing PSPNet and DeepLabV3 models on our synthetic dataset, we obtained an average 2% mIoU improvement on small objects.

引用

页码：5175 / 5190

页数：16

共 50 条

[1] Real-time Object Detection and Semantic Segmentation for Autonomous Driving
Li, Baojun
Liu, Shun
Xu, Weichao
Qiu, Wei
MIPPR 2017: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2018, 10608
[2] RTSNet: Real-Time Semantic Segmentation Network For Outdoor Scenes
Ma, Mingyu
Zou, Fengshan
Xu, Fang
Song, Jilai
2019 9TH IEEE ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2019), 2019, : 659 - 664
[3] Small Target Augmentation for Urban Remote Sensing Image Real-Time Segmentation
Ren, Shasha
Liu, Qiong
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 2076 - 2088
[4] Research on Efficient Asymmetric Attention Module for Real-Time Semantic Segmentation Networks in Urban Scenes
Su, Xu
Li, Lihong
Xiao, Jiejie
Wang, Pengtao
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (03) : 562 - 572
[5] Exploring Scale-Aware Features for Real-Time Semantic Segmentation of Street Scenes
Li, Kaige
Geng, Qichuan
Zhou, Zhong
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) : 3575 - 3587
[6] Real-time Semantic Segmentation with Parallel Multiple Views Feature Augmentation
Qiao, Jian-Jun
Cheng, Zhi-Qi
Wu, Xiao
Li, Wei
Zhang, Ji
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6300 - 6308
[7] Rethinking DABNet: Light-Weight Network for Real-Time Semantic Segmentation of Road Scenes
Mazhar S.
Atif N.
Bhuyan M.K.
Ahamed S.R.
IEEE Transactions on Artificial Intelligence, 2024, 5 (06): : 3098 - 3108
[8] Deep Dual-Resolution Networks for Real-Time and Accurate Semantic Segmentation of Traffic Scenes
Pan, Huihui
Hong, Yuanduo
Sun, Weichao
Jia, Yisong
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (03) : 3448 - 3460
[9] Real-Time Semantic Segmentation Algorithm for Street Scenes Based on Attention Mechanism and Feature Fusion
Wu, Bao
Xiong, Xingzhong
Wang, Yong
ELECTRONICS, 2024, 13 (18)
[10] ESNET: EDGE-BASED SEGMENTATION NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION IN TRAFFIC SCENES
Lyu, Haoran
Fu, Huiyuan
Hu, Xiaojun
Liu, Liang
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1855 - 1859

← 1 2 3 4 5 →