Small Object Augmentation of Urban Scenes for Real-Time Semantic Segmentation

Cited by: 54

Authors:

Yang, Zhengeng [1,2,3]

Yu, Hongshan [1,2]

Feng, Mingtao [1,2]

Sun, Wei [1,2]

Lin, Xuefei [4]

Sun, Mingui [3,5,6]

Mao, Zhi-Hong [5,6]

Mian, Ajmal [7]

Affiliations:

[1] Hunan Univ, Natl Engn Lab Robot Visual Percept & Control Tech, Coll Elect & Informat Engn, Changsha 410082, Hunan, Peoples R China

[2] Hunan Univ, Shenzhen Inst, Shenzhen 518057, Peoples R China

[3] Univ Pittsburgh, Dept Neurol Surg, Pittsburgh, PA 15260 USA

[4] Hunan Agr Univ, Dept Art, Changsha 410128, Peoples R China

[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA

[6] Univ Pittsburgh, Dept Bioengn, Pittsburgh, PA 15260 USA

[7] Univ Western Australia, Dept Comp Sci, Perth, WA 6009, Australia

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2020 / Vol. 29

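Funding:

National Natural Science Foundation of China; Natural Science Foundation of Hunan Province; National Institutes of Health (USA);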

Keywords:

Semantic segmentation; scene understanding; autonomous driving; synthetic dataset; FEATURES; NETWORK;

DOI:

10.1109/TIP.2020.2976856

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Semantic segmentation is a key step in scene understanding for autonomous driving. Although deep learning has significantly improved the segmentation accuracy, current high-quality models such as PSPNet and DeepLabV3 are inefficient given their complex architectures and reliance on multi-scale inputs. Thus, it is difficult to apply them to real-time or practical applications. On the other hand, existing real-time methods cannot yet produce satisfactory results on small objects such as traffic lights, which are imperative to safe autonomous driving. In this paper, we improve the performance of real-time semantic segmentation from two perspectives, methodology and data. Specifically, we propose a real-time segmentation model coined Narrow Deep Network (NDNet) and build a synthetic dataset by inserting additional small objects into the training images. The proposed method achieves 65.7% mean intersection over union (mIoU) on the Cityscapes test set with only 8.4G floating-point operations (FLOPs) on $1024\times 2048$ inputs. Furthermore, by re-training the existing PSPNet and DeepLabV3 models on our synthetic dataset, we obtained an average 2% mIoU improvement on small objects.
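The abstract reports accuracy as mean intersection over union (mIoU). As a minimal sketch of how that metric is commonly computed (not the authors' evaluation code), the Python below accumulates a confusion matrix over labeled pixels and averages the per-class IoU, TP / (TP + FP + FN). The function names, the toy label maps, and the use of 255 as an "ignore" label are assumptions made for illustration.

```python
from typing import List, Sequence


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int, ignore_label: int = 255) -> List[List[int]]:
    """Count pixels: rows are ground-truth classes, columns are predictions."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        if t == ignore_label:  # assumed convention: skip unlabeled pixels
            continue
        cm[t][p] += 1
    return cm


def mean_iou(cm: List[List[int]]) -> float:
    """Average per-class IoU = TP / (TP + FP + FN) over classes that occur."""
    n = len(cm)
    ious = []
    for c in range(n):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                        # class-c pixels predicted as another class
        fp = sum(cm[r][c] for r in range(n)) - tp   # other pixels predicted as class c
        denom = tp + fp + fn
        if denom > 0:  # skip classes absent from both labels and predictions
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: flattened label maps with 3 classes and one ignored pixel.
labels = [0, 0, 1, 1, 2, 255]
predictions = [0, 1, 1, 1, 2, 2]
cm = confusion_matrix(labels, predictions, num_classes=3)
miou = mean_iou(cm)
print(f"mIoU = {miou:.4f}")
```

Because IoU is averaged per class rather than per pixel, a class covering few pixels (such as traffic lights) weighs as much as a large class like road, which is why gains on small objects show up in mIoU.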


Pages: 5175-5190

Page count: 16
