Small Object Augmentation of Urban Scenes for Real-Time Semantic Segmentation

被引:54
|
作者
Yang, Zhengeng [1 ,2 ,3 ]
Yu, Hongshan [1 ,2 ]
Feng, Mingtao [1 ,2 ]
Sun, Wei [1 ,2 ]
Lin, Xuefei [4 ]
Sun, Mingui [3 ,5 ,6 ]
Mao, Zhi-Hong [5 ,6 ]
Mian, Ajmal [7 ]
机构
[1] Hunan Univ, Natl Engn Lab Robot Visual Percept & Control Tech, Coll Elect & Informat Engn, Changsha 410082, Hunan, Peoples R China
[2] Hunan Univ, Shenzhen Inst, Shenzhen 518057, Peoples R China
[3] Univ Pittsburgh, Dept Neurol Surg, Pittsburgh, PA 15260 USA
[4] Hunan Agr Univ, Dept Art, Changsha 410128, Peoples R China
[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA
[6] Univ Pittsburgh, Dept Bioengn, Pittsburgh, PA 15260 USA
[7] Univ Western Australia, Dept Comp Sci, Perth, WA 6009, Australia
基金
中国国家自然科学基金; 湖南省自然科学基金; 美国国家卫生研究院;
关键词
Semantic segmentation; scene understanding; autonomous driving; synthetic dataset; FEATURES; NETWORK;
D O I
10.1109/TIP.2020.2976856
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation is a key step in scene understanding for autonomous driving. Although deep learning has significantly improved the segmentation accuracy, current high-quality models such as PSPNet and DeepLabV3 are inefficient given their complex architectures and reliance on multi-scale inputs. Thus, it is difficult to apply them to real-time or practical applications. On the other hand, existing real-time methods cannot yet produce satisfactory results on small objects such as traffic lights, which are imperative to safe autonomous driving. In this paper, we improve the performance of real-time semantic segmentation from two perspectives, methodology and data. Specifically, we propose a real-time segmentation model coined Narrow Deep Network (NDNet) and build a synthetic dataset by inserting additional small objects into the training images. The proposed method achieves 65.7% mean intersection over union (mIoU) on the Cityscapes test set with only 8.4G floating-point operations (FLOPs) on $1024\times 2048$ inputs. Furthermore, by re-training the existing PSPNet and DeepLabV3 models on our synthetic dataset, we obtained an average 2% mIoU improvement on small objects.
引用
收藏
页码:5175 / 5190
页数:16
相关论文
共 50 条
  • [21] Real-Time Driving Scene Semantic Segmentation
    Wang, Wenfu
    Fu, Yongjian
    Pan, Zhijie
    Li, Xi
    Zhuang, Yueting
    IEEE ACCESS, 2020, 8 : 36776 - 36788
  • [22] Background Subtraction With Real-Time Semantic Segmentation
    Zeng, Dongdong
    Chen, Xiang
    Zhu, Ming
    Goesele, Michael
    Kuijper, Arjan
    IEEE ACCESS, 2019, 7 : 153869 - 153884
  • [23] Real-time semantic segmentation with local spatial pixel adjustment
    Xiao, Cunjun
    Hao, Xingjun
    Li, Haibin
    Li, Yaqian
    Zhang, Wenming
    IMAGE AND VISION COMPUTING, 2022, 123
  • [24] Context and Spatial Feature Calibration for Real-Time Semantic Segmentation
    Li, Kaige
    Geng, Qichuan
    Wan, Maoxian
    Cao, Xiaochun
    Zhou, Zhong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5465 - 5477
  • [25] Real-time semantic segmentation via sequential knowledge distillation
    Wu, Jipeng
    Ji, Rongrong
    Liu, Jianzhuang
    Xu, Mingliang
    Zheng, Jiawen
    Shao, Ling
    Tian, Qi
    NEUROCOMPUTING, 2021, 439 : 134 - 145
  • [26] Real-time Object Detection and Semantic Segmentation Hardware System with Deep Learning Networks
    Fang, Shaoxia
    Tian, Lu
    Wang, Junbin
    Liang, Shuang
    Xie, Dongliang
    Chen, Zhongmin
    Sui, Lingzhi
    Yu, Qian
    Sun, Xiaoming
    Shan, Yi
    Wang, Yu
    2018 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT 2018), 2018, : 392 - 395
  • [27] UDS-SLAM: real-time robust visual SLAM based on semantic segmentation in dynamic scenes
    Liu, Jun
    Dong, Junyuan
    Hu, Mingming
    Lu, Xu
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2024, 51 (02): : 206 - 218
  • [28] FuseSeg: Semantic Segmentation of Urban Scenes Based on RGB and Thermal Data Fusion
    Sun, Yuxiang
    Zuo, Weixun
    Yun, Peng
    Wang, Hengli
    Liu, Ming
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 18 (03) : 1000 - 1011
  • [29] Real-time semantic segmentation via mutual optimization of spatial details and semantic information
    Ma, Mengyuan
    Huang, Huiling
    Han, Jun
    Feng, Yanbing
    Yang, Yi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (03) : 6821 - 6834
  • [30] Semantic Segmentation of a Point Clouds of an Urban Scenes
    Dashkevich, Andrey
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT SYSTEMS (COLINS-2019), VOL I: MAIN CONFERENCE, 2019, 2362 : 208 - 217