Small Object Augmentation of Urban Scenes for Real-Time Semantic Segmentation

Cited by: 54
Authors
Yang, Zhengeng [1, 2, 3]
Yu, Hongshan [1, 2]
Feng, Mingtao [1, 2]
Sun, Wei [1, 2]
Lin, Xuefei [4]
Sun, Mingui [3, 5, 6]
Mao, Zhi-Hong [5, 6]
Mian, Ajmal [7]
Affiliations
[1] Hunan Univ, Natl Engn Lab Robot Visual Percept & Control Tech, Coll Elect & Informat Engn, Changsha 410082, Hunan, Peoples R China
[2] Hunan Univ, Shenzhen Inst, Shenzhen 518057, Peoples R China
[3] Univ Pittsburgh, Dept Neurol Surg, Pittsburgh, PA 15260 USA
[4] Hunan Agr Univ, Dept Art, Changsha 410128, Peoples R China
[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA
[6] Univ Pittsburgh, Dept Bioengn, Pittsburgh, PA 15260 USA
[7] Univ Western Australia, Dept Comp Sci, Perth, WA 6009, Australia
Funding
National Natural Science Foundation of China; Natural Science Foundation of Hunan Province; National Institutes of Health (USA)
Keywords
Semantic segmentation; scene understanding; autonomous driving; synthetic dataset; FEATURES; NETWORK;
DOI
10.1109/TIP.2020.2976856
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semantic segmentation is a key step in scene understanding for autonomous driving. Although deep learning has significantly improved the segmentation accuracy, current high-quality models such as PSPNet and DeepLabV3 are inefficient given their complex architectures and reliance on multi-scale inputs. Thus, it is difficult to apply them to real-time or practical applications. On the other hand, existing real-time methods cannot yet produce satisfactory results on small objects such as traffic lights, which are imperative to safe autonomous driving. In this paper, we improve the performance of real-time semantic segmentation from two perspectives, methodology and data. Specifically, we propose a real-time segmentation model coined Narrow Deep Network (NDNet) and build a synthetic dataset by inserting additional small objects into the training images. The proposed method achieves 65.7% mean intersection over union (mIoU) on the Cityscapes test set with only 8.4G floating-point operations (FLOPs) on $1024\times 2048$ inputs. Furthermore, by re-training the existing PSPNet and DeepLabV3 models on our synthetic dataset, we obtained an average 2% mIoU improvement on small objects.
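The abstract reports accuracy as mean intersection over union (mIoU). As a reading aid, the following is a minimal sketch of how mIoU can be computed for a single pair of label maps. The function name `mean_iou` and the toy arrays are illustrative assumptions and are not taken from the paper. Benchmark scoring typically accumulates counts over the whole test set and excludes unlabeled pixels, which this sketch does not do.

```python
import numpy as np

def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes that occur in pred or target (illustrative helper)."""
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    # Confusion matrix: rows are ground-truth classes, columns are predicted classes.
    cm = np.bincount(target * num_classes + pred, minlength=num_classes ** 2)
    cm = cm.reshape(num_classes, num_classes)
    intersection = np.diag(cm)
    # Union = predicted pixels + ground-truth pixels - pixels counted in both.
    union = cm.sum(axis=0) + cm.sum(axis=1) - intersection
    valid = union > 0  # skip classes absent from both maps
    return float((intersection[valid] / union[valid]).mean())

# Toy 2x3 label maps with three classes (made-up data).
pred = np.array([[0, 0, 1], [1, 2, 2]])
target = np.array([[0, 1, 1], [1, 2, 2]])
print(mean_iou(pred, target, num_classes=3))
```

Because every class contributes equally to the average regardless of its pixel count, errors on small objects such as traffic lights weigh as much as errors on large classes such as road.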
Pages: 5175-5190
Page count: 16