Semantic scene segmentation in unstructured environment with modified DeepLabV3+

被引:75
作者
Baheti, Bhakti [1 ]
Innani, Shubham [1 ]
Gajre, Suhas [1 ]
Talbar, Sanjay [1 ]
机构
[1] SGGSIE&T, Ctr Excellence Signal & Image Proc, Nanded 431606, Maharashtra, India
关键词
Semantic Segmentation; Convolutional Neural Network(CNN); Xception; MobileNetV2; EFFICIENT;
D O I
10.1016/j.patrec.2020.07.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic scene segmentation has become a key application in computer vision and is an essential part of intelligent transportation systems for complete scene understanding of the surrounding environment. While several methods based on deep fully Convolutional Neural Network (CNN) have been emerging, there are two main challenges: (i) They mainly focus on improvement of the accuracy than efficiency. (ii) They assume structured driving environment like in USA and Europe. While most of the current works focus on the well structured driving environment, we focus our research on India Driving Dataset (IDD) which contains data from unstructured traffic scenario. In this paper, we propose modifications in the DeepLabV3+ framework by using lower atrous rates in Atrous Spatial Pyramid Pooling (ASPP) module for dense traffic prediction. We propose to use dilated Xception network as the backbone for feature extraction. A lightweight segmentation framework is also presented by exploring the effectiveness of MobileNetV2 architecture, which achieves competitively high accuracy and is much smaller than other state-of-art architectures. The performance is evaluated in terms of mean Intersection over Union (mIoU) on 26 fine grained classes of IDD. Our proposed model with 24 M parameters achieves 68.41 mIoU on test set and efficient mobile model achieves mIoU of 61.6 by reducing the parameters to 2.2 M only. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:223 / 229
页数:7
相关论文
共 34 条
  • [1] Augmented Reality Meets Computer Vision: Efficient Data Generation for Urban Driving Scenes
    Abu Alhaija, Hassan
    Mustikovela, Siva Karthik
    Mescheder, Lars
    Geiger, Andreas
    Rother, Carsten
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (09) : 961 - 972
  • [2] [Anonymous], 2017, Computing Research Repository, DOI DOI 10.4271/2018-01-1635
  • [3] [Anonymous], 2009, P BMVC BRIT MACH VIS
  • [4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [5] Towards Computationally Efficient and Realtime Distracted Driver Detection With MobileVGG Network
    Baheti, Bhakti
    Talbar, Sanjay
    Gajre, Suhas
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2020, 5 (04): : 565 - 574
  • [6] Eff-UNet: A Novel Architecture for Semantic Segmentation in Unstructured Environment
    Baheti, Bhakti
    Innani, Shubham
    Gajre, Suhas
    Talbar, Sanjay
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1473 - 1481
  • [7] Baheti B, 2019, TENCON IEEE REGION, P790, DOI [10.1109/tencon.2019.8929376, 10.1109/TENCON.2019.8929376]
  • [8] Baheti B, 2016, 2016 CONFERENCE ON ADVANCES IN SIGNAL PROCESSING (CASP), P254
  • [9] A Novel Approach for Fully Automatic Intra-Tumor Segmentation With 3D U-Net Architecture for Gliomas
    Baid, Ujjwal
    Talbar, Sanjay
    Rane, Swapnil
    Gupta, Sudeep
    Thakur, Meenakshi H.
    Moiyadi, Aliasgar
    Sable, Nilesh
    Akolkar, Mayuresh
    Mahajan, Abhishek
    [J]. FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2020, 14
  • [10] Chen L.-C., 2015, ABS14127062 ICLR