Semantic scene segmentation in unstructured environment with modified DeepLabV3+

被引:74
|
作者
Baheti, Bhakti [1 ]
Innani, Shubham [1 ]
Gajre, Suhas [1 ]
Talbar, Sanjay [1 ]
机构
[1] SGGSIE&T, Ctr Excellence Signal & Image Proc, Nanded 431606, Maharashtra, India
关键词
Semantic Segmentation; Convolutional Neural Network(CNN); Xception; MobileNetV2; EFFICIENT;
D O I
10.1016/j.patrec.2020.07.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic scene segmentation has become a key application in computer vision and is an essential part of intelligent transportation systems for complete scene understanding of the surrounding environment. While several methods based on deep fully Convolutional Neural Network (CNN) have been emerging, there are two main challenges: (i) They mainly focus on improvement of the accuracy than efficiency. (ii) They assume structured driving environment like in USA and Europe. While most of the current works focus on the well structured driving environment, we focus our research on India Driving Dataset (IDD) which contains data from unstructured traffic scenario. In this paper, we propose modifications in the DeepLabV3+ framework by using lower atrous rates in Atrous Spatial Pyramid Pooling (ASPP) module for dense traffic prediction. We propose to use dilated Xception network as the backbone for feature extraction. A lightweight segmentation framework is also presented by exploring the effectiveness of MobileNetV2 architecture, which achieves competitively high accuracy and is much smaller than other state-of-art architectures. The performance is evaluated in terms of mean Intersection over Union (mIoU) on 26 fine grained classes of IDD. Our proposed model with 24 M parameters achieves 68.41 mIoU on test set and efficient mobile model achieves mIoU of 61.6 by reducing the parameters to 2.2 M only. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:223 / 229
页数:7
相关论文
共 50 条
  • [21] Segmentation of Brain Tumors Using DeepLabv3+
    Choudhury, Ahana Roy
    Vanguri, Rami
    Jambawalikar, Sachin R.
    Kumar, Piyush
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2018, PT II, 2019, 11384 : 154 - 167
  • [22] Semantic Segmentation of Forward-Looking Sonar Images Based on Improved Deeplabv3+
    Yin, Fei
    Nie, Weizhi
    Su, Yishan
    OCEANS 2024 - SINGAPORE, 2024,
  • [23] A lightweight semantic segmentation method for concrete bridge surface diseases based on improved DeeplabV3+
    Yu, Zhiyuan
    Dai, Chunquan
    Zeng, Xiaoming
    Lv, Yunlong
    Li, Haisheng
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [24] Diabetic fundus lesion segmentation by improved DeepLabv3+
    Ma X.
    Liu W.
    Li H.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2024, 52 (05): : 90 - 97
  • [25] Marine Target Segmentation Based on improved DeepLabv3+
    Fu, Huixuan
    Gu, Zhiqiang
    Wang, Bingyu
    Wang, Yuchao
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7314 - 7319
  • [26] A novel method for semantic segmentation of sewer defects based on StyleGAN3 and improved Deeplabv3+
    Li, Youlin
    Yang, Yang
    Liu, Yong
    Zhong, Fengcheng
    Zheng, Hongrui
    Wang, Shiji
    Wang, Zurui
    Huang, Zhangyang
    JOURNAL OF CIVIL STRUCTURAL HEALTH MONITORING, 2025,
  • [27] Semantic Segmentation of High-Resolution Airborne Images with Dual-Stream DeepLabV3+
    Akcay, Ozgun
    Kinaci, Ahmet Cumhur
    Avsar, Emin Ozgur
    Aydar, Umut
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2022, 11 (01)
  • [28] DCN-Deeplabv3+: A Novel Road Segmentation Algorithm Based on Improved Deeplabv3+
    Peng, Hongming
    Xiang, Siyu
    Chen, Mingju
    Li, Hongyang
    Su, Qin
    IEEE ACCESS, 2024, 12 : 87397 - 87406
  • [29] Multi-scale dense and attention mechanism for image semantic segmentation based on improved DeepLabv3+
    Wang, Zuoshuai
    Zhang, Hongyi
    Huang, Zhiquan
    Lin, Zhibin
    Wu, Hangxing
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (05)
  • [30] L-DeeplabV3+: a lightweight semantic segmentation algorithm for complex scene perception
    Fei, Zhengshun
    Xin, Kai
    Liu, Li
    Wang, Jinglong
    Chen, Tiandong
    Xiang, Xinjian
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)