A novel weight initialization with adaptive hyper-parameters for deep semantic segmentation

被引:0
|
作者
Nuhman Ul Haq
Ahmad Khan
Zia ur Rehman
Ahmad Din
Ling Shao
Sajid Shah
机构
[1] Abbottabad Campus University Road Tobe Camp,COMSATS University Islamabad (CUI)
[2] Inception Institute of Artificial Intelligence,undefined
来源
Multimedia Tools and Applications | 2021年 / 80卷
关键词
Semantic segmentation; Deep learning; Initialization; Adaptive layer learning rate;
D O I
暂无
中图分类号
学科分类号
摘要
The semantic segmentation process divides an image into its constituent objects and background by assigning a corresponding class label to each pixel in the image. Semantic segmentation is an important area in computer vision with wide practical applications. The contemporary semantic segmentation approaches are primarily based on two types of deep neural networks architectures i.e., symmetric and asymmetric networks. Both types of networks consist of several layers of neurons which are arranged in two sections called encoder and decoder. The encoder section receives the input image and the decoder section outputs the segmented image. However, both sections in symmetric networks have the same number of layers and the number of neurons in an encoder layer is the same as that of the corresponding layer in the decoder section but asymmetric networks do not strictly follow such one-one correspondence between encoder and decoder layers. At the moment, SegNet and ESNet are the two leading state-of-the-art symmetric encoder-decoder deep neural network architectures. However, both architectures require extensive training for good generalization and need several hundred epochs for convergence. This paper aims to improve the convergence and enhance network generalization by introducing two novelties into the network training process. The first novelty is a weight initialization method and the second contribution is an adaptive mechanism for dynamic layer learning rate adjustment in training loop. The proposed initialization technique uses transfer learning to initialize the encoder section of the network, but for initialization of decoder section, the weights of the encoder section layers are copied to the corresponding layers of the decoder section. The second contribution of the paper is an adaptive layer learning rate method, wherein the learning rates of the encoder layers are updated based on a metric representing the difference between the probability distributions of the input images and encoder weights. Likewise, the learning rates of the decoder layers are updated based on the difference between the probability distributions of the output labels and decoder weights. Intensive empirical validation of the proposed approach shows significant improvement in terms of faster convergence and generalization.
引用
收藏
页码:21771 / 21787
页数:16
相关论文
共 43 条
  • [21] A novel W13 deep CNN structure for improved semantic segmentation of multiple objects in remote sensing imagery
    Khaled Mohammed Elgamily
    M. A. Mohamed
    Ahmed Mohamed Abou-Taleb
    Mohamed Maher Ata
    Neural Computing and Applications, 2025, 37 (7) : 5397 - 5427
  • [22] OffRoadSynth Open Dataset for Semantic Segmentation using Synthetic-Data-Based Weight Initialization for Autonomous UGV in Off-Road Environments
    Malek, Konrad
    Dybala, Jacek
    Kordecki, Andrzej
    Hondra, Piotr
    Kijania, Katarzyna
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (02)
  • [23] A novel approach to use semantic segmentation based deep learning networks to classify multi-temporal SAR data
    Mehra, Aryan
    Jain, Nihal
    Srivastava, Hari Shanker
    GEOCARTO INTERNATIONAL, 2022, 37 (01) : 163 - 178
  • [24] OffRoadSynth Open Dataset for Semantic Segmentation using Synthetic-Data-Based Weight Initialization for Autonomous UGV in Off-Road Environments
    Konrad Małek
    Jacek Dybała
    Andrzej Kordecki
    Piotr Hondra
    Katarzyna Kijania
    Journal of Intelligent & Robotic Systems, 110 (2)
  • [25] SegFast-V2: Semantic image segmentation with less parameters in deep learning for autonomous driving
    Ghosh, Swarnendu
    Pal, Anisha
    Jaiswal, Shourya
    Santosh, K. C.
    Das, Nibaran
    Nasipuri, Mita
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (11) : 3145 - 3154
  • [26] SegFast-V2: Semantic image segmentation with less parameters in deep learning for autonomous driving
    Swarnendu Ghosh
    Anisha Pal
    Shourya Jaiswal
    K. C. Santosh
    Nibaran Das
    Mita Nasipuri
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 3145 - 3154
  • [27] Handling Open-Set Noise and Novel Target Recognition in Domain Adaptive Semantic Segmentation
    Guo, Xiaoqing
    Liu, Jie
    Liu, Tongliang
    Yuan, Yixuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9846 - 9861
  • [28] MFVNet: a deep adaptive fusion network with multiple field-of-views for remote sensing image semantic segmentation
    Yansheng Li
    Wei Chen
    Xin Huang
    Zhi Gao
    Siwei Li
    Tao He
    Yongjun Zhang
    Science China Information Sciences, 2023, 66
  • [29] MFVNet: a deep adaptive fusion network with multiple field-of-views for remote sensing image semantic segmentation
    Li, Yansheng
    Chen, Wei
    Huang, Xin
    Gao, Zhi
    Li, Siwei
    He, Tao
    Zhang, Yongjun
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (04)
  • [30] Semantic Segmentation Network Based on Adaptive Attention and Deep Fusion Utilizing a Multi-Scale Dilated Convolutional Pyramid
    Zhao, Shan
    Wang, Zihao
    Huo, Zhanqiang
    Zhang, Fukai
    SENSORS, 2024, 24 (16)