A novel weight initialization with adaptive hyper-parameters for deep semantic segmentation

被引:0
|
作者
Nuhman Ul Haq
Ahmad Khan
Zia ur Rehman
Ahmad Din
Ling Shao
Sajid Shah
机构
[1] Abbottabad Campus University Road Tobe Camp,COMSATS University Islamabad (CUI)
[2] Inception Institute of Artificial Intelligence,undefined
来源
Multimedia Tools and Applications | 2021年 / 80卷
关键词
Semantic segmentation; Deep learning; Initialization; Adaptive layer learning rate;
D O I
暂无
中图分类号
学科分类号
摘要
The semantic segmentation process divides an image into its constituent objects and background by assigning a corresponding class label to each pixel in the image. Semantic segmentation is an important area in computer vision with wide practical applications. The contemporary semantic segmentation approaches are primarily based on two types of deep neural networks architectures i.e., symmetric and asymmetric networks. Both types of networks consist of several layers of neurons which are arranged in two sections called encoder and decoder. The encoder section receives the input image and the decoder section outputs the segmented image. However, both sections in symmetric networks have the same number of layers and the number of neurons in an encoder layer is the same as that of the corresponding layer in the decoder section but asymmetric networks do not strictly follow such one-one correspondence between encoder and decoder layers. At the moment, SegNet and ESNet are the two leading state-of-the-art symmetric encoder-decoder deep neural network architectures. However, both architectures require extensive training for good generalization and need several hundred epochs for convergence. This paper aims to improve the convergence and enhance network generalization by introducing two novelties into the network training process. The first novelty is a weight initialization method and the second contribution is an adaptive mechanism for dynamic layer learning rate adjustment in training loop. The proposed initialization technique uses transfer learning to initialize the encoder section of the network, but for initialization of decoder section, the weights of the encoder section layers are copied to the corresponding layers of the decoder section. The second contribution of the paper is an adaptive layer learning rate method, wherein the learning rates of the encoder layers are updated based on a metric representing the difference between the probability distributions of the input images and encoder weights. Likewise, the learning rates of the decoder layers are updated based on the difference between the probability distributions of the output labels and decoder weights. Intensive empirical validation of the proposed approach shows significant improvement in terms of faster convergence and generalization.
引用
收藏
页码:21771 / 21787
页数:16
相关论文
共 43 条
  • [1] A novel weight initialization with adaptive hyper-parameters for deep semantic segmentation
    Haq, Nuhman Ui
    Khan, Ahmad
    Rehman, Zia Ur
    Din, Ahmad
    Shao, Ling
    Shah, Sajid
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (14) : 21771 - 21787
  • [2] Constraints on Hyper-parameters in Deep Learning Convolutional Neural Networks
    Al-Saggaf, Ubaid M.
    Botalb, Abdelaziz
    Faisal, Muhammad
    Moinuddin, Muhammad
    Alsaggaf, Abdulrahman U.
    Alfakeh, Sulhi Ali
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (11) : 439 - 449
  • [3] Depth-Adaptive Deep Neural Network for Semantic Segmentation
    Kang, Byeongkeun
    Lee, Yeejin
    Nguyen, Truong Q.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (09) : 2478 - 2490
  • [4] Automatically Avoiding Overfitting in Deep Neural Networks by Using Hyper-Parameters Optimization Methods
    Kadhim, Zahraa Saddi
    Abdullah, Hasanen S.
    Ghathwan, Khalil I.
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (05) : 146 - 162
  • [5] Determination of the Live Weight of Farm Animals with Deep Learning and Semantic Segmentation Techniques
    Guvenoglu, Erdal
    APPLIED SCIENCES-BASEL, 2023, 13 (12):
  • [6] A deep learning-based and adaptive region proposal algorithm for semantic segmentation
    Taghizadeh, Maryam
    Chalechale, Abdolah
    APPLIED SOFT COMPUTING, 2024, 155
  • [7] Deep-learning: investigating deep neural networks hyper-parameters and comparison of performance to shallow methods for modeling bioactivity data
    Koutsoukas, Alexios
    Monaghan, Keith J.
    Li, Xiaoli
    Huan, Jun
    JOURNAL OF CHEMINFORMATICS, 2017, 9
  • [8] Deep-learning: investigating deep neural networks hyper-parameters and comparison of performance to shallow methods for modeling bioactivity data
    Alexios Koutsoukas
    Keith J. Monaghan
    Xiaoli Li
    Jun Huan
    Journal of Cheminformatics, 9
  • [9] Surface defect detection and semantic segmentation with a novel lightweight deep neural network
    Huang, Qiang
    Li, Fudong
    Yang, Yuequan
    Tao, Xian
    Li, Wei
    Wang, Xu
    Wang, Yong
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (08)
  • [10] Adaptive Update in Deep Learning Algorithms for LiDAR Data Semantic Segmentation
    Hamid, Nur
    Wibisono, Ari
    Gamal, Ahmad
    Ardhianto, Ronni
    Jatmiko, Wisnu
    2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 1038 - 1041