Solving the linear interval tolerance problem for weight initialization of neural networks

被引:29
作者
Adam, S. P. [1 ,2 ]
Karras, D. A. [3 ]
Magoulas, G. D. [4 ]
Vrahatis, M. N. [1 ]
机构
[1] Univ Patras, Dept Math, Computat Intelligence Lab, GR-26110 Patras, Greece
[2] Technol Educ Inst Epirus, Dept Comp Engn, Arta 47100, Greece
[3] Technol Educ Inst Sterea Hellas, Dept Automat, Psahna 34400, Evia, Greece
[4] Univ London, Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HX, England
关键词
Neural networks; Weight initialization; Interval analysis; Linear interval tolerance problem; FEEDFORWARD NETWORKS; STATISTICAL TESTS; TRAINING SPEED; HIGH-DIMENSION; BACKPROPAGATION; ALGORITHM; INTELLIGENCE;
D O I
10.1016/j.neunet.2014.02.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining good initial conditions for an algorithm used to train a neural network is considered a parameter estimation problem dealing with uncertainty about the initial weights. Interval analysis approaches model uncertainty in parameter estimation problems using intervals and formulating tolerance problems. Solving a tolerance problem is defining lower and upper bounds of the intervals so that the system functionality is guaranteed within predefined limits. The aim of this paper is to show how the problem of determining the initial weight intervals of a neural network can be defined in terms of solving a linear interval tolerance problem. The proposed linear interval tolerance approach copes with uncertainty about the initial weights without any previous knowledge or specific assumptions on the input data as required by approaches such as fuzzy sets or rough sets. The proposed method is tested on a number of well known benchmarks for neural networks trained with the back-propagation family of algorithms. Its efficiency is evaluated with regards to standard performance measures and the results obtained are compared against results of a number of well known and established initialization methods. These results provide credible evidence that the proposed method outperforms classical weight initialization methods. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:17 / 37
页数:21
相关论文
共 50 条
  • [41] Modeling and Decoding Complex Problem Solving Process by Artificial Neural Networks
    Akan, Adil Kaan
    Kivilcim, Baran Baris
    Akbas, Emre
    Newman, Sharlene D.
    Vural, Fatos T. Yarman
    [J]. 2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [42] Analysis on the Weight initialization Problem in Fully-connected Multi-layer Perceptron Neural Network
    Li Wanchen
    [J]. 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING (ICAICE 2020), 2020, : 150 - 153
  • [43] TOLERANCE ALLOCATION USING NEURAL NETWORKS
    KOPARDEKAR, P
    ANAND, S
    [J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 1995, 10 (04) : 269 - 276
  • [44] State estimation for neural networks with jumping interval weight matrices and transmission delays
    Rao, Hong-Xia
    Lu, Renquan
    Xu, Yong
    Liu, Chang
    [J]. NEUROCOMPUTING, 2018, 275 : 909 - 915
  • [45] Problem-solving using complex networks
    de Arruda, Henrique F.
    Comin, Cesar H.
    Costa, Luciano da F.
    [J]. EUROPEAN PHYSICAL JOURNAL B, 2019, 92 (06)
  • [46] Solving linear and bilinear problems with interval uncertainty
    Latipova, A. T.
    [J]. INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING (ICIE-2015), 2015, 129 : 670 - 675
  • [47] Interval type-2 fuzzy weight adjustment for backpropagation neural networks with application in time series prediction
    Gaxiola, Fernando
    Melin, Patricia
    Valdez, Fevrier
    Castillo, Oscar
    [J]. INFORMATION SCIENCES, 2014, 260 : 1 - 14
  • [48] Solving the Inverse Potential Problem in the Parabolic Equation by the Deep Neural Networks Method
    Zhang, Mengmeng
    Zhang, Zhidong
    [J]. CSIAM TRANSACTIONS ON APPLIED MATHEMATICS, 2024, 5 (04): : 852 - 883
  • [49] HARDWARE DESCRIPTION OF DIGITAL HOPFIELD NEURAL NETWORKS FOR SOLVING SHORTEST PATH PROBLEM
    Asgari, Hajar
    Kavian, Yousef S.
    [J]. NEURAL NETWORK WORLD, 2014, 24 (02) : 211 - 230
  • [50] Solving the Shortest Path Routing Problem Using Noisy Hopfield Neural Networks
    Liu, Wen
    Wang, Lipo
    [J]. 2009 WRI INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND MOBILE COMPUTING: CMC 2009, VOL 2, 2009, : 299 - 302