Fitness Landscape Analysis of Weight-Elimination Neural Networks

Cited by: 11
Authors
Bosman, Anna [1]
Engelbrecht, Andries [1]
Helbig, Marde [1]
Affiliations
[1] Univ Pretoria, Dept Comp Sci, Pretoria, South Africa
Funding
National Research Foundation, Singapore
Keywords
Neural networks; Fitness landscapes; Regularisation; Weight elimination; Continuous optimization problems
DOI
10.1007/s11063-017-9729-9
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Neural network architectures can be regularised by adding a penalty term to the objective function, thus minimising network complexity in addition to the error. However, adding a term to the objective function inevitably changes the surface of the objective function. This study investigates the landscape changes induced by the weight elimination penalty function under various parameter settings. Fitness landscape metrics are used to quantify and visualise the induced landscape changes, as well as to propose sensible ranges for the regularisation parameters. Fitness landscape metrics are shown to be a viable tool for neural network objective function landscape analysis and visualisation.
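The abstract does not state the penalty formula. As a rough illustration only, the sketch below uses the commonly cited weight-elimination form, in which each weight w contributes (w^2 / w0^2) / (1 + w^2 / w0^2) to the penalty, scaled by a coefficient and added to the error. The function names and the parameter names `lam` and `w0` are assumptions made for this example and are not taken from the record.

```python
import numpy as np


def weight_elimination_penalty(weights, w0=1.0):
    """Sum of (w^2 / w0^2) / (1 + w^2 / w0^2) over all weights.

    Each term is near 0 for |w| << w0 and approaches 1 for |w| >> w0,
    so the sum behaves like a soft count of "large" weights.
    """
    scaled = (np.asarray(weights, dtype=float) / w0) ** 2
    return float(np.sum(scaled / (1.0 + scaled)))


def regularised_objective(error, weights, lam=1e-3, w0=1.0):
    """Error plus the scaled penalty (lam and w0 are illustrative names)."""
    return error + lam * weight_elimination_penalty(weights, w0)


if __name__ == "__main__":
    w = np.array([0.05, -0.5, 2.0, 10.0])
    # Vary w0 to see how the penalty term reshapes the objective value.
    for w0 in (0.1, 1.0, 10.0):
        print(w0, regularised_objective(error=0.2, weights=w, lam=0.1, w0=w0))
```

Under these assumptions, `lam` and `w0` are the two regularisation parameters whose settings change the shape of the objective surface: `lam` controls how much the penalty contributes, and `w0` sets the scale at which a weight starts to count as large.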
Pages: 353-373
Number of pages: 21