Regularising neural networks using flexible multivariate activation function

被引:23
作者
Solazzi, M
Uncini, A
机构
[1] Univ Roma La Sapienza, Dipartimento INFOCOM, I-00184 Rome, Italy
[2] Univ Ancona, Dipartimento Elettron & Automat, I-60131 Ancona, Italy
关键词
neural networks; spline neural networks; multilayer perceptron; generalised sigmoidal functions; adaptive activation functions; spline; regularisation; generalisation;
D O I
10.1016/S0893-6080(03)00189-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new general neural structure based on nonlinear flexible multivariate function that can be viewed in the framework of the generalised regularisation net-works theory. The proposed architecture is based on multi-dimensional adaptive cubic spline basis activation function that collects information from the previous network layer in aggregate form. In other words, each activation function represents a spline function of a subset of previous layer outputs so the number of network connections (structural complexity) can be very low with respect to the problem complexity. A specific learning algorithm, based on the adaptation of local parameters of the activation function, is derived. This fact improve the network generalisation capabilities and speed up the convergence of the learning process. At last, some experimental results demonstrating the effectiveness of the proposed architecture, are presented. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:247 / 260
页数:14
相关论文
共 34 条
[1]   A UNIVERSAL THEOREM ON LEARNING-CURVES [J].
AMARI, SI .
NEURAL NETWORKS, 1993, 6 (02) :161-166
[2]  
[Anonymous], 1 IEEE INT C NEUR NE
[3]  
Catmull E., 1974, COMPUT AIDED GEOM D, V74, P317, DOI 10.1016/B978-0-12-079050-0.50020-5
[4]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[5]   ON THE APPROXIMATE REALIZATION OF CONTINUOUS-MAPPINGS BY NEURAL NETWORKS [J].
FUNAHASHI, K .
NEURAL NETWORKS, 1989, 2 (03) :183-192
[6]   REGULARIZATION THEORY AND NEURAL NETWORKS ARCHITECTURES [J].
GIROSI, F ;
JONES, M ;
POGGIO, T .
NEURAL COMPUTATION, 1995, 7 (02) :219-269
[7]  
GIROSI F, 1993, AI MEMO, V1430
[8]   Multilayer feedforward networks with adaptive spline activation function [J].
Guarnieri, S ;
Piazza, F ;
Uncini, A .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (03) :672-683
[9]  
Haykin, 1996, ADAPTIVE FILTER THEO, V3rd
[10]  
Haykin S., 1999, Neural Networks: A Comprehensive Foundation, V2nd ed