Analysis of Tikhonov regularization for function approximation by neural networks

被引:24
作者
Burger, M [1 ]
Neubauer, A [1 ]
机构
[1] Johannes Kepler Univ Linz, Inst Ind Math, A-4040 Linz, Austria
关键词
ill-posed problems; neural networks; Tikhonov regularization; output smoothing; weight decay; function approximation;
D O I
10.1016/S0893-6080(02)00167-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is devoted to the convergence and stability analysis of Tikhonov regularization for function approximation by a class of feedforward neural networks with one hidden layer and linear output layer. We investigate two frequently used approaches, namely regularization by output smoothing and regularization by weight decay, as well as a combination of both methods to combine their advantages. We show that in all cases stable approximations are obtained converging to the approximated function in a desired Sobolev space as the noise in the data tends to zero (in the weaker L-2-norm) if the regularization parameter and the number of units in the network are chosen appropriately. Under additional smoothness assumptions we are able to show convergence rates results in terms of the noise level and the number of units in the network. In addition, we show how the theoretical results can be applied to the important classes of perceptrons with one hidden layer and to translation networks. Finally, the performance of the different approaches is compared in some numerical examples. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:79 / 90
页数:12
相关论文
共 24 条
[1]  
Adams R. A., 1975, SOBOLEV SPACES
[2]   UNIVERSAL APPROXIMATION BOUNDS FOR SUPERPOSITIONS OF A SIGMOIDAL FUNCTION [J].
BARRON, AR .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (03) :930-945
[3]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[4]   TRAINING WITH NOISE IS EQUIVALENT TO TIKHONOV REGULARIZATION [J].
BISHOP, CM .
NEURAL COMPUTATION, 1995, 7 (01) :108-116
[5]   Training neural networks with noisy data as an ill-posed problem [J].
Burger, M ;
Engl, HW .
ADVANCES IN COMPUTATIONAL MATHEMATICS, 2000, 13 (04) :335-354
[6]   Regularization methods for blind deconvolution and blind source separation problems [J].
Burger, M ;
Scherzer, O .
MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 2001, 14 (04) :358-383
[7]   Error bounds for approximation with neural networks [J].
Burger, M ;
Neubauer, A .
JOURNAL OF APPROXIMATION THEORY, 2001, 112 (02) :235-250
[8]   APPROXIMATION BY RIDGE FUNCTIONS AND NEURAL NETWORKS WITH ONE HIDDEN LAYER [J].
CHUI, CK ;
LI, X .
JOURNAL OF APPROXIMATION THEORY, 1992, 70 (02) :131-141
[9]  
DAUTRAY R., 1990, MATH ANAL NUMERICAL, V3
[10]  
Engl H., 1996, Mathematics and Its Applications, V375, DOI DOI 10.1007/978-94-009-1740-8