Deterministic convergence of an online gradient method for neural networks

被引：33

作者：

Wu, W ^{[1
]}

Xu, YS

机构：

[1] Dalian Univ Technol, Dept Math, Dalian 116023, Peoples R China

[2] N Dakota State Univ, Dept Math, Fargo, ND 58105 USA

[3] Acad Sinica, Math Inst, Beijing 100080, Peoples R China

来源：

JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS | 2002年 / 144卷 / 1-2期

关键词：

online stochastic gradient method; nonlinear feedforward neural networks; deterministic convergence; monotonicity; constant learning rate;

D O I：

10.1016/S0377-0427(01)00571-4

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

The online gradient method has been widely used as a learning algorithm for neural networks. We establish a deterministic convergence of online gradient methods for the training of a class of nonlinear feedforward neural networks when the training examples are linearly independent. We choose the learning rate eta to be a constant during the training procedure. The monotonicity of the error function in the iteration is proved. A criterion for choosing the learning rate eta is also provided to guarantee the convergence. Under certain conditions similar to those for the classical gradient methods, an optimal convergence rate for our online gradient methods is proved. (C) 2001 Elsevier Science B.V. All rights reserved.

引用

页码：335 / 347

页数：13

共 18 条

[1] 1ST-ORDER AND 2ND-ORDER METHODS FOR LEARNING - BETWEEN STEEPEST DESCENT AND NEWTON METHOD [J].

BATTITI, R .

NEURAL COMPUTATION, 1992, 4 (02) :141-166

[2]

Billingsley P., 1985, PROBABILITY MEASURE

[3]

ELLACOTT SW, 1993, MATH APPROACHES NEUR, P103

[4] Parameter convergence and learning curves for neural networks [J].

Fine, TL ;

Mukherjee, S .

NEURAL COMPUTATION, 1999, 11 (03) :747-769

[5] DIFFUSION APPROXIMATIONS FOR THE CONSTANT LEARNING RATE BACKPROPAGATION ALGORITHM AND RESISTANCE TO LOCAL MINIMA [J].

FINNOFF, W .

NEURAL COMPUTATION, 1994, 6 (02) :285-295

[6]

GAIVORONSKI AA, 1994, OPTIMIZATION METHODS, V4, P117, DOI DOI 10.1080/10556789408805582

[7] Convergent on-line algorithms for supervised learning in neural networks [J].

Grippo, L .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (06) :1284-1299

[8]

HASSOUN M., 1995, FDN ARTIFICIAL NEURA

[9]

Haykin S., 1999, NEURAL NETWORK COMPR

[10] CONVERGENCE OF LEARNING ALGORITHMS WITH CONSTANT LEARNING RATES [J].

KUAN, CM ;

HORNIK, K .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 1991, 2 (05) :484-489

← 1 2 →