LEARNING ALGORITHMS FOR NEURAL NETWORKS BASED ON QUASI-NEWTON METHODS WITH SELF-SCALING

被引:16
作者
BEIGI, HSM [1 ]
LI, CJ [1 ]
机构
[1] COLUMBIA UNIV,DEPT MECH ENGN,NEW YORK,NY 10027
来源
JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME | 1993年 / 115卷 / 01期
关键词
Algorithms - Control systems - Convergence of numerical methods - Describing functions - Function evaluation - Learning systems - Optimization - Systems analysis;
D O I
10.1115/1.2897405
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Previous studies have suggested that, for moderate sized neural networks, the use of classical Quasi-Newton methods yields the best convergence properties among all the state-of-the-art [1]. This paper describes a set of even better learning algorithms based on a class of Quasi-Newton optimization techniques called Self-Scaling Variable Metric (SSVM) methods. One of the characteristics of SSVM methods is that they provide a set of search directions which are invariant under the scaling of the objective function. With an XOR benchmark and an encoder benchmark, simulations using the SS VM algorithms for the learning of general feedforward neural networks were carried out to study their performance. Compared to classical Quasi-Newton methods, it is shown that the SSVM method reduces the number of iterations required for convergence by 40 percent to 60 percent that of the classical Quasi-Newton methods which, in general, converge two to three orders of magnitude faster than the steepest descent techniques.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 27 条
[1]  
[Anonymous], 1970, IMA J APPL MATH, DOI DOI 10.1093/IMAMAT/6.1.76
[2]  
BECKER S, 1988, 1988 P CONN MOD SUMM, P29
[3]  
BEIGI HSM, 1990, MAR ISMM INT S COMP
[4]  
Broyden C. G., 1970, Journal of the Institute of Mathematics and Its Applications, V6, P222
[5]  
FAHLMAN SE, 1988, 1988 P CONN MOD SUMM, P38
[6]   A NEW APPROACH TO VARIABLE METRIC ALGORITHMS [J].
FLETCHER, R .
COMPUTER JOURNAL, 1970, 13 (03) :317-&
[7]  
FLETCHER R, 1987, PRACTICAL METHODS OP, P54
[8]   A FAMILY OF VARIABLE-METRIC METHODS DERIVED BY VARIATIONAL MEANS [J].
GOLDFARB, D .
MATHEMATICS OF COMPUTATION, 1970, 24 (109) :23-&
[9]   MAXIMIZATION BY QUADRATIC HILL-CLIMBING [J].
GOLDFELD, SM ;
QUANDT, RE ;
TROTTER, HF .
ECONOMETRICA, 1966, 34 (03) :541-&
[10]   ON RELATIVE EFFICIENCIES OF GRADIENT METHODS [J].
GREENSTADT, J .
MATHEMATICS OF COMPUTATION, 1967, 21 (99) :360-+