The Chaotic Nature of Faster Gradient Descent Methods

被引:21
作者
van den Doel, Kees [1 ]
Ascher, Uri [1 ]
机构
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1W5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Gradient descent; Iterative methods; Chaos; Dynamical systems; STEEPEST DESCENT; BARZILAI; BEHAVIOR;
D O I
10.1007/s10915-011-9521-3
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The steepest descent method for large linear systems is well-known to often converge very slowly, with the number of iterations required being about the same as that obtained by utilizing a gradient descent method with the best constant step size and growing proportionally to the condition number. Faster gradient descent methods must occasionally resort to significantly larger step sizes, which in turn yields a rather non-monotone decrease pattern in the residual vector norm. We show that such faster gradient descent methods in fact generate chaotic dynamical systems for the normalized residual vectors. Very little is required to generate chaos here: simply damping steepest descent by a constant factor close to 1 will do. Several variants of the family of faster gradient descent methods are investigated, both experimentally and analytically. The fastest practical methods of this family in general appear to be the known, chaotic, two-step ones. Our results also highlight the need of better theory for existing faster gradient descent methods.
引用
收藏
页码:560 / 581
页数:22
相关论文
共 30 条
[2]  
[Anonymous], 1996, Dynamical systems and numerical analysis
[3]  
[Anonymous], 1999, SPRINGER SCI
[4]  
[Anonymous], 2002, COMPUTATIONAL METHOD
[5]  
[Anonymous], 1982, Stability of Motion
[6]   Artificial time integration [J].
Ascher, U. M. ;
Huang, H. ;
van den Doel, K. .
BIT NUMERICAL MATHEMATICS, 2007, 47 (01) :3-25
[7]   GRADIENT DESCENT AND FAST ARTIFICIAL TIME INTEGRATION [J].
Ascher, Uri M. ;
van den Doel, Kees ;
Huang, Hui ;
Svaiter, Benar F. .
ESAIM-MATHEMATICAL MODELLING AND NUMERICAL ANALYSIS-MODELISATION MATHEMATIQUE ET ANALYSE NUMERIQUE, 2009, 43 (04) :689-708
[8]   2-POINT STEP SIZE GRADIENT METHODS [J].
BARZILAI, J ;
BORWEIN, JM .
IMA JOURNAL OF NUMERICAL ANALYSIS, 1988, 8 (01) :141-148
[9]  
Bhaya A., 2009, 48 IEEE C DEC CONTR, P2347
[10]   On the asymptotic behaviour of some new gradient methods [J].
Dai, YH ;
Fletcher, R .
MATHEMATICAL PROGRAMMING, 2005, 103 (03) :541-559