Conservative set valued fields, automatic differentiation, stochastic gradient methods and deep learning

被引:58
作者
Bolte, Jerome [1 ]
Pauwels, Edouard [2 ,3 ]
机构
[1] Univ Toulouse 1 Capitole, Toulouse Sch Econ, Toulouse, France
[2] Univ Toulouse, CNRS, IRIT, Toulouse, France
[3] DEEL IRT St Exupery, Toulouse, France
关键词
Deep learning; Automatic differentiation; Backpropagation algorithm; Nonsmooth stochastic optimization; Definable sets; o-Minimal structures; Stochastic gradient; Clarke subdifferential; First order methods; SUBDIFFERENTIALS; APPROXIMATIONS;
D O I
10.1007/s10107-020-01501-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Modern problems in AI or in numerical analysis require nonsmooth approaches with a flexible calculus. We introduce generalized derivatives called conservative fields for which we develop a calculus and provide representation formulas. Functions having a conservative field are called path differentiable: convex, concave, Clarke regular and any semialgebraic Lipschitz continuous functions are path differentiable. Using Whitney stratification techniques for semialgebraic and definable sets, our model provides variational formulas for nonsmooth automatic differentiation oracles, as for instance the famous backpropagation algorithm in deep learning. Our differential model is applied to establish the convergence in values of nonsmooth stochastic gradient methods as they are implemented in practice.
引用
收藏
页码:19 / 51
页数:33
相关论文
共 57 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
ADIL S, 2018, THESIS
[3]  
ALIPRANTIS C., 1999, Infinite-dimensional analysis. A hitchhiker's guide, DOI 10.1007/3-540-29587-9
[4]  
[Anonymous], RAAG NOTES
[5]  
[Anonymous], 2010, Applied Shape Optimization for Fluids
[6]  
[Anonymous], 1963, THESIS
[7]   The heavy ball with friction method, I. The continuous dynamical system: Global exploration of the local minima of a real-valued function by asymptotic analysis of a dissipative dynamical system [J].
Attouch, H ;
Goudou, X ;
Redont, P .
COMMUNICATIONS IN CONTEMPORARY MATHEMATICS, 2000, 2 (01) :1-34
[8]  
Aubin Jean-Pierre., 1984, Set-valued maps and viability theory, V264, pxiii+342
[9]  
Aubin JP., 2009, SET-VALUED ANAL
[10]  
BARAKAT A, 2018, ARXIV181002263