Approaching Quartic Convergence Rates for Quasi-Stochastic Approximation with Application to Gradient-Free Optimization

被引:0
作者
Lauand, Caio Kalil [1 ]
Meyn, Sean [1 ]
机构
[1] Univ Florida, Gainesville, FL 32611 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
基金
美国国家科学基金会;
关键词
RATIONAL LINEAR FORM; LOGARITHMS; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stochastic approximation is a foundation for many algorithms found in machine learning and optimization. It is in general slow to converge: the mean square error vanishes as O(n(-1)). A deterministic counterpart known as quasi-stochastic approximation is a viable alternative in many applications, including gradient-free optimization and reinforcement learning. It was assumed in prior research that the optimal achievable convergence rate is O(n(-2)). It is shown in this paper that through design it is possible to obtain far faster convergence, of order O(n(-4+delta)), with delta > 0 arbitrary. Two techniques are introduced for the first time to achieve this rate of convergence. The theory is also specialized within the context of gradient-free optimization, and tested on standard benchmarks. The main results are based on a combination of novel application of results from number theory and techniques adapted from stochastic approximation theory.
引用
收藏
页数:14
相关论文
共 50 条
[11]   Weighted means in stochastic approximation of minima [J].
Dippon, J ;
Renz, J .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1997, 35 (05) :1811-1827
[12]   Accelerated randomized stochastic optimization [J].
Dippon, J .
ANNALS OF STATISTICS, 2003, 31 (04) :1260-1281
[13]   ON CHOICE OF DESIGN IN STOCHASTIC APPROXIMATION METHODS [J].
FABIAN, V .
ANNALS OF MATHEMATICAL STATISTICS, 1968, 39 (02) :457-&
[14]   Adaptive and Robust Control in the USSR [J].
Fradkov, Alexander ;
Polyak, Boris T. .
IFAC PAPERSONLINE, 2020, 53 (02) :1373-1378
[15]  
Gosavi A., 2015, SIMULATION BASED OPT
[16]   Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning [J].
Karmakar, Prasenjit ;
Bhatnagar, Shalabh .
MATHEMATICS OF OPERATIONS RESEARCH, 2018, 43 (01) :130-151
[17]   STOCHASTIC ESTIMATION OF THE MAXIMUM OF A REGRESSION FUNCTION [J].
KIEFER, J ;
WOLFOWITZ, J .
ANNALS OF MATHEMATICAL STATISTICS, 1952, 23 (03) :462-466
[18]   Convergence rate of linear two-time-scale stochastic approximation [J].
Konda, VR ;
Tsitsiklis, JN .
ANNALS OF APPLIED PROBABILITY, 2004, 14 (02) :796-819
[19]  
Kovachki N, 2021, J MACH LEARN RES, V22, P1
[20]  
Lapeybe B., 1990, Statistics, V21, P251