Approaching Quartic Convergence Rates for Quasi-Stochastic Approximation with Application to Gradient-Free Optimization

被引:0
作者
Lauand, Caio Kalil [1 ]
Meyn, Sean [1 ]
机构
[1] Univ Florida, Gainesville, FL 32611 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
基金
美国国家科学基金会;
关键词
RATIONAL LINEAR FORM; LOGARITHMS; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stochastic approximation is a foundation for many algorithms found in machine learning and optimization. It is in general slow to converge: the mean square error vanishes as O(n(-1)). A deterministic counterpart known as quasi-stochastic approximation is a viable alternative in many applications, including gradient-free optimization and reinforcement learning. It was assumed in prior research that the optimal achievable convergence rate is O(n(-2)). It is shown in this paper that through design it is possible to obtain far faster convergence, of order O(n(-4+delta)), with delta > 0 arbitrary. Two techniques are introduced for the first time to achieve this rate of convergence. The theory is also specialized within the context of gradient-free optimization, and tested on standard benchmarks. The main results are based on a combination of novel application of results from number theory and techniques adapted from stochastic approximation theory.
引用
收藏
页数:14
相关论文
共 50 条
[31]  
Mou W, 2020, PR MACH LEARN RES, V125
[32]  
Moulines E., 2011, P NEURIPS
[33]  
Owen A. B., 2016, Monte Carlo and Quasi-Monte Carlo Methods
[34]  
Pasupathy R., 2013, TUTORIALS OPERATIONS, V10, P122, DOI [10.1287/educ.2013.0118, DOI 10.1287/EDUC.2013.0118]
[35]  
Polyak B. T., 1990, Problemy Peredachi Informatsii, V26, P45
[36]  
POLYAK BT, 1990, AUTOMAT REM CONTR+, V51, P937
[37]   ACCELERATION OF STOCHASTIC-APPROXIMATION BY AVERAGING [J].
POLYAK, BT ;
JUDITSKY, AB .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1992, 30 (04) :838-855
[38]   A STOCHASTIC APPROXIMATION METHOD [J].
ROBBINS, H ;
MONRO, S .
ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (03) :400-407
[39]  
Ruppert D., 1988, Efficient Estimations from a Slowly Convergent Robbins-Monro Process
[40]  
Shalabh Bhatnagar, 2003, ACM Transactions on Modeling and Computer Simulation, V13, P180, DOI 10.1145/858481.858486