Approaching Quartic Convergence Rates for Quasi-Stochastic Approximation with Application to Gradient-Free Optimization

被引：0

作者：

Lauand, Caio Kalil ^{[1
]}

Meyn, Sean ^{[1
]}

机构：

[1] Univ Florida, Gainesville, FL 32611 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年

基金：

美国国家科学基金会;

关键词：

RATIONAL LINEAR FORM; LOGARITHMS; ALGORITHMS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Stochastic approximation is a foundation for many algorithms found in machine learning and optimization. It is in general slow to converge: the mean square error vanishes as O(n(-1)). A deterministic counterpart known as quasi-stochastic approximation is a viable alternative in many applications, including gradient-free optimization and reinforcement learning. It was assumed in prior research that the optimal achievable convergence rate is O(n(-2)). It is shown in this paper that through design it is possible to obtain far faster convergence, of order O(n(-4+delta)), with delta > 0 arbitrary. Two techniques are introduced for the first time to achieve this rate of convergence. The theory is also specialized within the context of gradient-free optimization, and tested on standard benchmarks. The main results are based on a combination of novel application of results from number theory and techniques adapted from stochastic approximation theory.

引用

页数：14

共 50 条

[31]

Mou W, 2020, PR MACH LEARN RES, V125

[32]

Moulines E., 2011, P NEURIPS

[33]

Owen A. B., 2016, Monte Carlo and Quasi-Monte Carlo Methods

[34]

Pasupathy R., 2013, TUTORIALS OPERATIONS, V10, P122, DOI [10.1287/educ.2013.0118, DOI 10.1287/EDUC.2013.0118]

[35]

Polyak B. T., 1990, Problemy Peredachi Informatsii, V26, P45

[36]

POLYAK BT, 1990, AUTOMAT REM CONTR+, V51, P937

[37] ACCELERATION OF STOCHASTIC-APPROXIMATION BY AVERAGING [J].

POLYAK, BT ;

JUDITSKY, AB .

SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1992, 30 (04) :838-855

[38] A STOCHASTIC APPROXIMATION METHOD [J].

ROBBINS, H ;

MONRO, S .

ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (03) :400-407

[39]

Ruppert D., 1988, Efficient Estimations from a Slowly Convergent Robbins-Monro Process

[40]

Shalabh Bhatnagar, 2003, ACM Transactions on Modeling and Computer Simulation, V13, P180, DOI 10.1145/858481.858486

← 1 2 3 4 5 →