Adaptive System Optimization Using Random Directions Stochastic Approximation

被引：19

作者：

Prashanth, L. A. ^{[1
]}

Bhatnagar, Shalabh ^{[2
]}

Fu, Michael ^{[1
,3
,4
]}

Marcus, Steve ^{[1
,4
]}

机构：

[1] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA

[2] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore, Karnataka, India

[3] Univ Maryland, Robert H Smith Sch Business, College Pk, MD 20742 USA

[4] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2017年 / 62卷 / 05期

基金：

美国国家科学基金会;

关键词：

Random directions stochastic approximation (RDSA); simultaneous perturbation stochastic approximation (SPSA); stochastic approximation; stochastic optimization; ALGORITHMS;

D O I：

10.1109/TAC.2016.2600643

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present new algorithms for simulation optimization using random directions stochastic approximation (RDSA). These include first-order (gradient) as well as second-order (Newton) schemes. We incorporate both continuous-valued as well as discrete-valued perturbations into both types of algorithms. The former are chosen to be independent and identically distributed (i.i.d.) symmetric uniformly distributed random variables (r.v.), while the latter are i.i.d. asymmetric Bernoulli r.v.s. Our Newton algorithm, with a novel Hessian estimation scheme, requires N-dimensional perturbations and three loss measurements per iteration, whereas the simultaneous perturbation Newton search algorithm of [1] requires 2N-dimensional perturbations and four loss measurements per iteration. We prove the asymptotic unbiasedness of both gradient and Hessian estimates and asymptotic (strong) convergence for both first-order and second-order schemes. We also provide asymptotic normality results, which in particular establish that the asymmetric Bernoulli variant of Newton RDSA method is better than 2SPSA of [1]. Numerical experiments are used to validate the theoretical results.

引用

页码：2223 / 2238

页数：16

共 40 条

[1]

[Anonymous], 1981, Practical optimization

[2]

[Anonymous], 1971, Optimizing Methods in Statistics

[3]

[Anonymous], 2009, Stochastic Approximation: A Dynamical Systems Viewpoint

[4]

Bhatnagar S., 2005, ACM Transactions on Modeling and Computer Simulation, V15, P74, DOI 10.1145/1044322.1044326

[5] Simultaneous Perturbation Newton Algorithms for Simulation Optimization [J].

Bhatnagar, Shalabh ;

Prashanth, L. A. .

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2015, 164 (02) :621-643

[6] Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization [J].

Bhatnagar, Shalabh .

ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2008, 18 (01)

[7] Comparative study of stochastic algorithms for system optimization based on gradient approximations [J].

Chin, DC .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1997, 27 (02) :244-249

[8] Weighted means in stochastic approximation of minima [J].

Dippon, J ;

Renz, J .

SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1997, 35 (05) :1811-1827

[9]

Duchi J. C., 2013, ARXIV13122139

[10] ON ASYMPTOTIC NORMALITY IN STOCHASTIC APPROXIMATION [J].

FABIAN, V .

ANNALS OF MATHEMATICAL STATISTICS, 1968, 39 (04) :1327-&

← 1 2 3 4 →