Distributionally Constrained Black-Box Stochastic Gradient Estimation and Optimization

被引:0
作者
Lam, Henry [1 ]
Zhang, Junhui [2 ]
机构
[1] Columbia Univ, Dept Ind Engn & Operat Res, New York, NY 10027 USA
[2] Columbia Univ, Dept Appl Phys & Appl Math, New York, NY 10027 USA
基金
美国国家科学基金会;
关键词
zeroth-order gradient estimation; finite difference; simultaneous perturbation; distributionally robust optimization; stochastic approximation; ROBUST; SIMULATION; SENSITIVITY; RISK;
D O I
10.1287/opre.2021.0307
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We consider stochastic gradient estimation using only black-box function evaluations, where the function argument lies within a probability simplex. This problem is motivated from gradient-descent optimization procedures in multiple applications in distributionally robust analysis and inverse model calibration involving decision variables that are probability distributions. We are especially interested in obtaining gradient estimators where one or few sample observations or simulation runs apply simultaneously to all directions. Conventional zeroth-order gradient schemes such as simultaneous perturbation face challenges as the required moment conditions that allow the "canceling" of higher- order biases cannot be satisfied without violating the simplex constraints. We investigate a new set of required conditions on the random perturbation generator, which leads us to a class of implementable gradient estimators using Dirichlet mixtures. We study the statistical properties of these estimators and their utility in constrained stochastic approximation. We demonstrate the effectiveness of our procedures and compare with benchmarks via several numerical examples.
引用
收藏
页数:16
相关论文
共 53 条
[21]   Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations [J].
Esfahani, Peyman Mohajerin ;
Kuhn, Daniel .
MATHEMATICAL PROGRAMMING, 2018, 171 (1-2) :115-166
[22]  
Flaxman AD, 2005, PROCEEDINGS OF THE SIXTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P385
[23]  
Fox BL, 1989, Probability in the Engineering and Informational Sciences, P299
[24]  
Fu MC, 2006, HBK OPERAT RES MANAG, V13, P575, DOI 10.1016/S0927-0507(06)13019-4
[25]   STOCHASTIC FIRST- AND ZEROTH-ORDER METHODS FOR NONCONVEX STOCHASTIC PROGRAMMING [J].
Ghadimi, Saeed ;
Lan, Guanghui .
SIAM JOURNAL ON OPTIMIZATION, 2013, 23 (04) :2341-2368
[26]   Robust Analysis in Stochastic Simulation: Computation and Performance Guarantees [J].
Ghosh, Soumyadip ;
Lam, Henry .
OPERATIONS RESEARCH, 2019, 67 (01) :232-249
[27]  
Ghosh S, 2015, WINT SIMUL C PROC, P425, DOI 10.1109/WSC.2015.7408184
[28]   Robust risk measurement and model risk [J].
Glasserman, Paul ;
Xu, Xingbo .
QUANTITATIVE FINANCE, 2014, 14 (01) :29-58
[29]   LIKELIHOOD RATIO GRADIENT ESTIMATION FOR STOCHASTIC-SYSTEMS [J].
GLYNN, PW .
COMMUNICATIONS OF THE ACM, 1990, 33 (10) :75-84
[30]   Optimization-Based Calibration of Simulation Input Models [J].
Goeva, Aleksandrina ;
Lam, Henry ;
Qian, Huajie ;
Zhang, Bo .
OPERATIONS RESEARCH, 2019, 67 (05) :1362-1382