Optimal learning for sequential sampling with non-parametric beliefs

被引：5

作者：

Barut, Emre ^{[1
]}

Powell, Warren B. ^{[1
]}

机构：

[1] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA

来源：

JOURNAL OF GLOBAL OPTIMIZATION | 2014年 / 58卷 / 03期

关键词：

Bayesian global optimization; Knowledge gradient; Non-parametric estimation; GLOBAL OPTIMIZATION; KNOWLEDGE-GRADIENT; APPROXIMATION; AGGREGATION; SELECTION;

D O I：

10.1007/s10898-013-0050-5

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

We propose a sequential learning policy for ranking and selection problems, where we use a non-parametric procedure for estimating the value of a policy. Our estimation approach aggregates over a set of kernel functions in order to achieve a more consistent estimator. Each element in the kernel estimation set uses a different bandwidth to achieve better aggregation. The final estimate uses a weighting scheme with the inverse mean square errors of the kernel estimators as weights. This weighting scheme is shown to be optimal under independent kernel estimators. For choosing the measurement, we employ the knowledge gradient policy that relies on predictive distributions to calculate the optimal sampling point. Our method allows a setting where the beliefs are expected to be correlated but the correlation structure is unknown beforehand. Moreover, the proposed policy is shown to be asymptotically optimal.

引用

页码：517 / 543

页数：27

共 50 条

[1] Optimal learning for sequential sampling with non-parametric beliefs [J].

Emre Barut ;

Warren B. Powell .

Journal of Global Optimization, 2014, 58 :517-543

[2] A Refined Non-parametric Algorithm for Sequential Software Reliability Estimation [J].

Mizoguchi, Shintaro ;

Dohi, Tadashi .

ADVANCES IN SOFTWARE ENGINEERING, PROCEEDINGS, 2009, 59 :330-337

[3] Non-parametric adaptive importance sampling for the probability estimation of a launcher impact position [J].

Morio, Jerome .

RELIABILITY ENGINEERING & SYSTEM SAFETY, 2011, 96 (01) :178-183

[4] OPTIMAL LEARNING WITH LOCAL NONLINEAR PARAMETRIC MODELS OVER CONTINUOUS DESIGNS [J].

He, Xinyu ;

Reyes, Kristofer G. ;

Powell, Warren B. .

SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2020, 42 (04) :A2134-A2157

[5] Optimal learning with a local parametric belief model [J].

Cheng, Bolong ;

Jamshidi, Arta ;

Powell, Warren B. .

JOURNAL OF GLOBAL OPTIMIZATION, 2015, 63 (02) :401-425

[6] A general non-parametric active learning framework for classification on multiple manifolds [J].

Huang, Lei ;

Ma, Yuqing ;

Liu, Xianglong .

PATTERN RECOGNITION LETTERS, 2020, 130 :250-258

[7] OPTIMAL LEARNING FOR NONLINEAR PARAMETRIC BELIEF MODELS OVER MULTIDIMENSIONAL CONTINUOUS SPACES [J].

He, Xinyu ;

Ho, Yangzhou ;

Powell, Warren B. .

SIAM JOURNAL ON OPTIMIZATION, 2018, 28 (04) :2945-2974

[8] Penalized Averaging of Parametric and Non-Parametric Quantile Forecasts [J].

De Gooijer, Jan G. ;

Zerom, Dawit .

JOURNAL OF TIME SERIES ECONOMETRICS, 2020, 12 (01)

[9] Non-parametric and semi-parametric asset pricing [J].

Erdos, Peter ;

Ormos, Mihaly ;

Zibriczky, David .

ECONOMIC MODELLING, 2011, 28 (03) :1150-1162

[10] Non-parametric bootstrap tests for parametric distribution families [J].

Szucs, Gabor .

ACTA SCIENTIARUM MATHEMATICARUM, 2011, 77 (3-4) :703-723

← 1 2 3 4 5 →