Optimal learning for sequential sampling with non-parametric beliefs

被引:5
|
作者
Barut, Emre [1 ]
Powell, Warren B. [1 ]
机构
[1] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
关键词
Bayesian global optimization; Knowledge gradient; Non-parametric estimation; GLOBAL OPTIMIZATION; KNOWLEDGE-GRADIENT; APPROXIMATION; AGGREGATION; SELECTION;
D O I
10.1007/s10898-013-0050-5
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We propose a sequential learning policy for ranking and selection problems, where we use a non-parametric procedure for estimating the value of a policy. Our estimation approach aggregates over a set of kernel functions in order to achieve a more consistent estimator. Each element in the kernel estimation set uses a different bandwidth to achieve better aggregation. The final estimate uses a weighting scheme with the inverse mean square errors of the kernel estimators as weights. This weighting scheme is shown to be optimal under independent kernel estimators. For choosing the measurement, we employ the knowledge gradient policy that relies on predictive distributions to calculate the optimal sampling point. Our method allows a setting where the beliefs are expected to be correlated but the correlation structure is unknown beforehand. Moreover, the proposed policy is shown to be asymptotically optimal.
引用
收藏
页码:517 / 543
页数:27
相关论文
共 50 条
  • [1] Optimal learning for sequential sampling with non-parametric beliefs
    Emre Barut
    Warren B. Powell
    Journal of Global Optimization, 2014, 58 : 517 - 543
  • [2] Optimal sequential design in a controlled non-parametric regression
    Efromovich, Sam
    SCANDINAVIAN JOURNAL OF STATISTICS, 2008, 35 (02) : 266 - 285
  • [3] Non-parametric manifold learning
    Asta, Dena Marie
    ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (02): : 3903 - 3930
  • [4] ON ASYMPTOTICALLY OPTIMAL NON-PARAMETRIC CRITERIA
    BOROKOV, AA
    SYCHEVA, NM
    THEORY OF PROBILITY AND ITS APPLICATIONS,USSR, 1968, 13 (03): : 359 - &
  • [5] Optimal Non-parametric Learning in Repeated Contextual Auctions with Strategic Buyer
    Drutsa, Alexey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [6] Non-parametric estimation of sequential english auctions
    Brendstrup, Bjarne
    JOURNAL OF ECONOMETRICS, 2007, 141 (02) : 460 - 481
  • [7] Optimal Non-parametric Learning in Repeated Contextual Auctions with Strategic Buyer
    Drutsa, Alexey
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [8] Non-parametric selected ranked set sampling
    Hossain, SS
    BIOMETRICAL JOURNAL, 2001, 43 (01) : 97 - 105
  • [9] LEARNING NON-PARAMETRIC MODELS OF PRONUNCIATION
    Hutchinson, Brian
    Droppo, Jasha
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4904 - 4907
  • [10] Non-parametric Representation Learning with Kernels
    Esser, Pascal
    Fleissner, Maximilian
    Ghoshdastidar, Debarghya
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 11910 - 11918