Optimal learning for sequential sampling with non-parametric beliefs

被引:5
|
作者
Barut, Emre [1 ]
Powell, Warren B. [1 ]
机构
[1] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
关键词
Bayesian global optimization; Knowledge gradient; Non-parametric estimation; GLOBAL OPTIMIZATION; KNOWLEDGE-GRADIENT; APPROXIMATION; AGGREGATION; SELECTION;
D O I
10.1007/s10898-013-0050-5
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We propose a sequential learning policy for ranking and selection problems, where we use a non-parametric procedure for estimating the value of a policy. Our estimation approach aggregates over a set of kernel functions in order to achieve a more consistent estimator. Each element in the kernel estimation set uses a different bandwidth to achieve better aggregation. The final estimate uses a weighting scheme with the inverse mean square errors of the kernel estimators as weights. This weighting scheme is shown to be optimal under independent kernel estimators. For choosing the measurement, we employ the knowledge gradient policy that relies on predictive distributions to calculate the optimal sampling point. Our method allows a setting where the beliefs are expected to be correlated but the correlation structure is unknown beforehand. Moreover, the proposed policy is shown to be asymptotically optimal.
引用
收藏
页码:517 / 543
页数:27
相关论文
共 50 条
  • [41] Adaptive Learning Control for Nonlinear Systems With Parametric and Non-Parametric Uncertainties
    Sun, Yunping
    Xu, Tianwei
    Xia, Youming
    Xiao, Fei
    PROCEEDINGS OF 2010 ASIA-PACIFIC YOUTH CONFERENCE ON COMMUNICATION, VOLS 1 AND 2, 2010, : 315 - 320
  • [42] Accelerated parallel non-conjugate sampling for Bayesian non-parametric models
    Zhang, Michael Minyi
    Williamson, Sinead A.
    Perez-Cruz, Fernando
    STATISTICS AND COMPUTING, 2022, 32 (03)
  • [43] Optimal Sufficient Statistics for Parametric and Non-Parametric Multiple Simultaneous Hypothesis Testing
    Oba, Shigeyuki
    Ishii, Shin
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2009, 5 (01):
  • [44] Accelerated parallel non-conjugate sampling for Bayesian non-parametric models
    Michael Minyi Zhang
    Sinead A. Williamson
    Fernando Pérez-Cruz
    Statistics and Computing, 2022, 32
  • [45] Effect of Sampling Rate on Parametric and Non-parametric Data Preprocessing for Gearbox Fault Diagnosis
    Kumar, Vikash
    Kumar, Sanjeev
    Sarangi, Somnath
    JOURNAL OF VIBRATION ENGINEERING & TECHNOLOGIES, 2024, 12 (02) : 1195 - 1202
  • [46] Iterative learning control for systems with both parametric and non-parametric uncertainties
    Er, MJ
    Xu, J
    2004 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-3, 2004, : 30 - 35
  • [47] Effect of Sampling Rate on Parametric and Non-parametric Data Preprocessing for Gearbox Fault Diagnosis
    Vikash Kumar
    Sanjeev Kumar
    Somnath Sarangi
    Journal of Vibration Engineering & Technologies, 2024, 12 : 1195 - 1202
  • [48] Non-parametric learning of lifted Restricted Boltzmann Machines
    Kaur, Navdeep
    Kunapuli, Gautam
    Natarajan, Sriraam
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2020, 120 : 33 - 47
  • [49] Non-Parametric Kernel Learning with robust pairwise constraints
    Changyou Chen
    Junping Zhang
    Xuefang He
    Zhi-Hua Zhou
    International Journal of Machine Learning and Cybernetics, 2012, 3 : 83 - 96
  • [50] Learning Non-Parametric Surrogate Losses With Correlated Gradients
    Yoa, Seungdong
    Park, Jinyoung
    Kim, Hyunwoo J.
    IEEE ACCESS, 2021, 9 : 141199 - 141209