On Regression-Based Stopping Times

被引:0
|
作者
Benjamin Van Roy
机构
[1] Stanford University,
来源
Discrete Event Dynamic Systems | 2010年 / 20卷
关键词
Approximate dynamic programming; Optimal stopping; Optimal control;
D O I
暂无
中图分类号
学科分类号
摘要
We study approaches that fit a linear combination of basis functions to the continuation value function of an optimal stopping problem and then employ a greedy policy based on the resulting approximation. We argue that computing weights to maximize expected payoff of the greedy policy or to minimize expected squared-error with respect to an invariant measure is intractable. On the other hand, certain versions of approximate value iteration lead to policies competitive with those that would result from optimizing the latter objective.
引用
收藏
页码:307 / 324
页数:17
相关论文
共 50 条
  • [1] On Regression-Based Stopping Times
    Van Roy, Benjamin
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2010, 20 (03): : 307 - 324
  • [2] Regression-Based Determination of Execution Times in Human-Robot Interaction
    Jansing S.
    Jung L.
    Hoffmann F.
    Deuse J.
    ZWF Zeitschrift fuer Wirtschaftlichen Fabrikbetrieb, 2024, 119 (06): : 440 - 444
  • [3] Differences Scores: Regression-Based Reliable Difference and the Regression-Based Confidence Interval
    Charter, Richard A.
    JOURNAL OF CLINICAL PSYCHOLOGY, 2009, 65 (04) : 456 - 460
  • [4] Regression-based Metric Learning
    Huang, Zheng-Yi
    Mu, Qing-Qing
    Hu, Mian
    Zou, Ying
    Xiao, Jiang-Wen
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9107 - 9112
  • [6] Regression-based Metric Learning
    Moutafis, Panagiotis
    Leng, Mengjun
    Kakadiaris, Ioannis A.
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2700 - 2705
  • [7] A Regression-Based Approach to Scalability Prediction
    Barnes, Bradley J.
    Rountree, Barry
    Lowenthal, David K.
    Reeves, Jaxk
    de Supinski, Bronis
    Schulz, Martin
    ICS'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, 2008, : 368 - +
  • [8] A REGRESSION-BASED LINEAR CLASSIFICATION PROCEDURE
    LAMOTTE, LR
    MCWHORTER, A
    EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1981, 41 (02) : 341 - 347
  • [9] Regression-based Monte Carlo Integration
    Salaun, Corentin
    Gruson, Adrien
    Hua, Binh-Son
    Hachisuka, Toshiya
    Singh, Gurprit
    ACM TRANSACTIONS ON GRAPHICS, 2022, 41 (04):
  • [10] Regression-based tests of predictive ability
    West, KD
    McCracken, MW
    INTERNATIONAL ECONOMIC REVIEW, 1998, 39 (04) : 817 - 840