Efficient sampling in approximate dynamic programming algorithms

被引:21
|
作者
Cervellera, Cristiano [1 ]
Muselli, Marco [1 ]
机构
[1] Ist Studi Sistemi Intelligenti Lautomaz, Consiglio Nazl Ric, I-16149 Genoa, Italy
关键词
stochastic optimal control problem; dynamic programming; sample complexity; deterministic learning; low-discrepancy sequences;
D O I
10.1007/s10589-007-9054-8
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Dynamic Programming (DP) is known to be a standard optimization tool for solving Stochastic Optimal Control (SOC) problems, either over a finite or an infinite horizon of stages. Under very general assumptions, commonly employed numerical algorithms are based on approximations of the cost-to-go functions, by means of suitable parametric models built from a set of sampling points in the d-dimensional state space. Here the problem of sample complexity, i.e., how "fast" the number of points must grow with the input dimension in order to have an accurate estimate of the cost-to-go functions in typical DP approaches such as value iteration and policy iteration, is discussed. It is shown that a choice of the sampling based on low-discrepancy sequences, commonly used for efficient numerical integration, permits to achieve, under suitable hypotheses, an almost linear sample complexity, thus contributing to mitigate the curse of dimensionality of the approximate DP procedure.
引用
收藏
页码:417 / 443
页数:27
相关论文
共 50 条
  • [41] Approximate Dynamic Programming Captures Fleet Operations for Schneider National
    Simao, Hugo P.
    George, Abraham
    Powell, Warren B.
    Gifford, Ted
    Nienow, John
    Day, Jeff
    INTERFACES, 2010, 40 (05) : 342 - 352
  • [42] A Dynamic Programming Approach for Approximate Optimal Control for Cancer Therapy
    A. Nowakowski
    A. Popa
    Journal of Optimization Theory and Applications, 2013, 156 : 365 - 379
  • [43] An approximate dynamic programming approach to attended home delivery management
    Yang, Xinan
    Strauss, Arne K.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2017, 263 (03) : 935 - 945
  • [44] An approximate dynamic programming approach to tackling mass evacuation operations
    Rempel, Mark
    Shiell, Nicholi
    Tessier, Kaeden
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [45] Approximate Dynamic Programming for Traffic Signal Control at Isolated Intersection
    Yin, Biao
    Dridi, Mahjoub
    El Moudni, Abdellah
    MODERN TRENDS AND TECHNIQUES IN COMPUTER SCIENCE (CSOC 2014), 2014, 285 : 369 - 381
  • [46] A Dynamic Programming Approach for Approximate Optimal Control for Cancer Therapy
    Nowakowski, A.
    Popa, A.
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2013, 156 (02) : 365 - 379
  • [47] Visualization Techniques for the Design and Analysis of Dynamic Programming Algorithms
    Zhu, Ying
    2024 28TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION, IV 2024, 2024, : 20 - 25
  • [48] EXPERIMENTS WITH DYNAMIC-PROGRAMMING ALGORITHMS FOR NONSEPARABLE PROBLEMS
    DOMINGO, A
    SNIEDOVICH, M
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1993, 67 (02) : 172 - 187
  • [49] Dynamic Programming algorithms and their applications in machine scheduling: A review
    Goncalves de Souza, Edson Antonio
    Nagano, Marcelo Seido
    Rolim, Gustavo Alencar
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 190
  • [50] Enhanced dynamic programming algorithms for series line optimization
    Veatch, MH
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (01) : 159 - 164