Efficient sampling in approximate dynamic programming algorithms

被引:21
|
作者
Cervellera, Cristiano [1 ]
Muselli, Marco [1 ]
机构
[1] Ist Studi Sistemi Intelligenti Lautomaz, Consiglio Nazl Ric, I-16149 Genoa, Italy
关键词
stochastic optimal control problem; dynamic programming; sample complexity; deterministic learning; low-discrepancy sequences;
D O I
10.1007/s10589-007-9054-8
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Dynamic Programming (DP) is known to be a standard optimization tool for solving Stochastic Optimal Control (SOC) problems, either over a finite or an infinite horizon of stages. Under very general assumptions, commonly employed numerical algorithms are based on approximations of the cost-to-go functions, by means of suitable parametric models built from a set of sampling points in the d-dimensional state space. Here the problem of sample complexity, i.e., how "fast" the number of points must grow with the input dimension in order to have an accurate estimate of the cost-to-go functions in typical DP approaches such as value iteration and policy iteration, is discussed. It is shown that a choice of the sampling based on low-discrepancy sequences, commonly used for efficient numerical integration, permits to achieve, under suitable hypotheses, an almost linear sample complexity, thus contributing to mitigate the curse of dimensionality of the approximate DP procedure.
引用
收藏
页码:417 / 443
页数:27
相关论文
共 50 条
  • [21] Approximate Dynamic Programming for Trajectory Tracking of Switched Systems
    Greene, Max L.
    Sakha, Masoud S.
    Kamalapurkar, Rushikesh
    Dixon, Warren E.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2025, 70 (02) : 1024 - 1037
  • [22] Approximate Dynamic Programming for Nonlinear-Constrained Optimizations
    Yang, Xiong
    He, Haibo
    Zhong, Xiangnan
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (05) : 2419 - 2432
  • [23] Kansei engineering, humans and computers: efficient dynamic programming algorithms for combinatorial food packing problems
    Imahori, Shinji
    Karuno, Yoshiyuki
    Nagamochi, Hiroshi
    Wang, Xiaoming
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2011, 3 (03) : 228 - 245
  • [24] Dynamic Programming Algorithms and their application to Financial Management
    Liu, Zhenwei
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (09) : 49 - 55
  • [25] PIPELINE ARCHITECTURES FOR DYNAMIC-PROGRAMMING ALGORITHMS
    CHEN, GH
    CHERN, MS
    JANG, JH
    PARALLEL COMPUTING, 1990, 13 (01) : 111 - 117
  • [26] SYSTOLIC ALGORITHMS FOR THE DYNAMIC-PROGRAMMING PROBLEM
    DHRIF, H
    SARKAR, D
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1992, 41 (3-4) : 151 - 163
  • [27] LOCAL MINIMIZATION ALGORITHMS FOR DYNAMIC PROGRAMMING EQUATIONS
    Kalise, Dante
    Kroener, Axel
    Kunisch, Karl
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2016, 38 (03) : A1587 - A1615
  • [28] Dynamic programming revisited: Improving knapsack algorithms
    Pferschy, U
    COMPUTING, 1999, 63 (04) : 419 - 430
  • [29] Real-Time Energy-Efficient Actuation of Induction Motor Drives Using Approximate Dynamic Programming
    De Keyser, Arne
    Vansompel, Hendrik
    Crevecoeur, Guillaume
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (12) : 11837 - 11846
  • [30] Realizing of algorithms of dynamic programming on mathematica software
    Zhou, Kang
    Zhou, Lang
    Tong, Xiao-Jun
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 560 - +