Efficient sampling in approximate dynamic programming algorithms

Cited by: 21
Authors
Cervellera, Cristiano [1]
Muselli, Marco [1]
Affiliations
[1] Ist Studi Sistemi Intelligenti Lautomaz, Consiglio Nazl Ric, I-16149 Genoa, Italy
Keywords
stochastic optimal control problem; dynamic programming; sample complexity; deterministic learning; low-discrepancy sequences
DOI
10.1007/s10589-007-9054-8
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research]
Subject classification codes
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
Abstract
Dynamic Programming (DP) is a standard optimization tool for solving Stochastic Optimal Control (SOC) problems over either a finite or an infinite horizon of stages. Under very general assumptions, commonly employed numerical algorithms rely on approximations of the cost-to-go functions by means of suitable parametric models built from a set of sampling points in the d-dimensional state space. This paper discusses the problem of sample complexity, i.e., how "fast" the number of points must grow with the input dimension to obtain an accurate estimate of the cost-to-go functions in typical DP approaches such as value iteration and policy iteration. It is shown that choosing the sampling points from low-discrepancy sequences, commonly used for efficient numerical integration, makes it possible to achieve, under suitable hypotheses, an almost linear sample complexity, thus helping to mitigate the curse of dimensionality of the approximate DP procedure.
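As an illustration of the sampling idea described in the abstract, the sketch below generates points from a Halton sequence (one common low-discrepancy sequence) and compares them with pseudo-random points on a toy numerical integration problem. It is not the authors' algorithm: the choice of the Halton construction, the test integrand, and the names `radical_inverse`, `halton`, `uniform_random`, and `integration_error` are all assumptions made for this example.

```python
import random

# First few primes, used as pairwise coprime bases (assumption: dim <= 6).
PRIMES = [2, 3, 5, 7, 11, 13]


def radical_inverse(index, base):
    """Van der Corput radical inverse: mirror the base-`base` digits of
    `index` around the radix point, giving a value in [0, 1)."""
    result, fraction = 0.0, 1.0 / base
    while index > 0:
        index, digit = divmod(index, base)
        result += digit * fraction
        fraction /= base
    return result


def halton(n_points, dim):
    """First `n_points` points of the Halton sequence in [0, 1)^dim."""
    if dim > len(PRIMES):
        raise ValueError("this sketch supports at most %d dimensions" % len(PRIMES))
    return [
        [radical_inverse(i, PRIMES[k]) for k in range(dim)]
        for i in range(1, n_points + 1)
    ]


def uniform_random(n_points, dim, seed=0):
    """Pseudo-random i.i.d. uniform points, for comparison."""
    rng = random.Random(seed)
    return [[rng.random() for _ in range(dim)] for _ in range(n_points)]


def integration_error(points):
    """Absolute error of the sample mean of f(x) = x_1 * ... * x_d,
    whose exact integral over the unit cube is 2**(-d)."""
    dim = len(points[0])
    total = 0.0
    for x in points:
        product = 1.0
        for coordinate in x:
            product *= coordinate
        total += product
    return abs(total / len(points) - 2.0 ** (-dim))


if __name__ == "__main__":
    for n in (64, 256, 1024):
        print(n,
              integration_error(halton(n, 3)),
              integration_error(uniform_random(n, 3)))
```

On smooth integrands such as this one, the low-discrepancy points typically give a smaller error than pseudo-random points for the same sample size, which is the property the abstract relies on when it uses such sequences to sample the state space. How this translates into bounds on the cost-to-go approximation depends on the hypotheses stated in the paper itself.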
Pages: 417-443
Page count: 27