Optimal Approximation Schedules for a Class of Iterative Algorithms, With an Application to Multigrid Value Iteration

被引:14
作者
Almudevar, Anthony [1 ]
de Arruda, Edilson Fernandes [2 ,3 ]
机构
[1] Univ Rochester, Dept Biostat & Computat Biol, Rochester, NY 14642 USA
[2] Univ Fed Rio de Janeiro, Dept Ind Engn, Inst Grad Studies & Res Engn, Rio De Janeiro, RJ, Brazil
[3] Univ Fed Rio de Janeiro, Ind Engn Program, Alberto Luiz Coimbra Inst Grad Studies & Res Engn, Rio De Janeiro, RJ, Brazil
关键词
Approximate value iteration; Markov and semi-Markov decision processes; numerical approximation;
D O I
10.1109/TAC.2012.2203053
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many iterative algorithms employ operators which are difficult to evaluate exactly, but for which a graduated range of approximations exist. In such cases, "coarse-to-fine" algorithms are often used, in which a crude initial operator approximation is gradually refined with new iterations. In such algorithms, because the computational complexity increases over iterations, the algorithm's convergence rate is properly calculated with respect to cumulative computation time. This suggests the problem of determining an optimal rate of refinement for the operator approximation. This paper shows that, for linearly convergent algorithm, the optimal rate of refinement approaches the rate of convergence of the exact algorithm itself, regardless of the tolerance-complexity relationship. We illustrate this result with an analysis of "coarse-to-fine" grid algorithms for Markov decision processes with continuous state spaces. Using the methods proposed here we deduce an algorithm that presents optimal complexity results and consists solely of a non-adaptive schedule for the gradual decrease of grid size.
引用
收藏
页码:3132 / 3146
页数:15
相关论文
共 31 条
[1]  
Almudevar A., 2007, P 46 IEEE INT C DEC, P4087
[2]   APPROXIMATE FIXED POINT ITERATION WITH AN APPLICATION TO INFINITE HORIZON MARKOV DECISION PROCESSES [J].
Almudevar, Anthony .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2008, 47 (05) :2303-2347
[3]  
[Anonymous], 1996, Neuro-dynamic programming
[4]  
[Anonymous], 2007, APPL ITERATIVE METHO
[5]   Comparing solution methods for dynamic equilibrium economies [J].
Aruoba, S. Boragan ;
Fernandez-Villaverde, Jesus ;
Rubio-Ramirez, Juan F. .
JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2006, 30 (12) :2477-2508
[6]   Temporal Difference Methods for General Projected Equations [J].
Bertsekas, Dimitri P. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2011, 56 (09) :2128-2139
[7]  
Bertsekas D. P., 1995, DYNAM PROGRAMMING OP, V1-2
[8]   Accelerated gradient based diffuse optical tomographic image reconstruction [J].
Biswas, Samir Kumar ;
Rajan, K. ;
Vasu, R. M. .
MEDICAL PHYSICS, 2011, 38 (01) :539-547
[9]   A survey of computational complexity results in systems and control [J].
Blondel, VD ;
Tsitsiklis, JN .
AUTOMATICA, 2000, 36 (09) :1249-1274
[10]  
Chee-Seng Chow, 1989, Journal of Complexity, V5, P466, DOI 10.1016/0885-064X(89)90021-6