Eigenfunction approximation methods for linearly-solvable optimal control problems

被引:0
|
作者
Todorov, Emanuel [1 ]
机构
[1] Univ Calif San Diego, Dept Cognit Sci, La Jolla, CA 92093 USA
来源
ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING | 2009年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have identified a general class of nonlinear stochastic optimal control problems which can be reduced to computing the principal eigenfunction of a linear operator. Here we develop function approximation methods exploiting this inherent linearity. First we discretize the time axis in a novel way, yielding an integral operator that approximates not only our control problems but also more general elliptic PDEs. The eigenfunction problem is then approximated with a finite-dimensional eigenvector problem - by discretizing the state space, or by projecting on a set of adaptive bases evaluated at a set of collocation states. Solving the resulting eigenvector problem is faster than applying policy or value iteration. The bases are adapted via Levenberg-Marquardt minimization with guaranteed convergence. The collocation set can also be adapted so as to focus the approximation on a region of interest. Numerical results on test problems are provided.
引用
收藏
页码:161 / 168
页数:8
相关论文
共 50 条
  • [1] Moving least-squares approximations for linearly-solvable stochastic optimal control problems
    Zhong M.
    Todorov E.
    Journal of Control Theory and Applications, 2011, 9 (3): : 451 - 463
  • [2] Moving least-squares approximations for linearly-solvable stochastic optimal control problems
    Emanuel TODOROV
    JournalofControlTheoryandApplications, 2011, 9 (03) : 451 - 463
  • [3] Hierarchical Linearly-Solvable Markov Decision Problems
    Jonsson, Anders
    Gomez, Vicenc
    TWENTY-SIXTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING (ICAPS 2016), 2016, : 193 - 201
  • [4] Temporal difference approach in linearly-solvable Markov decision problems
    Burdelis, M. A. P.
    Ikeda, K.
    PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11), 2011, : 808 - 811
  • [5] Globally Optimal Hierarchical Reinforcement Learning for Linearly-Solvable Markov Decision Processes
    Infante, Guillermo
    Jonsson, Anders
    Gomez, Vicenc
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6970 - 6977
  • [6] Linearly-Solvable Mean-Field Approximation for Multi-Team Road Traffic Games
    Pedram, Ali Reza
    Tanaka, Takashi
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 1243 - 1248
  • [7] Privacy-Aware Load Ensemble Control: A Linearly-Solvable MDP Approach
    Hassan, Ali
    Deka, Deepjyoti
    Dvorkin, Yury
    IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (01) : 255 - 267
  • [8] Approximation methods for solving fractional optimal control problems
    Samaneh Soradi Zeid
    Sohrab Effati
    Ali Vahidian Kamyad
    Computational and Applied Mathematics, 2018, 37 : 158 - 182
  • [9] Approximation methods for solving fractional optimal control problems
    Zeid, Samaneh Soradi
    Effati, Sohrab
    Kamyad, Ali Vahidian
    COMPUTATIONAL & APPLIED MATHEMATICS, 2018, 37 : 158 - 182
  • [10] Linearly Solvable Stochastic Optimal Control for Infinite-Dimensional Systems
    Theodorou, Evangelos A.
    Boutselis, George I.
    Bakshi, Kaivalya
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 4110 - 4116