Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control

被引:0
|
作者
Mathe, Koppany [1 ]
Busoniu, Lucian [1 ]
Munos, Remi [2 ]
De Schutter, Bart [3 ]
机构
[1] Tech Univ Cluj Napoca, Dept Automat, Cluj Napoca, Romania
[2] Google DeepMind, London, England
[3] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
关键词
Optimal control; Planning; Nonlinear predictive control; Near-optimality analysis; MODEL-PREDICTIVE CONTROL; EXPLICIT; OPTIMIZATION;
D O I
10.1016/j.engappai.2017.08.020
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider infinite-horizon optimal control of nonlinear systems where the control actions are discrete, and focus on optimistic planning algorithms from artificial intelligence, which can handle general nonlinear systems with nonquadratic costs. With the main goal of reducing computations, we introduce two such algorithms that only search for constrained action sequences. The constraint prevents the sequences from switching between different actions more than a limited number of times. We call the first method optimistic switch-limited planning (OSP), and develop analysis showing that its fixed number of switches S leads to polynomial complexity in the search horizon, in contrast to the exponential complexity of the existing OP algorithm for deterministic systems; and to a correspondingly faster convergence towards optimality. Since tuning S is difficult, we introduce an adaptive variant called OASP that automatically adjusts S so as to limit computations while ensuring that near-optimal solutions keep being explored. OSP and OASP are analytically evaluated in representative special cases, and numerically illustrated in simulations of a rotational pendulum. To show that the algorithms also work in challenging applications, OSP is used to control the pendulum in real time, while OASP is applied for trajectory control of a simulated quadrotor. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:355 / 367
页数:13
相关论文
共 50 条
  • [31] Asymptotically optimal inspection planning via efficient near-optimal search on sampled roadmaps
    Fu, Mengyu
    Kuntz, Alan
    Salzman, Oren
    Alterovitz, Ron
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2023, 42 (4-5) : 150 - 175
  • [32] Output feedback near-optimal control of atomic force microscope
    Sutton, Joshua L.
    Al Saaideh, Mohammad
    Al Janaideh, Mohammad
    Boker, Almuatazbellah
    INTERNATIONAL JOURNAL OF CONTROL, 2025,
  • [33] Event-Triggered Near-Optimal Control for Unknown Discrete-Time Nonlinear Systems Using Parallel Control
    Lu, Jingwei
    Wei, Qinglai
    Zhou, Tianmin
    Wang, Ziyang
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1890 - 1904
  • [34] Near-Optimal Receding Horizon Control of Thermal Energy Storage
    Qureshi, O. A.
    Armstrong, P. R.
    JOURNAL OF ENERGY RESOURCES TECHNOLOGY-TRANSACTIONS OF THE ASME, 2022, 144 (06):
  • [35] Near-Optimal Control for Aircraft Conflict Resolution in the Presence of Uncertainty
    Matsuno, Yoshinori
    Tsuchiya, Takeshi
    Matayoshi, Naoki
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2016, 39 (02) : 326 - 338
  • [36] Optimistic planning algorithms for state-constrained optimal control problems
    Bokanowski, Olivier
    Gammoudi, Nidhal
    Zidani, Hasnaa
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2022, 109 : 158 - 179
  • [37] Scalable distributed algorithms for multi-robot near-optimal motion planning
    Zhao, Guoxiang
    Zhu, Minghui
    AUTOMATICA, 2022, 140
  • [38] Toward Asymptotically-Optimal Inspection Planning via Efficient Near-Optimal Graph Search
    Fu, Mengyu
    Kuntz, Alan
    Salzman, Oren
    Alterovitz, Ron
    ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,
  • [39] A sparse sampling algorithm for near-optimal planning in large Markov decision processes
    Kearns, M
    Mansour, Y
    Ng, AY
    MACHINE LEARNING, 2002, 49 (2-3) : 193 - 208
  • [40] A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes
    Michael Kearns
    Yishay Mansour
    Andrew Y. Ng
    Machine Learning, 2002, 49 : 193 - 208