Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control

被引:0
|
作者
Mathe, Koppany [1 ]
Busoniu, Lucian [1 ]
Munos, Remi [2 ]
De Schutter, Bart [3 ]
机构
[1] Tech Univ Cluj Napoca, Dept Automat, Cluj Napoca, Romania
[2] Google DeepMind, London, England
[3] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
关键词
Optimal control; Planning; Nonlinear predictive control; Near-optimality analysis; MODEL-PREDICTIVE CONTROL; EXPLICIT; OPTIMIZATION;
D O I
10.1016/j.engappai.2017.08.020
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider infinite-horizon optimal control of nonlinear systems where the control actions are discrete, and focus on optimistic planning algorithms from artificial intelligence, which can handle general nonlinear systems with nonquadratic costs. With the main goal of reducing computations, we introduce two such algorithms that only search for constrained action sequences. The constraint prevents the sequences from switching between different actions more than a limited number of times. We call the first method optimistic switch-limited planning (OSP), and develop analysis showing that its fixed number of switches S leads to polynomial complexity in the search horizon, in contrast to the exponential complexity of the existing OP algorithm for deterministic systems; and to a correspondingly faster convergence towards optimality. Since tuning S is difficult, we introduce an adaptive variant called OASP that automatically adjusts S so as to limit computations while ensuring that near-optimal solutions keep being explored. OSP and OASP are analytically evaluated in representative special cases, and numerically illustrated in simulations of a rotational pendulum. To show that the algorithms also work in challenging applications, OSP is used to control the pendulum in real time, while OASP is applied for trajectory control of a simulated quadrotor. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:355 / 367
页数:13
相关论文
共 50 条
  • [41] Identification of sparse nonlinear controlled variables for near-optimal operation of chemical processes
    Ma, Xie
    Guan, Hongwei
    Ye, Lingjian
    CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 2024,
  • [42] Development of a real-time, near-optimal control process for water-distribution networks
    Rao, Zhengfu
    Salomons, Elad
    JOURNAL OF HYDROINFORMATICS, 2007, 9 (01) : 25 - 37
  • [43] HUMANS MAKE NEAR-OPTIMAL ADJUSTMENTS OF CONTROL TO INITIAL BODY CONFIGURATION IN VERTICAL SQUAT JUMPING
    Bobbert, Maarten F.
    Casius, L. J. Richard
    Kistemaker, Dinant A.
    NEUROSCIENCE, 2013, 237 : 232 - 242
  • [44] Near-Optimal Control Without Solving HJB Equations and Its Applications
    Zhang, Yinyan
    Li, Shuai
    Jiang, Xiangyuan
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2018, 65 (09) : 7173 - 7184
  • [45] Near-optimal online control of dynamic discrete-event systems
    Grigorov, Lenko
    Rudie, Karen
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2006, 16 (04): : 419 - 449
  • [46] Near-Optimal Online Control of Dynamic Discrete-Event Systems
    Lenko Grigorov
    Karen Rudie
    Discrete Event Dynamic Systems, 2006, 16 : 419 - 449
  • [47] Learning and Near-Optimal Control of Underactuated Surface Vessels With Periodic Disturbances
    Zhang, Yinyan
    Li, Shuai
    Weng, Jian
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 7453 - 7463
  • [48] Event-triggered near-optimal tracking control based on adaptive dynamic programming for discrete-time systems
    Wang, Ziyang
    Lee, Joonhyup
    Wei, Qinglai
    Zhang, Anting
    NEUROCOMPUTING, 2023, 537 : 187 - 197
  • [49] Multi-objective near-optimal necessary conditions for multi-sectoral planning
    Dubois, Antoine
    Dumas, Jonathan
    Thiran, Paolo
    Limpens, Gauthier
    Ernst, Damien
    APPLIED ENERGY, 2023, 350
  • [50] Near-optimal stabilization for a class of nonlinear systems with control constraint based on single network greedy iterative DHP algorithm
    Luo, Yan-Hong
    Zhang, Hua-Guang
    Cao, Ning
    Chen, Bing
    Zidonghua Xuebao/ Acta Automatica Sinica, 2009, 35 (11): : 1436 - 1445