Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control

Cited by: 0
Authors
Mathe, Koppany [1]
Busoniu, Lucian [1]
Munos, Remi [2]
De Schutter, Bart [3]
Affiliations
[1] Tech Univ Cluj Napoca, Dept Automat, Cluj Napoca, Romania
[2] Google DeepMind, London, England
[3] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
Keywords
Optimal control; Planning; Nonlinear predictive control; Near-optimality analysis; MODEL-PREDICTIVE CONTROL; EXPLICIT; OPTIMIZATION
DOI
10.1016/j.engappai.2017.08.020
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We consider infinite-horizon optimal control of nonlinear systems where the control actions are discrete, and focus on optimistic planning algorithms from artificial intelligence, which can handle general nonlinear systems with nonquadratic costs. With the main goal of reducing computations, we introduce two such algorithms that only search for constrained action sequences. The constraint prevents the sequences from switching between different actions more than a limited number of times. We call the first method optimistic switch-limited planning (OSP), and develop an analysis showing that its fixed number of switches S leads to polynomial complexity in the search horizon, in contrast to the exponential complexity of the existing OP algorithm for deterministic systems, and to a correspondingly faster convergence towards optimality. Since tuning S is difficult, we introduce an adaptive variant called OASP that automatically adjusts S so as to limit computations while ensuring that near-optimal solutions keep being explored. OSP and OASP are analytically evaluated in representative special cases, and numerically illustrated in simulations of a rotational pendulum. To show that the algorithms also work in challenging applications, OSP is used to control the pendulum in real time, while OASP is applied for trajectory control of a simulated quadrotor. © 2017 Elsevier Ltd. All rights reserved.
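The polynomial-versus-exponential contrast stated in the abstract can be illustrated with a simple count of candidate action sequences. The sketch below is a combinatorial illustration only, not the paper's OSP algorithm or its complexity analysis. The names `count_switch_limited` and `count_unconstrained`, and the symbols `M` (number of discrete actions) and `d` (sequence length), are assumptions introduced here; only `S` (the switch limit) comes from the abstract.

```python
from math import comb


def count_switch_limited(M: int, d: int, S: int) -> int:
    """Count length-d sequences over M actions with at most S action switches.

    A sequence with exactly s switches is fixed by choosing which s of the
    d - 1 gaps between consecutive steps hold a switch, the first action
    (M choices), and a different action at each switch (M - 1 choices each).
    """
    if d == 0:
        return 1  # only the empty sequence
    max_switches = min(S, d - 1)
    return sum(comb(d - 1, s) * M * (M - 1) ** s for s in range(max_switches + 1))


def count_unconstrained(M: int, d: int) -> int:
    """Count all length-d sequences over M actions (no switch limit)."""
    return M ** d


if __name__ == "__main__":
    M, S = 3, 2  # hypothetical values chosen for illustration
    for d in (5, 10, 20, 40):
        limited = count_switch_limited(M, d, S)
        full = count_unconstrained(M, d)
        print(f"d={d:2d}  switch-limited={limited}  unconstrained={full}")
```

For fixed `S`, the switch-limited count grows like a polynomial of degree `S` in `d`, while the unconstrained count grows as `M ** d`. Once `S` is at least `d - 1` the constraint is inactive and the two counts coincide.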
Pages: 355-367
Page count: 13
Related papers
50 items in total
  • [21] Near-Optimal Control of Nonlinear Systems With Hybrid Inputs and Dwell-Time Constraints
    Lal, Ioana
    Morarescu, Irinel-Constantin
    Daafouz, Jamal
    Busoniu, Lucian
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 2455 - 2460
  • [22] Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning
    Yu-Qing Qiu
    Yan Li
    Zhong Wang
    International Journal of Control, Automation and Systems, 2023, 21 : 1319 - 1330
  • [23] Near-optimal control of dynamical systems with neural ordinary differential equations
    Boettcher, Lucas
    Asikis, Thomas
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (04):
  • [24] Adaptive near-optimal consensus of high-order nonlinear multi-agent systems with heterogeneity
    Zhang, Yinyan
    Li, Shuai
    AUTOMATICA, 2017, 85 : 426 - 432
  • [25] Procrastinating with Confidence: Near-Optimal, Anytime, Adaptive Algorithm Configuration
    Kleinberg, Robert
    Leyton-Brown, Kevin
    Lucier, Brendan
    Graham, Devon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [26] Adaptive Near-Optimal Compensation in Lossy Polyphase Power Systems
    Lev-Ari, Hanoch
    Hernandez, Ronald D.
    Stankovic, Aleksandar M.
    Marengo, Edwin A.
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2018, 26 (02) : 732 - 739
  • [27] Adaptive near-optimal neuro controller for continuous-time nonaffine nonlinear systems with constrained input
    Esfandiari, Kasra
    Abdollahi, Farzaneh
    Talebi, Heidar Ali
    NEURAL NETWORKS, 2017, 93 : 195 - 204
  • [28] Performance-based near-optimal vibration control for nonlinear offshore platforms with delayed input
    Zhong, Xiao-Fang
    Han, Shi-Yuan
    Zhou, Jin
    Chen, Yue-Hui
    Tang, Gong-You
    ASIAN JOURNAL OF CONTROL, 2021, 23 (01) : 513 - 524
  • [29] Robust near-optimal control via unchattering sliding mode control
    Bartolini, G
    Sanna, S
    Usai, E
    COMPUTING ANTICIPATORY SYSTEMS: CASYS - FIRST INTERNATIONAL CONFERENCE, 1998, 437 : 269 - 283
  • [30] Near-optimal search-and-rescue path planning for a moving target
    Berger, Jean
    Barkaoui, Mohamed
    Lo, Nassirou
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2021, 72 (03) : 688 - 700