Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control

被引：0

作者：

Mathe, Koppany ^{[1
]}

Busoniu, Lucian ^{[1
]}

Munos, Remi ^{[2
]}

De Schutter, Bart ^{[3
]}

机构：

[1] Tech Univ Cluj Napoca, Dept Automat, Cluj Napoca, Romania

[2] Google DeepMind, London, England

[3] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2018年 / 67卷

关键词：

Optimal control; Planning; Nonlinear predictive control; Near-optimality analysis; MODEL-PREDICTIVE CONTROL; EXPLICIT; OPTIMIZATION;

D O I：

10.1016/j.engappai.2017.08.020

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We consider infinite-horizon optimal control of nonlinear systems where the control actions are discrete, and focus on optimistic planning algorithms from artificial intelligence, which can handle general nonlinear systems with nonquadratic costs. With the main goal of reducing computations, we introduce two such algorithms that only search for constrained action sequences. The constraint prevents the sequences from switching between different actions more than a limited number of times. We call the first method optimistic switch-limited planning (OSP), and develop analysis showing that its fixed number of switches S leads to polynomial complexity in the search horizon, in contrast to the exponential complexity of the existing OP algorithm for deterministic systems; and to a correspondingly faster convergence towards optimality. Since tuning S is difficult, we introduce an adaptive variant called OASP that automatically adjusts S so as to limit computations while ensuring that near-optimal solutions keep being explored. OSP and OASP are analytically evaluated in representative special cases, and numerically illustrated in simulations of a rotational pendulum. To show that the algorithms also work in challenging applications, OSP is used to control the pendulum in real time, while OASP is applied for trajectory control of a simulated quadrotor. (C) 2017 Elsevier Ltd. All rights reserved.

引用

页码：355 / 367

页数：13

共 50 条

[31] Asymptotically optimal inspection planning via efficient near-optimal search on sampled roadmaps
Fu, Mengyu
Kuntz, Alan
Salzman, Oren
Alterovitz, Ron
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2023, 42 (4-5) : 150 - 175
[32] Output feedback near-optimal control of atomic force microscope
Sutton, Joshua L.
Al Saaideh, Mohammad
Al Janaideh, Mohammad
Boker, Almuatazbellah
INTERNATIONAL JOURNAL OF CONTROL, 2025,
[33] Event-Triggered Near-Optimal Control for Unknown Discrete-Time Nonlinear Systems Using Parallel Control
Lu, Jingwei
Wei, Qinglai
Zhou, Tianmin
Wang, Ziyang
Wang, Fei-Yue
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1890 - 1904
[34] Near-Optimal Receding Horizon Control of Thermal Energy Storage
Qureshi, O. A.
Armstrong, P. R.
JOURNAL OF ENERGY RESOURCES TECHNOLOGY-TRANSACTIONS OF THE ASME, 2022, 144 (06):
[35] Near-Optimal Control for Aircraft Conflict Resolution in the Presence of Uncertainty
Matsuno, Yoshinori
Tsuchiya, Takeshi
Matayoshi, Naoki
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2016, 39 (02) : 326 - 338
[36] Optimistic planning algorithms for state-constrained optimal control problems
Bokanowski, Olivier
Gammoudi, Nidhal
Zidani, Hasnaa
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2022, 109 : 158 - 179
[37] Scalable distributed algorithms for multi-robot near-optimal motion planning
Zhao, Guoxiang
Zhu, Minghui
AUTOMATICA, 2022, 140
[38] Toward Asymptotically-Optimal Inspection Planning via Efficient Near-Optimal Graph Search
Fu, Mengyu
Kuntz, Alan
Salzman, Oren
Alterovitz, Ron
ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,
[39] A sparse sampling algorithm for near-optimal planning in large Markov decision processes
Kearns, M
Mansour, Y
Ng, AY
MACHINE LEARNING, 2002, 49 (2-3) : 193 - 208
[40] A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes
Michael Kearns
Yishay Mansour
Andrew Y. Ng
Machine Learning, 2002, 49 : 193 - 208

← 1 2 3 4 5 →