Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control

被引:0
作者
Mathe, Koppany [1 ]
Busoniu, Lucian [1 ]
Munos, Remi [2 ]
De Schutter, Bart [3 ]
机构
[1] Tech Univ Cluj Napoca, Dept Automat, Cluj Napoca, Romania
[2] Google DeepMind, London, England
[3] Delft Univ Technol, Delft Ctr Syst & Control, Delft, Netherlands
关键词
Optimal control; Planning; Nonlinear predictive control; Near-optimality analysis; MODEL-PREDICTIVE CONTROL; EXPLICIT; OPTIMIZATION;
D O I
10.1016/j.engappai.2017.08.020
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider infinite-horizon optimal control of nonlinear systems where the control actions are discrete, and focus on optimistic planning algorithms from artificial intelligence, which can handle general nonlinear systems with nonquadratic costs. With the main goal of reducing computations, we introduce two such algorithms that only search for constrained action sequences. The constraint prevents the sequences from switching between different actions more than a limited number of times. We call the first method optimistic switch-limited planning (OSP), and develop analysis showing that its fixed number of switches S leads to polynomial complexity in the search horizon, in contrast to the exponential complexity of the existing OP algorithm for deterministic systems; and to a correspondingly faster convergence towards optimality. Since tuning S is difficult, we introduce an adaptive variant called OASP that automatically adjusts S so as to limit computations while ensuring that near-optimal solutions keep being explored. OSP and OASP are analytically evaluated in representative special cases, and numerically illustrated in simulations of a rotational pendulum. To show that the algorithms also work in challenging applications, OSP is used to control the pendulum in real time, while OASP is applied for trajectory control of a simulated quadrotor. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:355 / 367
页数:13
相关论文
共 32 条
[1]   A {0,1} linear program for fixed-profile load scheduling and demand management in automated irrigation channels [J].
Alende, Julien ;
Li, Yuping ;
Cantoni, Michael .
PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, :597-602
[2]   A review of interactive methods for multiobjective integer and mixed-integer programming [J].
Alves, Maria Joao ;
Climaco, Joao .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 180 (01) :99-115
[3]  
[Anonymous], TECH REP
[4]  
[Anonymous], 2006, Planning algorithms
[5]  
[Anonymous], 2004, IEEE T AUTOMAT CONTR, DOI DOI 10.1109/TAC.1972.1100008
[6]   Using hash tables to manage the time-storage complexity in a point location problem: Application to explicit model predictive control [J].
Bayat, Farhad ;
Johansen, Tor Arne ;
Jalali, Ali Akbar .
AUTOMATICA, 2011, 47 (03) :571-577
[7]   The explicit linear quadratic regulator for constrained systems [J].
Bemporad, A ;
Morari, M ;
Dua, V ;
Pistikopoulos, EN .
AUTOMATICA, 2002, 38 (01) :3-20
[8]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[9]  
Busoniu L., 2012, International Conference on Artificial Intelligence and Statistics, P182
[10]   Robust Self-Triggered Coordination With Ternary Controllers [J].
De Persis, Claudio ;
Frasca, Paolo .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2013, 58 (12) :3024-3038