Adaptive polyhedral meshing for approximate dynamic programming in control

Cited by: 7

Authors:

Sala, Antonio [1]

Armesto, Leopoldo [2]

Affiliations:

[1] Univ Politecn Valencia, Inst Univ Automat & Informat Ind AI2, Camino Vera S-N, Valencia 46022, Spain

[2] Univ Politecn Valencia, Inst Diseno & Fabricac IDF, Camino Vera S-N, Valencia 46022, Spain

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2022 / Volume 107

Keywords:

Optimal control; Dynamic programming; Function approximation; Refinement method; Grid scheme; Performance

DOI:

10.1016/j.engappai.2021.104515

Chinese Library Classification (CLC):

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

This work proposes a new criterion for adaptive meshing of polyhedral partitions that interpolate a value function in Approximate Dynamic Programming (ADP) for optimal control problems. The criterion adds new points to a simplicial mesh based on three elements: a user-defined probability density function over initial conditions, which determines the 'influential' regions of the state space; uncertainty (variance) propagation; and temporal-difference error. A collection of lemmas justifies the algorithmic proposal. A comparative analysis with other options in the literature highlights the advantages of the proposal. The developed methods are applied to simulation examples and an experimental robotic setup.

Pages: 12
