On a Probabilistic Approach for Inverse Data-Driven Optimal Control

被引:3
作者
Garrabe, Emiland [2 ]
Jesawada, Hozefa [1 ]
Del Vecchio, Carmen [1 ]
Russo, Giovanni [2 ]
机构
[1] Univ Sannio, Dept Engn, Benevento, Italy
[2] Univ Salerno, Dept Informat & Elect Engn & Appl Math, Salerno, Italy
来源
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC | 2023年
关键词
D O I
10.1109/CDC49753.2023.10383391
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of estimating the possibly non-convex cost of an agent by observing its interactions with a nonlinear, non-stationary and stochastic environment. For this inverse problem, we give a result that allows to estimate the cost by solving a convex optimization problem. To obtain this result we also tackle a forward problem. This leads to the formulation of a finite-horizon optimal control problem for which we show convexity and find the optimal solution. Our approach leverages certain probabilistic descriptions that can be obtained both from data and/or from first-principles. The effectiveness of our results, which are turned in an algorithm, is illustrated via simulations on the problem of estimating the cost of an agent that is stabilizing the unstable equilibrium of a pendulum.
引用
收藏
页码:4411 / 4416
页数:6
相关论文
共 24 条
[1]   From inverse optimal control to inverse reinforcement learning: A historical review [J].
Ab Azar, Nematollah ;
Shahmansoorian, Aref ;
Davoudi, Mohsen .
ANNUAL REVIEWS IN CONTROL, 2020, 50 :119-138
[2]   Optimal control - 1950 to 1985 [J].
Bryson, AE .
IEEE CONTROL SYSTEMS MAGAZINE, 1996, 16 (03) :26-33
[3]  
Cammardella N, 2019, IEEE DECIS CONTR P, P4195, DOI 10.1109/CDC40024.2019.9029512
[4]  
Dvijotham Krishnamurthy., 2010, Proceedings of the 27th International Conference on Machine Learning (ICML-10), P335, DOI DOI 10.0RG/PAPERS/571.PDF
[5]  
Finn C, 2016, PR MACH LEARN RES, V48
[6]   On a probabilistic approach to synthesize control policies from example datasets [J].
Gagliardi, Davide ;
Russo, Giovanni .
AUTOMATICA, 2022, 137
[7]   Probabilistic design of optimal sequential decision-making algorithms in learning and control [J].
Garrabe, Emiland ;
Russo, Giovanni .
ANNUAL REVIEWS IN CONTROL, 2022, 54 :81-102
[8]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[9]   Online Markov Decision Processes With Kullback-Leibler Control Cost [J].
Guan, Peng ;
Raginsky, Maxim ;
Willett, Rebecca M. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (06) :1423-1438
[10]  
Kalakrishnan M, 2013, IEEE INT CONF ROBOT, P1331, DOI 10.1109/ICRA.2013.6630743