Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control

被引:0
作者
DeLellis, Francesco [1 ]
Coraggio, Marco [2 ]
Russo, Giovanni [3 ]
Musolesi, Mirco [4 ,5 ]
di Bernardo, Mario [1 ,2 ]
机构
[1] Univ Naples Federico II, Naples, Italy
[2] Scuola Super Merid, Naples, Italy
[3] Univ Salerno, Salerno, Italy
[4] UCL, London, England
[5] Univ Bologna, Bologna, Italy
来源
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168 | 2022年 / 168卷
关键词
Reinforcement learning based control; data-driven control; feedback control;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an architecture where a feedback controller derived on an approximate model of the environment assists the learning process to enhance its data efficiency. This architecture, which we term as Control-Tutored Q-Learning (CTQL), is presented in two alternative flavours. The former is based on defining the reward function so that a Boolean condition can be used to determine when the control tutor policy is adopted, while the latter, termed as probabilistic CTQL (pCTQL), is instead based on executing calls to the tutor with a certain probability during learning. Both approaches are validated, and thoroughly benchmarked against Q-Learning, by considering the stabilization of an inverted pendulum as defined in OpenAI Gym as a representative problem.
引用
收藏
页数:12
相关论文
共 27 条
  • [1] Abbeel Pieter, 2006, INT C MACH LEARN ICM, P1
  • [2] Argerich M. F., 2020, AAAI SPRING S COMB M
  • [3] Berkenkamp F, 2017, ADV NEUR IN, V30
  • [4] Bertsekas D., 1996, NEURO DYNAMIC PROGRA
  • [5] Brockman G, 2016, Arxiv, DOI [arXiv:1606.01540, DOI 10.48550/ARXIV.1606.01540]
  • [6] De Lellis F, 2021, 2021 EUROPEAN CONTROL CONFERENCE (ECC), P580, DOI 10.23919/ECC54610.2021.9654881
  • [7] Deisenroth M., 2011, P 28 INT C MACH LEAR, P465, DOI [10.5555/3104482.3104541, DOI 10.5555/3104482.3104541]
  • [8] Duan Y, 2016, PR MACH LEARN RES, V48
  • [9] Even-Dar E, 2003, J MACH LEARN RES, V5, P1
  • [10] Nguyen HT, 2019, 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), P102, DOI 10.1109/SSCI44817.2019.9002756