Efficient reinforcement learning: Model-based acrobot control

被引：0

作者：

Boone, G

机构：

来源：

1997 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION - PROCEEDINGS, VOLS 1-4 | 1997年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Several methods have been proposed in the reinforcement learning literature for learning optimal policies for sequential decision tasks. Q-learning is a model-free algorithm that has recently been applied to the Acrobot, a two-link arm with a single actuator at the elbow that learns to swing its free endpoint above a target height. However, applying Q-learning to a real Acrobot may be impractical due to the large number of required movements of the real robot as the controller learns. This paper explores the planning speed and data efficiency of explicitly learning models, as well as using heuristic knowledge to aid the search for solutions and reduce the amount of data required from the real robot.

引用

页码：229 / 234

页数：6

共 50 条

[21] Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation
Corneil, Dane
Gerstner, Wulfram
Brea, Johanni
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[22] Efficient Exploration in Continuous-time Model-based Reinforcement Learning
Treven, Lenart
Hubotter, Jonas
Sukhija, Bhavya
Dorfler, Florian
Krause, Andreas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[23] Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
Zhang, Shenao
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[24] Data-efficient model-based reinforcement learning with trajectory discrimination
Tuo Qu
Fuqing Duan
Junge Zhang
Bo Zhao
Wenzhen Huang
Complex & Intelligent Systems, 2024, 10 : 1927 - 1936
[25] Data-efficient model-based reinforcement learning with trajectory discrimination
Qu, Tuo
Duan, Fuqing
Zhang, Junge
Zhao, Bo
Huang, Wenzhen
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 1927 - 1936
[26] Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization
Mutti, Mirco
De Santi, Riccardo
Rossi, Emanuele
Calderon, Juan Felipe
Bronstein, Michael
Restelli, Marcello
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9251 - 9259
[27] Residual-gradient-based neural reinforcement learning for the optimal control of an acrobot
Xu, X
He, HG
PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2002, : 758 - 763
[28] Model-based Reinforcement Learning: A Survey
Moerland, Thomas M.
Broekens, Joost
Plaat, Aske
Jonker, Catholijn M.
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2023, 16 (01): : 1 - 118
[29] A survey on model-based reinforcement learning
Fan-Ming LUO
Tian XU
Hang LAI
Xiong-Hui CHEN
Weinan ZHANG
Yang YU
Science China(Information Sciences), 2024, 67 (02) : 59 - 84
[30] Nonparametric model-based reinforcement learning
Atkeson, CG
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 1008 - 1014

← 1 2 3 4 5 →