Adaptive Model Learning method for Reinforcement Learning

被引：0

作者：

Hwang, Kao-Shing ^{[1
]}

Jiang, Wei-Cheng ^{[2
]}

Chen, Yu-Jen ^{[2
]}

机构：

[1] Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung, Taiwan

[2] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi, Taiwan

来源：

2012 PROCEEDINGS OF SICE ANNUAL CONFERENCE (SICE) | 2012年

关键词：

adaptive model learning method; Dyna-Q agent; Reinforcement learning;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The original Q-learning method is difficult on achieving sample efficiency such as training a policy to get to a goal with in limited time step. So, the Dyna-Q agent is proposed to speed up the policy learning. However, the Dyna-Q did not specify how to build the model, so the table is used to be the model largely. In this paper, we proposed an adaptive model learning method based on tree structures and combined with Q-Learning to form Tree-Based Dyna-Q agent to enhance the policy learning. When the tree-based model learns an accurate model, a planning method can use the model to produce simulated experiences to accelerate value iterations. Thus, the agent with the proposed method can obtain virtual experiences for updating the policy. The simulation result shows that training time of our method can improve obviously.

引用

页码：1277 / 1280

页数：4

共 50 条

[31] Adaptive Traffic Signal Control Method Based on Offline Reinforcement Learning
Wang, Lei
Wang, Yu-Xuan
Li, Jian-Kang
Liu, Yi
Pi, Jia-Tian
APPLIED SCIENCES-BASEL, 2024, 14 (22):
[32] Adaptive Virtual Machine Consolidation Method Based on Deep Reinforcement Learning
Yu X.
Li Z.
Sun S.
Zhang G.
Diao Z.
Xie G.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (12): : 2783 - 2797
[33] Adaptive Clutter Intelligent Suppression Method Based on Deep Reinforcement Learning
Cheng, Yi
Su, Junjie
Xiu, Chunbo
Liu, Jiaxin
APPLIED SCIENCES-BASEL, 2024, 14 (17):
[34] ASRL: An Adaptive GPS Sampling Method Using Deep Reinforcement Learning
Qu, Boting
Zhao, Mengjiao
Feng, Jun
Wang, Xin
2022 23RD IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2022), 2022, : 153 - 158
[35] A Satellite Adaptive Modulation Coding Method Based on Deep Reinforcement Learning
Zhou, Xin
Li, Wenfeng
Zhao, Kanglian
2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE INNOVATION, ICAII 2023, 2023, : 83 - 88
[36] Adaptive Discretization in Online Reinforcement Learning
Sinclair, Sean R.
Banerjee, Siddhartha
Yu, Christina Lee
OPERATIONS RESEARCH, 2023, 71 (05) : 1636 - 1652
[37] Adaptive Interest for Emphatic Reinforcement Learning
Klissarov, Martin
Fakoor, Rasool
Mueller, Jonas
Asadi, Kavosh
Kim, Taesup
Smola, Alexander J.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[38] Adaptive State Aggregation for Reinforcement Learning
Hwang, Kao-Shing
Chen, Yu-Jen
Jiang, Wei-Cheng
PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2452 - 2456
[39] Adaptive Exploration Strategies for Reinforcement Learning
Hwang, Kao-Shing
Li, Chih-Wen
Jiang, Wei-Cheng
2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017, : 16 - 19
[40] ADAPTIVE GUIDANCE WITH REINFORCEMENT META LEARNING
Gaudet, Brian
Linares, Richard
SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 4091 - 4109

← 1 2 3 4 5 →