Adaptive Model Learning method for Reinforcement Learning

Cited by: 0
Authors
Hwang, Kao-Shing [1]
Jiang, Wei-Cheng [2]
Chen, Yu-Jen [2]
Affiliations
[1] Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung, Taiwan
[2] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi, Taiwan
Keywords
adaptive model learning method; Dyna-Q agent; Reinforcement learning;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Standard Q-learning struggles to achieve sample efficiency, for example when a policy must be trained to reach a goal within a limited number of time steps. The Dyna-Q agent was proposed to speed up policy learning. However, Dyna-Q does not specify how the model should be built, so a lookup table is commonly used as the model. In this paper, we propose an adaptive model learning method based on tree structures and combine it with Q-learning to form a Tree-Based Dyna-Q agent that enhances policy learning. Once the tree-based model has learned an accurate model, a planning method can use it to produce simulated experiences that accelerate value iteration. The agent can thus obtain virtual experiences for updating the policy. Simulation results show that the proposed method noticeably reduces training time.
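For orientation, the generic Dyna-Q loop that the abstract builds on can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' Tree-Based Dyna-Q: the model here is the plain lookup table the abstract mentions as the common default, and the names `dyna_q` and `chain_step`, the six-state chain environment, and all hyperparameters are hypothetical choices made for the example.

```python
import random
from collections import defaultdict


def dyna_q(step, start, actions, episodes=50, planning_steps=10,
           alpha=0.5, gamma=0.95, epsilon=0.1, max_steps=200, seed=0):
    """Tabular Dyna-Q. `step(state, action)` returns (next_state, reward, done)."""
    rng = random.Random(seed)
    q = defaultdict(float)   # Q[(state, action)], initialised to 0
    model = {}               # table model: (state, action) -> (next_state, reward, done)

    def greedy(s):
        # Pick a highest-valued action, breaking ties at random.
        best = max(q[(s, a)] for a in actions)
        return rng.choice([a for a in actions if q[(s, a)] == best])

    def update(s, a, r, s2, done):
        # One-step Q-learning backup.
        target = r if done else r + gamma * max(q[(s2, b)] for b in actions)
        q[(s, a)] += alpha * (target - q[(s, a)])

    for _ in range(episodes):
        s = start
        for _ in range(max_steps):
            a = rng.choice(actions) if rng.random() < epsilon else greedy(s)
            s2, r, done = step(s, a)
            update(s, a, r, s2, done)        # learn from real experience
            model[(s, a)] = (s2, r, done)    # record the transition in the model
            for _ in range(planning_steps):  # planning: replay simulated experience
                ps, pa = rng.choice(list(model))
                ps2, pr, pdone = model[(ps, pa)]
                update(ps, pa, pr, ps2, pdone)
            s = s2
            if done:
                break
    return q, greedy


def chain_step(state, action, goal=5):
    """Hypothetical deterministic chain: states 0..goal, reward 1 on reaching goal."""
    nxt = min(max(state + action, 0), goal)
    done = nxt == goal
    return nxt, (1.0 if done else 0.0), done


q, greedy = dyna_q(chain_step, start=0, actions=[-1, +1])
policy = [greedy(s) for s in range(5)]  # learned action for each non-goal state
```

Setting `planning_steps=0` reduces this sketch to plain Q-learning, which makes the effect of the simulated experiences easy to compare. The paper's contribution, as described in the abstract, is to replace the table `model` with an adaptively learned tree-structured model; that part is not reproduced here.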
Pages: 1277 - 1280
Number of pages: 4
Related papers
50 in total
  • [31] Adaptive Traffic Signal Control Method Based on Offline Reinforcement Learning
    Wang, Lei
    Wang, Yu-Xuan
    Li, Jian-Kang
    Liu, Yi
    Pi, Jia-Tian
    APPLIED SCIENCES-BASEL, 2024, 14 (22):
  • [32] Adaptive Virtual Machine Consolidation Method Based on Deep Reinforcement Learning
    Yu X.
    Li Z.
    Sun S.
    Zhang G.
    Diao Z.
    Xie G.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (12): 2783 - 2797
  • [33] Adaptive Clutter Intelligent Suppression Method Based on Deep Reinforcement Learning
    Cheng, Yi
    Su, Junjie
    Xiu, Chunbo
    Liu, Jiaxin
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [34] ASRL: An Adaptive GPS Sampling Method Using Deep Reinforcement Learning
    Qu, Boting
    Zhao, Mengjiao
    Feng, Jun
    Wang, Xin
    2022 23RD IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2022), 2022, : 153 - 158
  • [35] A Satellite Adaptive Modulation Coding Method Based on Deep Reinforcement Learning
    Zhou, Xin
    Li, Wenfeng
    Zhao, Kanglian
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE INNOVATION, ICAII 2023, 2023, : 83 - 88
  • [36] Adaptive Discretization in Online Reinforcement Learning
    Sinclair, Sean R.
    Banerjee, Siddhartha
    Yu, Christina Lee
    OPERATIONS RESEARCH, 2023, 71 (05) : 1636 - 1652
  • [37] Adaptive Interest for Emphatic Reinforcement Learning
    Klissarov, Martin
    Fakoor, Rasool
    Mueller, Jonas
    Asadi, Kavosh
    Kim, Taesup
    Smola, Alexander J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [38] Adaptive State Aggregation for Reinforcement Learning
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Jiang, Wei-Cheng
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2452 - 2456
  • [39] Adaptive Exploration Strategies for Reinforcement Learning
    Hwang, Kao-Shing
    Li, Chih-Wen
    Jiang, Wei-Cheng
    2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017, : 16 - 19
  • [40] ADAPTIVE GUIDANCE WITH REINFORCEMENT META LEARNING
    Gaudet, Brian
    Linares, Richard
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 4091 - 4109