Adaptive Model Learning method for Reinforcement Learning

被引:0
|
作者
Hwang, Kao-Shing [1 ]
Jiang, Wei-Cheng [2 ]
Chen, Yu-Jen [2 ]
机构
[1] Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung, Taiwan
[2] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi, Taiwan
关键词
adaptive model learning method; Dyna-Q agent; Reinforcement learning;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The original Q-learning method is difficult on achieving sample efficiency such as training a policy to get to a goal with in limited time step. So, the Dyna-Q agent is proposed to speed up the policy learning. However, the Dyna-Q did not specify how to build the model, so the table is used to be the model largely. In this paper, we proposed an adaptive model learning method based on tree structures and combined with Q-Learning to form Tree-Based Dyna-Q agent to enhance the policy learning. When the tree-based model learns an accurate model, a planning method can use the model to produce simulated experiences to accelerate value iterations. Thus, the agent with the proposed method can obtain virtual experiences for updating the policy. The simulation result shows that training time of our method can improve obviously.
引用
收藏
页码:1277 / 1280
页数:4
相关论文
共 50 条
  • [1] An adaptive clustering method for model-free reinforcement learning
    Matt, A
    Regensburger, G
    INMIC 2004: 8TH INTERNATIONAL MULTITOPIC CONFERENCE, PROCEEDINGS, 2004, : 362 - 367
  • [2] Adaptive Client Model Update with Reinforcement Learning in Synchronous Federated Learning
    Pan, Zirou
    Geng, Huan
    Wei, Linna
    Zhao, Wei
    2022 32ND INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC), 2022, : 155 - 157
  • [3] Combining Learner Model and Reinforcement Learning for Adaptive Sequencing of Learning Activities
    Yessad, Amel
    METHODOLOGIES AND INTELLIGENT SYSTEMS FOR TECHNOLOGY ENHANCED LEARNING, 2023, 580 : 97 - 102
  • [4] Adaptive Reinforcement Learning Method for Networks-on-Chip
    Farahnakian, Fahimeh
    Ebrahimi, Masoumeh
    Daneshtalab, Masoud
    Plosila, Juha
    Liljeberg, Pasi
    2012 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS (SAMOS): ARCHITECTURES, MODELING AND SIMULATION, 2012, : 236 - 243
  • [5] A reinforcement learning method based on adaptive simulated annealing
    Atiya, AF
    Parlos, AG
    Ingber, L
    Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 121 - 124
  • [6] Adaptive Supervisor: Method of Reinforcement Learning Fault Elimination by Application of Supervised Learning
    Krzyszton, Mateusz
    PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 139 - 143
  • [7] Adaptive Discretization for Model-Based Reinforcement Learning
    Sinclair, Sean R.
    Wang, Tianyu
    Jain, Gauri
    Banerjee, Siddhartha
    Yu, Christina Lee
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
  • [8] Deep Reinforcement Learning for Adaptive Learning Systems
    Li, Xiao
    Xu, Hanchen
    Zhang, Jinming
    Chang, Hua-hua
    JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2023, 48 (02) : 220 - 243
  • [9] Adaptive Individual Q-Learning-A Multiagent Reinforcement Learning Method for Coordination Optimization
    Zhang, Zhen
    Wang, Dongqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12
  • [10] Reinforcement Learning with Adaptive Networks
    Sasaki, Tomoki
    Yamada, Satoshi
    2017 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION SCIENCES (ICRAS), 2017, : 1 - 5