Adaptive Model Learning method for Reinforcement Learning

被引:0
|
作者
Hwang, Kao-Shing [1 ]
Jiang, Wei-Cheng [2 ]
Chen, Yu-Jen [2 ]
机构
[1] Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung, Taiwan
[2] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi, Taiwan
关键词
adaptive model learning method; Dyna-Q agent; Reinforcement learning;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The original Q-learning method is difficult on achieving sample efficiency such as training a policy to get to a goal with in limited time step. So, the Dyna-Q agent is proposed to speed up the policy learning. However, the Dyna-Q did not specify how to build the model, so the table is used to be the model largely. In this paper, we proposed an adaptive model learning method based on tree structures and combined with Q-Learning to form Tree-Based Dyna-Q agent to enhance the policy learning. When the tree-based model learns an accurate model, a planning method can use the model to produce simulated experiences to accelerate value iterations. Thus, the agent with the proposed method can obtain virtual experiences for updating the policy. The simulation result shows that training time of our method can improve obviously.
引用
收藏
页码:1277 / 1280
页数:4
相关论文
共 50 条
  • [41] Reinforcement learning based adaptive metaheuristics
    Tessari, Michele
    Iacca, Giovanni
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 1854 - 1861
  • [42] Reinforcement Learning for Adaptive Mesh Refinement
    Yang, Jiachen
    Dzanic, Tarik
    Petersen, Brenden
    Kudo, Jun
    Mittal, Ketan
    Tomov, Vladimir
    Camier, Jean-Sylvain
    Zhao, Tuo
    Zha, Hongyuan
    Kolev, Tzanio
    Anderson, Robert
    Faissol, Daniel
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [43] An Adaptive Authentication Based on Reinforcement Learning
    Cui, Ziqi
    Zhao, Yongxiang
    Li, Chunxi
    Zuo, Qi
    Zhang, Haipeng
    2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2019,
  • [44] Adaptive operator selection with reinforcement learning
    Durgut, Rafet
    Aydin, Mehmet Emin
    Atli, Ibrahim
    INFORMATION SCIENCES, 2021, 581 : 773 - 790
  • [45] Reinforcement learning of adaptive control strategies
    Leslie K. Held
    Luc Vermeylen
    David Dignath
    Wim Notebaert
    Ruth M. Krebs
    Senne Braem
    Communications Psychology, 2 (1):
  • [46] Adaptive Exploration for Continual Reinforcement Learning
    Stulp, Freek
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 1631 - 1636
  • [47] An Adaptive Implementation of ε-Greedy in Reinforcement Learning
    Mignon, Alexandre dos Santos
    de Azevedo da Rocha, Ricardo Luis
    8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 : 1146 - 1151
  • [48] Reinforcement Learning for Adaptive Network Routing
    Desai, Rahul
    Patil, B. P.
    2014 INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2014, : 815 - 818
  • [49] Adaptive immunity based reinforcement learning
    Ito, Jungo
    Nakano, Kazushi
    Sakurama, Kazunori
    Hosokawa, Shu
    ARTIFICIAL LIFE AND ROBOTICS, 2008, 13 (01) : 188 - 193
  • [50] Adaptive Cognitive Training with Reinforcement Learning
    Zini, Floriano
    Le Piane, Fabio
    Gaspari, Mauro
    ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2022, 12 (01)