Model-based reinforcement learning control of reaction-diffusion problems

被引:0
|
作者
Schenk, Christina [1 ]
Vasudevan, Aditya [1 ]
Haranczyk, Maciej [1 ]
Romero, Ignacio [1 ,2 ]
机构
[1] IMDEA Mat Inst, Eric Kandel 2, Madrid 28906, Spain
[2] Univ Politecn Madrid, Dept Mech Engn, Madrid, Spain
关键词
disease and thermal transport; optimal control; partial differential equations; policy-gradient methods; reaction-diffusion; reinforcement learning; DYNAMICS;
D O I
10.1002/oca.3196
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mathematical and computational tools have proven to be reliable in decision-making processes. In recent times, in particular, machine learning-based methods are becoming increasingly popular as advanced support tools. When dealing with control problems, reinforcement learning has been applied to decision-making in several applications, most notably in games. The success of these methods in finding solutions to complex problems motivates the exploration of new areas where they can be employed to overcome current difficulties. In this article, we explore the use of automatic control strategies to initial boundary value problems in thermal and disease transport. Specifically, in this work, we adapt an existing reinforcement learning algorithm using a stochastic policy gradient method and we introduce two novel reward functions to drive the flow of the transported field. The new model-based framework exploits the interactions between a reaction-diffusion model and the modified agent. The results show that certain controls can be implemented successfully in these applications, although model simplifications had to be assumed. This paper explores reinforcement learning for control in thermal and disease transport problems, adapting a stochastic policy gradient algorithm and introducing novel reward functions. The new model-based framework leverages interactions between a reaction-diffusion model and the modified agent. Results demonstrate successful RL-based control for these applications despite necessary model simplifications. image
引用
收藏
页码:2897 / 2914
页数:18
相关论文
共 50 条
  • [1] Hybrid control for combining model-based and model-free reinforcement learning
    Pinosky, Allison
    Abraham, Ian
    Broad, Alexander
    Argall, Brenna
    Murphey, Todd D.
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2023, 42 (06) : 337 - 355
  • [2] Optimal control of dengue vector based on a reaction-diffusion model?
    Li, Yazhi
    Wang, Yan
    Liu, Lili
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2023, 203 : 250 - 270
  • [3] Function approximation for reinforcement learning based on reaction-diffusion equation on a graph
    Kobayashi, Y
    Yuasa, H
    Arai, T
    SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5, 2002, : 813 - 818
  • [4] A survey on model-based reinforcement learning
    Luo, Fan-Ming
    Xu, Tian
    Lai, Hang
    Chen, Xiong-Hui
    Zhang, Weinan
    Yu, Yang
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (02)
  • [5] Offline Model-Based Reinforcement Learning for Tokamak Control
    Char, Ian
    Abbate, Joseph
    Bardoczi, Laszlo
    Boyer, Mark D.
    Chung, Youngseog
    Conlin, Rory
    Erickson, Keith
    Mehta, Viraj
    Richner, Nathan
    Kolemen, Egemen
    Schneider, Jeff
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [6] Model-Based Reinforcement Learning for Eco-Driving Control of Electric Vehicles
    Lee, Heeyun
    Kim, Namwook
    Cha, Suk Won
    IEEE ACCESS, 2020, 8 : 202886 - 202896
  • [7] Multi-Zone HVAC Control With Model-Based Deep Reinforcement Learning
    Ding, Xianzhong
    Cerpa, Alberto
    Du, Wan
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 4408 - 4426
  • [8] Model-based hierarchical reinforcement learning and human action control
    Botvinick, Matthew
    Weinstein, Ari
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2014, 369 (1655)
  • [9] THE OPTIMAL CONTROL OF AN HIV/AIDS REACTION-DIFFUSION EPIDEMIC MODEL
    Chorfi, Nouar
    Bendoukha, Samir
    Abdelmalek, Salem
    DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES S, 2024,
  • [10] Advances in model-based reinforcement learning for Adaptive Optics control
    Nousiainen, Jalo
    Engler, Byron
    Kasper, Markus
    Helin, Tapio
    Heritier, Cedric T.
    Rajani, Chang
    ADAPTIVE OPTICS SYSTEMS VIII, 2022, 12185