Reinforcement learning-based estimation for spatio-temporal systems

被引:1
|
作者
Mowlavi, Saviz [1 ]
Benosman, Mouhacine [1 ]
机构
[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Estimation; Filtering; Partial differential equations; Model reduction; Reinforcement learning; MODEL-REDUCTION; FLUID-FLOWS;
D O I
10.1038/s41598-024-72055-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
State estimators such as Kalman filters compute an estimate of the instantaneous state of a dynamical system from sparse sensor measurements. For spatio-temporal systems, whose dynamics are governed by partial differential equations (PDEs), state estimators are typically designed based on a reduced-order model (ROM) that projects the original high-dimensional PDE onto a computationally tractable low-dimensional space. However, ROMs are prone to large errors, which negatively affects the performance of the estimator. Here, we introduce the reinforcement learning reduced-order estimator (RL-ROE), a ROM-based estimator in which the correction term that takes in the measurements is given by a nonlinear policy trained through reinforcement learning. The nonlinearity of the policy enables the RL-ROE to compensate efficiently for errors of the ROM, while still taking advantage of the imperfect knowledge of the dynamics. Using examples involving the Burgers and Navier-Stokes equations with parametric uncertainties, we show that in the limit of very few sensors, the trained RL-ROE outperforms a Kalman filter designed using the same ROM and yields accurate instantaneous estimates of high-dimensional states corresponding to unknown initial conditions and physical parameter values. The RL-ROE opens the door to lightweight real-time sensing of systems governed by parametric PDEs.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Estimating spatio-temporal fields through reinforcement learning
    Padrao, Paulo
    Fuentes, Jose
    Bobadilla, Leonardo
    Smith, Ryan N.
    FRONTIERS IN ROBOTICS AND AI, 2022, 9
  • [2] Parallel Computing of Spatio-Temporal Model Based on Deep Reinforcement Learning
    Lv, Zhiqiang
    Li, Jianbo
    Xu, Zhihao
    Wang, Yue
    Li, Haoran
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT I, 2021, 12937 : 391 - 403
  • [3] Vision Paper: Reinforcement Learning in Smart Spatio-Temporal Environments
    Schmoll, Sebastian
    Schubert, Matthias
    26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, : 81 - 84
  • [4] Dynamic Bike Reposition: A Spatio-Temporal Reinforcement Learning Approach
    Li, Yexin
    Zheng, Yu
    Yang, Qiang
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1724 - 1733
  • [5] Spatio-Temporal Capsule-Based Reinforcement Learning for Mobility-on-Demand Coordination
    He, Suining
    Shin, Kang G.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (03) : 1446 - 1461
  • [6] STACoRe: Spatio-temporal and action-based contrastive representations for reinforcement learning in Atari
    Lee, Young Jae
    Kim, Jaehoon
    Kwak, Mingu
    Park, Young Joon
    Kim, Seoung Bum
    NEURAL NETWORKS, 2023, 160 : 1 - 11
  • [7] CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
    Ho, Chi-Kai
    King, Chung-Ta
    IEEE ACCESS, 2023, 11 : 26820 - 26831
  • [8] Spatio-Temporal Capsule-based Reinforcement Learning for Mobility-on-Demand Network Coordination
    He, Suining
    Shin, Kang G.
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 2806 - 2813
  • [9] Optimizing the Spatio-Temporal Resource Search Problem with Reinforcement Learning (GIS Cup)
    Borutta, Felix
    Schmoll, Sebastian
    Friedl, Sabrina
    27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 628 - 631
  • [10] Optimizing Taxi Carpool Policies via Reinforcement Learning and Spatio-Temporal Mining
    Jindal, Ishan
    Qin, Zhiwei
    Chen, Xuewen
    Nokleby, Matthew
    Ye, Jieping
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1417 - 1426