Reinforcement learning-based estimation for spatio-temporal systems

被引：1

作者：

Mowlavi, Saviz ^{[1
]}

Benosman, Mouhacine ^{[1
]}

机构：

[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA

来源：

SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期

关键词：

Estimation; Filtering; Partial differential equations; Model reduction; Reinforcement learning; MODEL-REDUCTION; FLUID-FLOWS;

D O I：

10.1038/s41598-024-72055-1

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

State estimators such as Kalman filters compute an estimate of the instantaneous state of a dynamical system from sparse sensor measurements. For spatio-temporal systems, whose dynamics are governed by partial differential equations (PDEs), state estimators are typically designed based on a reduced-order model (ROM) that projects the original high-dimensional PDE onto a computationally tractable low-dimensional space. However, ROMs are prone to large errors, which negatively affects the performance of the estimator. Here, we introduce the reinforcement learning reduced-order estimator (RL-ROE), a ROM-based estimator in which the correction term that takes in the measurements is given by a nonlinear policy trained through reinforcement learning. The nonlinearity of the policy enables the RL-ROE to compensate efficiently for errors of the ROM, while still taking advantage of the imperfect knowledge of the dynamics. Using examples involving the Burgers and Navier-Stokes equations with parametric uncertainties, we show that in the limit of very few sensors, the trained RL-ROE outperforms a Kalman filter designed using the same ROM and yields accurate instantaneous estimates of high-dimensional states corresponding to unknown initial conditions and physical parameter values. The RL-ROE opens the door to lightweight real-time sensing of systems governed by parametric PDEs.

引用

页数：13

共 50 条

[1] Estimating spatio-temporal fields through reinforcement learning
Padrao, Paulo
Fuentes, Jose
Bobadilla, Leonardo
Smith, Ryan N.
FRONTIERS IN ROBOTICS AND AI, 2022, 9
[2] Parallel Computing of Spatio-Temporal Model Based on Deep Reinforcement Learning
Lv, Zhiqiang
Li, Jianbo
Xu, Zhihao
Wang, Yue
Li, Haoran
WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT I, 2021, 12937 : 391 - 403
[3] Vision Paper: Reinforcement Learning in Smart Spatio-Temporal Environments
Schmoll, Sebastian
Schubert, Matthias
26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, : 81 - 84
[4] Dynamic Bike Reposition: A Spatio-Temporal Reinforcement Learning Approach
Li, Yexin
Zheng, Yu
Yang, Qiang
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1724 - 1733
[5] Spatio-Temporal Capsule-Based Reinforcement Learning for Mobility-on-Demand Coordination
He, Suining
Shin, Kang G.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (03) : 1446 - 1461
[6] STACoRe: Spatio-temporal and action-based contrastive representations for reinforcement learning in Atari
Lee, Young Jae
Kim, Jaehoon
Kwak, Mingu
Park, Young Joon
Kim, Seoung Bum
NEURAL NETWORKS, 2023, 160 : 1 - 11
[7] CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
Ho, Chi-Kai
King, Chung-Ta
IEEE ACCESS, 2023, 11 : 26820 - 26831
[8] Spatio-Temporal Capsule-based Reinforcement Learning for Mobility-on-Demand Network Coordination
He, Suining
Shin, Kang G.
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 2806 - 2813
[9] Optimizing the Spatio-Temporal Resource Search Problem with Reinforcement Learning (GIS Cup)
Borutta, Felix
Schmoll, Sebastian
Friedl, Sabrina
27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 628 - 631
[10] Optimizing Taxi Carpool Policies via Reinforcement Learning and Spatio-Temporal Mining
Jindal, Ishan
Qin, Zhiwei
Chen, Xuewen
Nokleby, Matthew
Ye, Jieping
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1417 - 1426

← 1 2 3 4 5 →