Identifying Ordinary Differential Equations for Data-efficient Model-based Reinforcement Learning

Cited by: 0

Authors:

Nagel, Tobias [1]

Huber, Marco F. [2]

Affiliations:

[1] Fraunhofer Institute for Manufacturing Engineering and Automation IPA, D-70569 Stuttgart, Germany

[2] University of Stuttgart, Institute of Industrial Manufacturing and Management IFF, D-70569 Stuttgart, Germany

Source:

2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024 | 2024

Keywords:

Kalman filtering; Neural nets; Ordinary differential equations; Nonlinear approximation; Sparse identification

DOI:

10.1109/IJCNN60899.2024.10650369

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The identification of a mathematical dynamics model is a crucial step in the design process of a controller. However, it is often very difficult to identify the system's governing equations, especially in complex environments that combine physical laws from different disciplines. In this paper, we present a new approach that identifies an ordinary differential equation by means of a physics-informed machine learning algorithm. Our method introduces a special neural network that exploits prior human knowledge to a certain degree and extends it autonomously, so that the resulting differential equations describe the system as accurately as possible. We validate the method on a Duffing oscillator with simulation data and, additionally, on a cascaded tank example with real-world data. Subsequently, we use the developed algorithm in a model-based reinforcement learning framework by alternately identifying the system and controlling it to a target state. We test the performance by swinging up an inverted pendulum on a cart.
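To make the Duffing oscillator validation setting concrete, the sketch below simulates an unforced Duffing oscillator and recovers its acceleration equation from the simulated data. It is not the authors' physics-informed neural network: it swaps in plain sequential thresholded least squares over a polynomial candidate library, the sparse-identification idea named in the keywords. The parameter values, the candidate library, the threshold, and all function names are assumptions chosen for illustration.

```python
import numpy as np

# Assumed parameters of an unforced Duffing oscillator:
#   x'' = -DELTA * x' - ALPHA * x - BETA * x**3
DELTA, ALPHA, BETA = 0.3, 1.0, 2.0

# Hypothetical candidate library for the acceleration equation.
TERMS = ["1", "x", "v", "x^2", "x*v", "v^2", "x^3"]


def duffing(state):
    """Right-hand side of the first-order system (x, v)."""
    x, v = state
    return np.array([v, -DELTA * v - ALPHA * x - BETA * x**3])


def simulate(x0, dt, steps):
    """Integrate with the classical fourth-order Runge-Kutta scheme."""
    states = np.empty((steps, 2))
    s = np.asarray(x0, dtype=float)
    for k in range(steps):
        states[k] = s
        k1 = duffing(s)
        k2 = duffing(s + 0.5 * dt * k1)
        k3 = duffing(s + 0.5 * dt * k2)
        k4 = duffing(s + dt * k3)
        s = s + dt / 6.0 * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
    return states


def library(states):
    """Evaluate every candidate term on the measured states."""
    x, v = states[:, 0], states[:, 1]
    return np.column_stack([np.ones_like(x), x, v, x**2, x * v, v**2, x**3])


def stlsq(theta, target, threshold=0.05, iterations=10):
    """Sequential thresholded least squares: fit, zero small terms, refit."""
    coef = np.linalg.lstsq(theta, target, rcond=None)[0]
    for _ in range(iterations):
        small = np.abs(coef) < threshold
        coef[small] = 0.0
        active = ~small
        if not active.any():
            break
        coef[active] = np.linalg.lstsq(theta[:, active], target, rcond=None)[0]
    return coef


def identify(dt=0.01, steps=2000):
    """Return the identified coefficient of each candidate term in v'."""
    states = simulate([1.0, 0.0], dt, steps)
    # Estimate the acceleration by finite differences, as one would from data,
    # and drop the two boundary samples where the estimate is less accurate.
    dv = np.gradient(states[:, 1], dt)[1:-1]
    coef = stlsq(library(states[1:-1]), dv)
    return dict(zip(TERMS, coef))


if __name__ == "__main__":
    for term, value in identify().items():
        print(f"{term:>4}: {value: .4f}")
```

On noise-free simulation data the nonzero coefficients should land close to the assumed values of -ALPHA, -DELTA, and -BETA for the terms `x`, `v`, and `x^3`. Real measurements, such as the cascaded tank data mentioned in the abstract, would typically need filtering or a more robust derivative estimate before a regression of this kind.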

References

Pages: 10

34 entries in total
