Identifying Ordinary Differential Equations for Data-efficient Model-based Reinforcement Learning

被引:0
作者
Nagel, Tobias [1 ]
Huber, Marco F. [2 ]
机构
[1] Fraunhofer Inst Mfg Engn & Automat IPA, D-70569 Stuttgart, Germany
[2] Univ Stuttgart, Inst Ind Mfg & Management IFF, D-70569 Stuttgart, Germany
来源
2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024 | 2024年
关键词
Kalman filtering; Neural nets; Ordinary Differential Equations; Nonlinear approximation; SPARSE IDENTIFICATION;
D O I
10.1109/IJCNN60899.2024.10650369
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The identification of a mathematical dynamics model is a crucial step in the designing process of a controller. However, it is often very difficult to identify the system's governing equations, especially in complex environments that combine physical laws of different disciplines. In this paper, we present a new approach that allows identifying an ordinary differential equation by means of a physics-informed machine learning algorithm. Our method introduces a special neural network that allows exploiting prior human knowledge to a certain degree and extends it autonomously, so that the resulting differential equations describe the system as accurately as possible. We validate the method on a Duffing oscillator with simulation data and, additionally, on a cascaded tank example with real-world data. Subsequently, we use the developed algorithm in a model-based reinforcement learning framework by alternately identifying and controlling a system to a target state. We test the performance by swinging-up an inverted pendulum on a cart.
引用
收藏
页数:10
相关论文
共 34 条
  • [1] Aguirre Luis Antonio, 2019, ARXIV
  • [2] Akman Devin, 2018, Journal of Applied Mathematics, V2018, DOI 10.1155/2018/9160793
  • [3] Discovering governing equations from data by sparse identification of nonlinear dynamical systems
    Brunton, Steven L.
    Proctor, Joshua L.
    Kutz, J. Nathan
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (15) : 3932 - 3937
  • [4] Caicedo J. M., 2017, C P SOC EXPT MECH SE
  • [5] Reinforcement Learning-Based Adaptive Optimal Exponential Tracking Control of Linear Systems With Unknown Dynamics
    Chen, Ci
    Modares, Hamidreza
    Xie, Kan
    Lewis, Frank L.
    Wan, Yan
    Xie, Shengli
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (11) : 4423 - 4438
  • [6] Chen Ricky T.Q., 2018, ADV NEURAL INFORM PR, V31
  • [7] de Silva B., 2020, J OPEN SOURCE SOFTWA, V5, P2104, DOI [10.21105/joss.02104(2020, DOI 10.21105/JOSS.02104]
  • [8] Data-Efficient Reinforcement Learning for Complex Nonlinear Systems
    Donge, Vrushabh S.
    Lian, Bosen
    Lewis, Frank L.
    Davoudi, Ali
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (03) : 1391 - 1402
  • [9] Ensemble-SINDy: Robust sparse model discovery in the low-data, high-noise limit, with active learning and control
    Fasel, U.
    Kutz, J. N.
    Brunton, B. W.
    Brunton, S. L.
    [J]. PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2022, 478 (2260):
  • [10] Sparse Identification of Nonlinear Duffing Oscillator From Measurement Data
    Goharoodi, S. Khatiry
    Dekemele, K.
    Dupre, L.
    Loccufier, M.
    Crevecoeur, G.
    [J]. IFAC PAPERSONLINE, 2018, 51 (33): : 162 - 167