A Hybrid Learning Method for System Identification and Optimal Control

被引:8
|
作者
Schubnel, Baptiste [1 ]
Carrillo, Rafael E. [1 ]
Alet, Pierre-Jean [1 ]
Hutter, Andreas [1 ]
机构
[1] CSEM, CH-2002 Neuchatel, Switzerland
基金
欧盟地平线“2020”;
关键词
Mathematical model; Buildings; Optimization; Optimal control; Neural networks; Data models; Building management systems; deep reinforcement learning (RL); optimal control; system identification; MODEL-PREDICTIVE CONTROL;
D O I
10.1109/TNNLS.2020.3016906
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a three-step method to perform system identification and optimal control of nonlinear systems. Our approach is mainly data-driven and does not require active excitation of the system to perform system identification. In particular, it is designed for systems for which only historical data under closed-loop control are available and where historical control commands exhibit low variability. In the first step, simple simulation models of the system are built and run under various conditions. In the second step, a neural network architecture is extensively trained on the simulation outputs to learn the system physics and retrained with historical data from the real system with stopping rules. These constraints avoid overfitting that arises by fitting closed-loop controlled systems. By doing so, we obtain one (or many) system model(s), represented by this architecture, whose behavior can be chosen to match more or less the real system. Finally, state-of-the-art reinforcement learning with a variant of domain randomization and distributed learning is used for optimal control of the system. We first illustrate the model identification strategy with a simple example, the pendulum with external torque. We then apply our method to model and optimize the control of a large building facility located in Switzerland. Simulation results demonstrate that this approach generates stable functional controllers that outperform on comfort and energy benchmark rule-based controllers.
引用
收藏
页码:4096 / 4110
页数:15
相关论文
共 50 条
  • [1] Identification and control using a hybrid reinforcement learning system
    Mills, Peter M.
    Tade, Moses O.
    Zomaya, Albert Y.
    International Journal in Computer Simulation, 5 (02):
  • [2] An optimal control of a hybrid system
    Gabasov, R.
    Kirillova, F. M.
    Pavlenok, N. S.
    DOKLADY MATHEMATICS, 2007, 76 (03) : 976 - 982
  • [3] An optimal control of a hybrid system
    R. Gabasov
    F. M. Kirillova
    N. S. Pavlenok
    Doklady Mathematics, 2007, 76 : 976 - 982
  • [4] Deep Learning Optimal Control for a Complex Hybrid Energy Storage System
    Zsembinszki, Gabriel
    Fernandez, Cesar
    Verez, David
    Cabeza, Luisa F.
    BUILDINGS, 2021, 11 (05)
  • [5] Hybrid System Identification via Switched System Optimal Control for Bipedal Robotic Walking
    Vasudevan, Ram
    ROBOTICS RESEARCH, ISRR, 2017, 100
  • [6] Mathematic model and optimal control method based on hybrid intelligent system
    Liu Zaiwen
    Wang Xiaoyi
    Hou Chaozhen
    Cui Lifeng
    Xue Hong
    Wu Yelan
    2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, : 435 - 440
  • [7] Hybrid Reinforcement Learning for Optimal Control of Non-Linear Switching System
    Li, Xiaofeng
    Dong, Lu
    Xue, Lei
    Sun, Changyin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 9161 - 9170
  • [8] H∞ frequency domain method of optimal asynchronous learning control system
    Deng, Zhidong
    Sun, Zengqi
    Zidonghua Xuebao/Acta Automatica Sinica, 1995, 21 (02): : 178 - 183
  • [9] Model Predictive and Iterative Learning Control Based Hybrid Control Method for Hybrid Energy Storage System
    Zhang, Xibeng
    Wang, Benfei
    Gamage, Don
    Ukil, Abhisek
    IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2021, 12 (04) : 2146 - 2158
  • [10] Statistical learning for optimal control of hybrid systems
    Piovesan, Jorge
    Abdallah, Chaouki
    Egerstedt, Magnus
    Tanner, Herbert
    Wardi, Yorai
    2007 AMERICAN CONTROL CONFERENCE, VOLS 1-13, 2007, : 2053 - +