Unconstrained feedback controller design using Q-learning from noisy data

被引:0
|
作者
Kumar, Pratyush [1 ]
Rawlings, James B. [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Chem Engn, Santa Barbara, CA 93106 USA
关键词
Reinforcement learning; Q-learning; Least squares policy iteration; System identification; Maximum likelihood estimation; Linear quadratic regulator; MODEL-PREDICTIVE CONTROL; REINFORCEMENT; STABILITY; MPC;
D O I
10.1016/j.compchemeng.2023.108325
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper develops a novel model-free Q-learning based approach to estimate linear, unconstrained feedback controllers from noisy process data. The proposed method is based on an extension of an available approach developed to estimate the linear quadratic regulator (LQR) for linear systems with full state measurements driven by Gaussian process noise of known covariance. First, we modify the approach to treat the case of an unknown noise covariance. Then, we use the modified approach to estimate a feedback controller for linear systems with both process and measurement noise and only output measurements. We also present a model-based maximum likelihood estimation (MLE) approach to determine a linear dynamic model and noise covariances from data, which is used to construct a regulator and state estimator for comparisons in simulation studies. The performances of the model-free and model-based controller estimation approaches are compared with an example heating, ventilation, and air-conditioning (HVAC) system. We show that the proposed Q-learning approach estimates a reasonably accurate feedback controller from 24 h of noisy data. The controllers estimated using both the model-free and model-based approaches provide similar closed-loop performances with 3.5 and 2.7% losses respectively, compared to a perfect controller that uses the true dynamic model and noise covariances of the HVAC system. Finally, we give future work directions for the model-free controller design approaches by discussing some remaining advantages of the model-based approaches.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] BEAM MANAGEMENT SOLUTION USING Q-LEARNING FRAMEWORK
    Araujo, Daniel C.
    de Almeida, Andre L. F.
    2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 594 - 598
  • [22] A Q-learning Approach for SoftECU Design in Hybrid Electric Vehicles
    Natella, Domenico
    Vasca, Francesco
    2020 24TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2020, : 763 - 768
  • [23] Feature Extraction in Q-Learning using Neural Networks
    Zhu, Henghui
    Paschalidis, Ioannis Ch.
    Hasselmo, Michael E.
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [24] An Autonomous Path Finding Robot Using Q-Learning
    Babu, Madhu
    Krishna, Vamshi U.
    Shahensha, S. K.
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,
  • [25] Primal-Dual Q-Learning Framework for LQR Design
    Lee, Donghwan
    Hu, Jianghai
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (09) : 3756 - 3763
  • [26] Molecular design based on Q-learning and maximum likelihood estimation
    Liu, Ying
    Zhang, Bingfeng
    Zhao, Jun
    Wang, Wei
    Lv, Zheng
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 2119 - 2124
  • [27] Solving Twisty Puzzles Using Parallel Q-learning
    Hukmani, Kavish
    Kolekar, Sucheta
    Vobugari, Sreekumar
    ENGINEERING LETTERS, 2021, 29 (04) : 1535 - 1543
  • [28] Autonomous Driving in Roundabout Maneuvers Using Reinforcement Learning with Q-Learning
    Garcia Cuenca, Laura
    Puertas, Enrique
    Fernandez Andres, Javier
    Aliane, Nourdine
    ELECTRONICS, 2019, 8 (12)
  • [29] (Data-Driven) Development of dynamic scheduling in semiconductor manufacturing using a Q-learning approach
    Shiue, Yeou-Ren
    Lee, Ken-Chuan
    Su, Chao-Ton
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2022, 35 (10-11) : 1188 - 1204
  • [30] Development of a Bias Compensating Q-Learning Controller for a Multi-Zone HVAC Facility
    Asad Rizvi, Syed Ali
    Pertzborn, Amanda J.
    Lin, Zongli
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (08) : 1704 - 1715