On Model-free Reinforcement Learning for Switched Linear Systems: A Subspace Clustering Approach

Cited by: 0
Authors
Li, Hao [1]
Chen, Hua [1]
Zhang, Wei [1]
Affiliations
[1] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA
Source
2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018
Funding
National Science Foundation (USA)
Keywords
DISCRETE-TIME; TRACKING CONTROL; ALGORITHM
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this paper, we study optimal control of switched linear systems using reinforcement learning. Instead of directly applying existing model-free reinforcement learning algorithms, we propose a Q-learning-based algorithm designed specifically for discrete-time switched linear systems. Inspired by analytical results from the optimal control literature, the Q function in our algorithm is approximated by the point-wise minimum of a finite number of quadratic functions. We also develop an associated update scheme, based on subspace clustering, that preserves this structure during training. Numerical examples for both low-dimensional and high-dimensional switched linear systems are provided to demonstrate the performance of our algorithm.
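The point-wise minimum form mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the two 2x2 matrices in `P_SET`, and the choice of `z` as a stacked state-input vector are all assumptions made for illustration, and the record does not say how the paper parameterizes or updates the quadratic pieces.

```python
from typing import List, Sequence, Tuple

Matrix = List[List[float]]


def quadratic_form(P: Matrix, z: Sequence[float]) -> float:
    """Return z^T P z for a square matrix P given as nested lists."""
    n = len(z)
    return sum(z[i] * P[i][j] * z[j] for i in range(n) for j in range(n))


def min_quadratic_q(matrices: Sequence[Matrix], z: Sequence[float]) -> Tuple[float, int]:
    """Evaluate min_i z^T P_i z; return the value and the minimizing index."""
    values = [quadratic_form(P, z) for P in matrices]
    best = min(range(len(values)), key=values.__getitem__)
    return values[best], best


# Hypothetical example: two quadratic pieces, z stacks a scalar state and a scalar input.
P_SET: List[Matrix] = [
    [[1.0, 0.0], [0.0, 4.0]],
    [[3.0, 0.0], [0.0, 1.0]],
]

value, piece = min_quadratic_q(P_SET, [1.0, 0.0])
```

The returned index assigns each sample to one quadratic piece, which is the kind of grouping a clustering-based update would need. How the paper actually carries out that update is not described in this record.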
Pages: 123-130
Number of pages: 8
Related papers
50 in total
  • [21] Q-learning based adaptive Kalman filtering for partial model-free dynamic systems
    Tang, Kun
    Luan, Xiaoli
    Ding, Feng
    Liu, Fei
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2024, 38 (03) : 954 - 967
  • [22] Automatic reinforcement for robust model-free neurocontrol of robots without persistent excitation
    Pantoja-Garcia, Luis
    Parra-Vega, Vicente
    Garcia-Rodriguez, Rodolfo
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2024, 38 (01) : 221 - 236
  • [23] Model-Free Solution to the Discrete-Time Coupled Riccati Equation Using Off-Policy Reinforcement Learning
    Li, Lu
    Wang, Liming
    Yang, Yongliang
    Dong, Jie
    Yin, Yixin
    Cheng, Shusen
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 6813 - 6818
  • [24] Model-based learning retrospectively updates model-free values
    Doody, Max
    Van Swieten, Maaike M. H.
    Manohar, Sanjay G.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [25] On a model-free meta-heuristic approach for unconstrained optimization
    Xia, Wei
    He, Deming
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (15) : 22548 - 22562
  • [26] Delta Hedging in Financial Engineering: Towards a Model-Free Approach
    Fliess, Michel
    Join, Cedric
    18TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, 2010, : 1429 - 1434
  • [27] H∞ Optimal Load Frequency Control of Power System: A Novel Model-Free Approach
    Hu, Shunwei
    Luo, Yanhong
    Xie, Xiangpeng
    Zhang, Huaguang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2025, 72 (01) : 228 - 232
  • [28] Model-Free Fuzzy Control of Twin Rotor Aerodynamic Systems
    Roman, Raul-Cristian
    Precup, Radu-Emil
    Radac, Mircea-Bogdan
    2017 25TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2017, : 559 - 564
  • [29] Decentralized Model-Free Prescribed Performance Control for Interconnected Systems
    Zhang, Jinxi
    Chai, Tianyou
    Chen, Yangquan
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 112 - 117
  • [30] Implementation of model-free motion control for active suspension systems
    Wang, Jue
    Jin, Fujiang
    Zhou, Lichun
    Li, Ping
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2019, 119 : 589 - 602