Reduced-dimensional reinforcement learning control using singular perturbation approximations

被引:34
|
作者
Mukherjee, Sayak [1 ]
Bai, He [2 ]
Chakrabortty, Aranya [1 ]
机构
[1] North Carolina State Univ, Dept Elect & Comp Engn, Raleigh, NC 27695 USA
[2] Oklahoma State Univ, Mech & Aerosp Engn Dept, Stillwater, OK 74078 USA
基金
美国国家科学基金会;
关键词
Reinforcement learning; Linear quadratic regulator; Singular perturbation; Model-free control; Model reduction; TIME LINEAR-SYSTEMS; ALGORITHM;
D O I
10.1016/j.automatica.2020.109451
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a set of model-free, reduced-dimensional reinforcement learning (RL) based optimal control designs for linear time-invariant singularly perturbed (SP) systems. We first present a state feedback and an output feedback based RL control design for a generic SP system with unknown state and input matrices. We take advantage of the underlying time-scale separation property of the plant to learn a linear quadratic regulator (LQR) for only its slow dynamics, thereby saving significant amount of learning time compared to the conventional full-dimensional RL controller. We analyze the sub-optimality of the designs using SP approximation theorems, and provide sufficient conditions for closed-loop stability. Thereafter, we extend both designs to clustered multi-agent consensus networks, where the SP property reflects through clustering. We develop both centralized and cluster-wise block-decentralized RL controllers for such networks, in reduced dimensions. We demonstrate the details of the implementation of these controllers using simulations of relevant numerical examples, and compare them with conventional RL designs to show the computational benefits of our approach. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Multi-Agent Reinforcement Learning Approach Based on Reduced Value Function Approximations
    Abouheaf, Mohammed
    Gueaieb, Wail
    2017 IEEE 5TH INTERNATIONAL SYMPOSIUM ON ROBOTICS AND INTELLIGENT SENSORS (IRIS), 2017, : 111 - 116
  • [22] Two-dimensional DOA estimation using Reduced-Dimensional MUSIC algorithm with strong-constraint optimization
    Cai, J.-J. (jjcai@mail.xidian.edu.cn), 1600, Science Press (36):
  • [23] Control of active suspension systems using the singular perturbation method
    Ando, Y
    Suzuki, M
    CONTROL ENGINEERING PRACTICE, 1996, 4 (03) : 287 - 293
  • [24] Reduced-Dimensional Whole-Body Control Based on Model Simplification for Bipedal Robots With Parallel Mechanisms
    Liang, Yunpeng
    Yin, Fulong
    Li, Zhen
    Xiong, Zhilin
    Peng, Zhihui
    Zhao, Yanzheng
    Yan, Weixin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (02): : 1696 - 1703
  • [25] Reduced-Dimensional Quantum Approach to Tunneling Splittings Using Saddle-Point Normal Coordinates
    Kamarchik, Eugene
    Wang, Yimin
    Bowman, Joel
    JOURNAL OF PHYSICAL CHEMISTRY A, 2009, 113 (26): : 7556 - 7562
  • [26] Design of active suspension control using singular perturbation theory
    Beheshti, MTH
    Nematollahzadeh, SM
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY, 2003, 27 (B4): : 691 - 700
  • [27] AUTOMATIC-GENERATION CONTROL USING SINGULAR PERTURBATION APPROACH
    SURYANARAYANA, NV
    HANMANDLU, M
    ELECTRIC MACHINES AND POWER SYSTEMS, 1988, 14 (01): : 45 - 59
  • [28] Reduced chemical mechanisms for atmospheric pollution using Computational Singular Perturbation analysis
    Neophytou, MK
    Goussis, DA
    van Loon, M
    Mastorakos, E
    ATMOSPHERIC ENVIRONMENT, 2004, 38 (22) : 3661 - 3673
  • [29] Reduced model and simulation of myelinated axon using eigenfunction expansion and singular perturbation
    Woo, Bomje
    Choi, Jinhoon
    COMPUTERS IN BIOLOGY AND MEDICINE, 2007, 37 (08) : 1148 - 1154
  • [30] Reduced order modeling of transcritical AC system dynamics using singular perturbation
    Rasmussen, B
    Alleyne, A
    Shah, R
    PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 2264 - 2269