Input-Decoupled Q-Learning for Optimal Control

Authors
Minh Q. Phan
Seyed Mahdi B. Azad
Affiliation
[1] Dartmouth College, Thayer School of Engineering
Source
The Journal of the Astronautical Sciences | 2020 / Vol. 67
Keywords
Optimal control; Reinforcement learning; Q-learning; Input-decoupled
DOI
Not available
Abstract
This paper presents a design of optimal controllers based on a reinforcement learning method called Q-Learning. Central to Q-Learning is the Q-function, which is a function of the state and all input variables. The paper shows that Q-functions decoupled in the inputs exist and can be used to find the optimal controller for each input individually. The method thus converts a multi-variable optimization problem into much simpler single-variable optimization problems while preserving optimality. Learning these decoupled Q-functions does not require an explicit model of the system; instead, the method relies on the ability to probe the system and observe its state transitions. Derived within the framework of modern control theory, the method is applicable to both linear and nonlinear systems.
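The idea of per-input Q-functions can be illustrated with a small toy problem. This record does not give the paper's construction, so the sketch below rests on assumptions: the scalar system in `step`, the input grids `U1` and `U2`, the discount factor `GAMMA`, and all function names are hypothetical. The decoupled functions are obtained here by partial minimization of a learned joint Q-function, which is an assumed stand-in and not necessarily the authors' definition. The learner only probes `step` and observes the transition; it never reads the model.

```python
# Toy illustration only: NOT the paper's algorithm. The system, grids and
# discount factor are hypothetical choices made for this sketch.
STATES = range(-3, 4)   # scalar state grid
U1 = (-1, 0, 1)         # candidate values of input 1
U2 = (-2, 0, 2)         # candidate values of input 2
GAMMA = 0.9             # discount factor

def step(x, u1, u2):
    """Black-box system: apply (u1, u2) in state x, observe next state and cost."""
    x_next = max(-3, min(3, x + u1 + u2))
    cost = x * x + 0.1 * (u1 * u1 + u2 * u2)
    return x_next, cost

def learn_q(sweeps=200):
    """Learn the joint Q(x, u1, u2) by repeatedly probing the system."""
    q = {(x, a, b): 0.0 for x in STATES for a in U1 for b in U2}
    for _ in range(sweeps):
        for (x, a, b) in q:
            x_next, cost = step(x, a, b)
            best_next = min(q[(x_next, c, d)] for c in U1 for d in U2)
            # Deterministic system, so a learning rate of 1 is used.
            q[(x, a, b)] = cost + GAMMA * best_next
    return q

def decouple(q):
    """Assumed decoupling: minimize the joint Q over the other input."""
    q1 = {(x, a): min(q[(x, a, b)] for b in U2) for x in STATES for a in U1}
    q2 = {(x, b): min(q[(x, a, b)] for a in U1) for x in STATES for b in U2}
    return q1, q2

def policy(q1, q2, x):
    """Pick each input from its own single-variable minimization."""
    u1 = min(U1, key=lambda a: q1[(x, a)])
    u2 = min(U2, key=lambda b: q2[(x, b)])
    return u1, u2

q = learn_q()
q1, q2 = decouple(q)
for x in STATES:
    print(x, policy(q1, q2, x))
```

In this toy problem the joint minimizer is unique in every state, so the two single-variable searches recover the same input pair as a search over all nine combinations. With ties, the two searches could pick from different joint minimizers, and a consistent tie-breaking rule would be needed.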
Pages: 630-656
Page count: 26