Input-Decoupled Q-Learning for Optimal Control

Cited by: 0

Authors:

Minh Q. Phan

Seyed Mahdi B. Azad

Affiliation:

[1] Dartmouth College, Thayer School of Engineering

Source:

The Journal of the Astronautical Sciences | 2020 / Vol. 67

Keywords:

Optimal control; Reinforcement learning; Q-learning; Input-decoupled

DOI:

Not available

Abstract:

A design of optimal controllers based on a reinforcement learning method called Q-Learning is presented. Central to Q-Learning is the Q-function, which is a function of the state and all input variables. This paper shows that decoupled-in-the-inputs Q-functions exist and can be used to find the optimal controller for each input individually. The method thus converts a multiple-variable optimization problem into much simpler single-variable optimization problems while still achieving optimality. An explicit model of the system is not required to learn these decoupled Q-functions; instead, the method relies on the ability to probe the system and observe its state transitions. Derived within the framework of modern control theory, the method is applicable to both linear and non-linear systems.
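The decoupling idea in the abstract can be illustrated on the linear-quadratic special case. The sketch below is not the paper's algorithm: it assumes a known model (the paper's method is model-free), and the matrices `A`, `B`, `Qc`, `R` and the helper names `riccati_iteration` and `decoupled_gain` are hypothetical choices made for illustration only. It shows that when the other input is held at its optimal feedback law, minimizing the Q-function over a single input is a scalar quadratic problem whose solution reproduces that input's row of the optimal gain.

```python
import numpy as np

def riccati_iteration(A, B, Qc, R, iters=500):
    """Iterate the discrete-time Riccati recursion; return (P, K) with u = -K x."""
    P = Qc.copy()
    for _ in range(iters):
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Qc + A.T @ P @ (A - B @ K)
    # Recompute K from the final P so that K and P are consistent.
    K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    return P, K

def decoupled_gain(H_ux, H_uu, K, i):
    """Gain row for input i obtained by minimizing Q over u_i alone,
    with every other input j held at its optimal law u_j = -K_j x."""
    others = [j for j in range(H_uu.shape[0]) if j != i]
    coupling = H_uu[i, others] @ K[others, :]
    return (H_ux[i, :] - coupling) / H_uu[i, i]

# Hypothetical two-state, two-input system (illustrative values only).
A = np.array([[0.9, 0.2], [0.0, 0.8]])
B = np.array([[1.0, 0.0], [0.5, 1.0]])
Qc = np.eye(2)              # state weighting
R = np.diag([1.0, 2.0])     # input weighting

P, K = riccati_iteration(A, B, Qc, R)

# Blocks of the quadratic Q-function Q(x, u) = [x; u]^T H [x; u].
H_ux = B.T @ P @ A
H_uu = R + B.T @ P @ B

# One single-variable minimization per input.
K_dec = np.vstack([decoupled_gain(H_ux, H_uu, K, i) for i in range(B.shape[1])])
print(np.allclose(K_dec, K))
```

The check at the end compares the gains recovered one input at a time against the jointly computed gain; how the decoupled Q-functions are learned from observed state transitions without a model is described in the paper itself and is not reproduced here.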

Pages: 630 - 656

Page count: 26

50 records in total

[21] Learning rates for Q-learning
Even-Dar, E
Mansour, Y
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 5 : 1 - 25
[22] Reinforcement Q-Learning and Non-Zero-Sum Games Optimal Tracking Control for Discrete-Time Linear Multi-Input Systems
Zhao, Jin-Gang
2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 277 - 282
[23] CVaR Q-Learning
Stanko, Silvestr
Macek, Karel
COMPUTATIONAL INTELLIGENCE: 11th International Joint Conference, IJCCI 2019, Vienna, Austria, September 17-19, 2019, Revised Selected Papers, 2021, 922 : 333 - 358
[24] An Optimal Control Method for Expressways Entering Ramps Metering Based on Q-Learning
Ji, Xiaofeng
He, Zenghui
ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL I, PROCEEDINGS, 2009, : 739 - 741
[25] Q-learning optimal state estimation and control for discrete systems with unknown parameters
Li J.-N.
Ma S.-K.
Kongzhi yu Juece/Control and Decision, 2021, 35 (12) : 2889 - 2897
[26] Q-learning Based Adaptive Optimal Control for Linear Quadratic Tracking Problem
Shashi Kant Sharma
Sumit Kumar Jha
Amit Dhawan
Manish Tiwari
International Journal of Control, Automation and Systems, 2023, 21 : 2718 - 2725
[27] Q-learning Based Adaptive Optimal Control for Linear Quadratic Tracking Problem
Sharma, Shashi Kant
Jha, Sumit Kumar
Dhawan, Amit
Tiwari, Manish
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (08) : 2718 - 2725
[28] Periodic Q-Learning
Lee, Donghwan
He, Niao
LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 582 - 598
[29] Optimal Tracking Current Control of Switched Reluctance Motor Drives Using Reinforcement Q-Learning Scheduling
Alharkan, Hamad
Saadatmand, Sepehr
Ferdowsi, Mehdi
Shamsi, Pourya
IEEE ACCESS, 2021, 9 : 9926 - 9936
[30] Analytical Greedy Control and Q-Learning for Optimal Power Management of Plug-in Hybrid Electric Vehicles
Liu, Chang
Murphey, Yi Lu
2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017,