Input-Decoupled Q-Learning for Optimal Control

被引：0

作者：

Minh Q. Phan

Seyed Mahdi B. Azad

机构：

[1] Dartmouth College,Thayer School of Engineering

来源：

The Journal of the Astronautical Sciences | 2020年 / 67卷

关键词：

Optimal control; Reinforcement learning; Q-learning; Input-decoupled;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

A design of optimal controllers based on a reinforcement learning method called Q-Learning is presented. Central to Q-Learning is the Q-function which is a function of the state and all input variables. This paper shows that decoupled-in-the-inputs Q-functions exist, and can be used to find the optimal controllers for each input individually. The method thus converts a multiple-variable optimization problem into much simpler single-variable optimization problems while achieving optimality. An explicit model of the system is not required to learn these decoupled Q-functions, but rather the method relies on the ability to probe the system and observe its state transition. Derived within the framework of modern control theory, the method is applicable to both linear and non-linear systems.

引用

页码：630 / 656

页数：26

共 50 条

[41] Q-LEARNING BASED PREDICTIVE RELAY SELECTION FOR OPTIMAL RELAY BEAMFORMING
Dimas, Anastasios
Diamantaras, Konstantinos
Petropulu, Athina P.
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 5030 - 5034
[42] A New Method for Fault Tolerant Control through Q-Learning
Hua, Changsheng
Ding, Steven X.
Shardt, Yuri A. W.
IFAC PAPERSONLINE, 2018, 51 (24): : 38 - 45
[43] Q-learning Approach for Optimal Power Dispatch of Microgrid
Samadi, Esmat
Badri, Ali
Ebrahimpour, Reza
2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, : 935 - 939
[44] Fundamental Q-learning Algorithm in Finding Optimal Policy
Sun, Canyu
2017 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2017, : 243 - 246
[45] Optimal Tracking Control of Nonlinear Multiagent Systems Using Internal Reinforce Q-Learning
Peng, Zhinan
Luo, Rui
Hu, Jiangping
Shi, Kaibo
Nguang, Sing Kiong
Ghosh, Bijoy Kumar
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 4043 - 4055
[46] Adjustable Iterative Q-Learning Schemes for Model-Free Optimal Tracking Control
Qiao, Junfei
Zhao, Mingming
Wang, Ding
Ha, Mingming
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (02): : 1202 - 1213
[47] Off-Policy Interleaved Q-Learning: Optimal Control for Affine Nonlinear Discrete-Time Systems
Li, Jinna
Chai, Tianyou
Lewis, Frank L.
Ding, Zhengtao
Jiang, Yi
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (05) : 1308 - 1320
[48] Fuzzy Q-learning Control for Temperature Systems
Chen, Yeong-Chin
Hung, Lon-Chen
Syamsudin, Mariana
22ND IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2021-FALL), 2021, : 148 - 151
[49] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning
Tan, Fuxiao
Yan, Pengfei
Guan, Xinping
NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483
[50] Q-LEARNING WITH CENSORED DATA
Goldberg, Yair
Kosorok, Michael R.
ANNALS OF STATISTICS, 2012, 40 (01) : 529 - 560

← 1 2 3 4 5 →