Synchronous Reinforcement Learning-Based Control for Cognitive Autonomy

被引:29
作者
Vamvoudakis, Kyriakos G. [1 ]
Kokolakis, Nick-Marios T. [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
来源
FOUNDATIONS AND TRENDS IN SYSTEMS AND CONTROL | 2020年 / 8卷 / 1-2期
基金
美国国家科学基金会;
关键词
ADAPTIVE OPTIMAL-CONTROL; OPTIMAL TRACKING CONTROL; EVENT-TRIGGERED CONTROL; TIME LINEAR-SYSTEMS; H-INFINITY CONTROL; ZERO-SUM GAMES; NONLINEAR-SYSTEMS; STACKELBERG STRATEGY; MULTIAGENT SYSTEMS; CONSENSUS PROBLEMS;
D O I
10.1561/2600000022
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This monograph provides an exposition of recently developed reinforcement learning-based techniques for decision and control in human-engineered cognitive systems. The developed methods learn the solution to optimal control, zero-sum, non zero-sum, and graphical game problems completely online by using measured data along the system trajectories and have proved stability, optimality, and robustness. It is true that games have been shown to be important in robust control for disturbance rejection, and in coordinating activities among multiple agents in networked teams. We also consider cases with intermittent (an analogous to triggered control) instead of continuous learning and apply those techniques for optimal regulation and optimal tracking. We also introduce a bounded rational model to quantify the cognitive skills of a reinforcement learning agent. In order to do that, we leverage ideas from behavioral psychology to formulate differential games where the interacting learning agents have different intelligence skills, and we introduce an iterative method of optimal responses that determine the policy of an agent in adversarial environments. Finally, we present applications of reinforcement learning to motion planning and collaborative target tracking of bounded rational unmanned aerial vehicles.
引用
收藏
页码:1 / 175
页数:175
相关论文
共 50 条
[41]   Integral reinforcement learning-based optimal output feedback control for linear continuous-time systems with input delay [J].
Wang, Gao ;
Luo, Biao ;
Xue, Shan .
NEUROCOMPUTING, 2021, 460 :31-38
[42]   Reinforcement learning-based prescribed finite-time optimal tracking control for a vehicle system regardless of initial position [J].
Liu, Ying ;
Li, Xiaohua ;
Liu, Hui .
PROCEEDINGS OF THE 36TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC 2024, 2024, :3453-3458
[43]   Learning-Based Control: A Tutorial and Some Recent Results [J].
Jiang, Zhong-Ping ;
Bian, Tao ;
Gao, Weinan .
FOUNDATIONS AND TRENDS IN SYSTEMS AND CONTROL, 2020, 8 (03) :176-284
[44]   A reinforcement learning-based optimized backstepping control approach for uncertain electro-hydraulic systems [J].
Wang, Chen ;
Wang, Jianhui ;
Ye, Jinping ;
Guo, Qing ;
Li, Tieshan .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2025, 237
[45]   Reinforcement Learning for Mixed Autonomy Intersections [J].
Yan, Zhongxia ;
Wu, Cathy .
2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, :2089-2094
[46]   Prescribed performance UAV tracking control under disturbance using reinforcement learning-based backstepping [J].
Yang, Meiying ;
Zhu, Hai ;
Zhu, Xiaozhou ;
Liu, Zhe ;
Yao, Wen ;
Chen, Xiaoqian .
CONTROL ENGINEERING PRACTICE, 2025, 164
[47]   Reinforcement Learning-Based Adaptive Finite-Time Performance Constraint Control for Nonlinear Systems [J].
Li, Yongming ;
Li, Kewen ;
Tong, Shaocheng .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (02) :1335-1344
[48]   H∞ optimal output tracking control for Markov jump systems: A reinforcement learning-based approach [J].
Shen, Ying ;
Yao, Cai-Kang ;
Chen, Bo ;
Che, Wei-Wei ;
Wu, Zheng-Guang .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (08) :5149-5167
[49]   Reinforcement Learning-Based Robust Tracking Control for Unknown Markov Jump Systems and its Application [J].
Shen, Hao ;
Wu, Jiacheng ;
Wang, Yun ;
Wang, Jing .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (03) :1211-1215
[50]   Reinforcement learning-based optimal trajectory tracking control of surface vessels under input saturations [J].
Wei, Ziping ;
Du, Jialu .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (06) :3807-3825