Synchronous Reinforcement Learning-Based Control for Cognitive Autonomy

被引:29
作者
Vamvoudakis, Kyriakos G. [1 ]
Kokolakis, Nick-Marios T. [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
来源
FOUNDATIONS AND TRENDS IN SYSTEMS AND CONTROL | 2020年 / 8卷 / 1-2期
基金
美国国家科学基金会;
关键词
ADAPTIVE OPTIMAL-CONTROL; OPTIMAL TRACKING CONTROL; EVENT-TRIGGERED CONTROL; TIME LINEAR-SYSTEMS; H-INFINITY CONTROL; ZERO-SUM GAMES; NONLINEAR-SYSTEMS; STACKELBERG STRATEGY; MULTIAGENT SYSTEMS; CONSENSUS PROBLEMS;
D O I
10.1561/2600000022
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This monograph provides an exposition of recently developed reinforcement learning-based techniques for decision and control in human-engineered cognitive systems. The developed methods learn the solution to optimal control, zero-sum, non zero-sum, and graphical game problems completely online by using measured data along the system trajectories and have proved stability, optimality, and robustness. It is true that games have been shown to be important in robust control for disturbance rejection, and in coordinating activities among multiple agents in networked teams. We also consider cases with intermittent (an analogous to triggered control) instead of continuous learning and apply those techniques for optimal regulation and optimal tracking. We also introduce a bounded rational model to quantify the cognitive skills of a reinforcement learning agent. In order to do that, we leverage ideas from behavioral psychology to formulate differential games where the interacting learning agents have different intelligence skills, and we introduce an iterative method of optimal responses that determine the policy of an agent in adversarial environments. Finally, we present applications of reinforcement learning to motion planning and collaborative target tracking of bounded rational unmanned aerial vehicles.
引用
收藏
页码:1 / 175
页数:175
相关论文
共 50 条
[21]   Reinforcement Learning-Based Optimal Stabilization for Unknown Nonlinear Systems Subject to Inputs With Uncertain Constraints [J].
Zhao, Bo ;
Liu, Derong ;
Luo, Chaomin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (10) :4330-4340
[22]   Reinforcement learning-based saturated adaptive robust neural-network control of underactuated autonomous underwater vehicles [J].
Elhaki, Omid ;
Shojaei, Khoshnam ;
Mehrmohammadi, Parisa .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 197
[23]   Reinforcement Learning-Based Nearly Optimal Control for Constrained-Input Partially Unknown Systems Using Differentiator [J].
Guo, Xinxin ;
Yan, Weisheng ;
Cui, Rongxin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) :4713-4725
[24]   Learning-based T-sHDP(λ) for optimal control of a class of nonlinear discrete-time systems [J].
Yu, Luyang ;
Liu, Weibo ;
Liu, Yurong ;
Alsaadi, Fawaz E. .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (05) :2624-2643
[25]   Reinforcement Learning-Based Fixed-Time Trajectory Tracking Control for Uncertain Robotic Manipulators With Input Saturation [J].
Cao, Shengjie ;
Sun, Liang ;
Jiang, Jingjing ;
Zuo, Zongyu .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) :4584-4595
[26]   Policy Iteration Reinforcement Learning-based control using a Grey Wolf Optimizer algorithm [J].
Zamfirache, Iuliu Alexandru ;
Precup, Radu-Emil ;
Roman, Raul-Cristian ;
Petriu, Emil M. .
INFORMATION SCIENCES, 2022, 585 :162-175
[27]   Quantifying the impact of non-stationarity in reinforcement learning-based traffic signal control [J].
Alegre, Lucas N. ;
Bazzan, Ana L. C. ;
da Silva, Bruno C. .
PEERJ COMPUTER SCIENCE, 2021,
[28]   Safe Reinforcement Learning-Based Robust Approximate Optimal Control for Hypersonic Flight Vehicles [J].
Shi, Lei ;
Wang, Xuesong ;
Cheng, Yuhu .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) :11401-11414
[29]   A Reinforcement Learning-Based Control Approach for Unknown Nonlinear Systems with Persistent Adversarial Inputs [J].
Zhong, Xiangnan ;
He, Haibo .
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[30]   Reinforcement Learning-Based Adaptive Optimal Control for Partially Unknown Systems Using Differentiator [J].
Guo, Xinxin ;
Yan, Weisheng ;
Cui, Rongxin .
2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, :1039-1044