Recovering Robustness in Model-Free Reinforcement Learning

被引：10

作者：

Venkataraman, Harish K.

Seiler, Peter J.

机构：

来源：

2019 AMERICAN CONTROL CONFERENCE (ACC) | 2019年

关键词：

DESIGN;

D O I：

10.23919/acc.2019.8815368

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) is used to directly design a control policy using data collected from the system. This paper considers the robustness of controllers trained via model-free RL. The discussion focuses on posing the (model-free) linear quadratic Gaussian (LQG) problem as a special instance of RL. A simple LQG example is used to demonstrate that RL with partial observations can lead to poor robustness margins. It is proposed to recover robustness by introducing random perturbations at the system input during the RL training. The perturbation magnitude can be used to trade off performance for increased robustness. Two simple examples are presented to demonstrate the proposed method for enhancing robustness during RL training.

引用

收藏

页码：4210 / 4216

页数：7

相关论文

共 50 条

[21] Model-free compensation learning control of asymmetric hysteretic systems with initial state learning [J].

Zhang, Yangming ;

Luo, Biao ;

Zhang, Yanqiong ;

Sun, Shanxun .

JOURNAL OF SOUND AND VIBRATION, 2024, 584

[22] High-order model-free adaptive iterative learning control [J].

Xu, Jian ;

Lin, Na ;

Chi, Ronghu ;

Li, Xueqiang .

TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2023, 45 (10) :1886-1895

[23] Model-Free Learning Control of Nonlinear Discrete-Time Systems [J].

Sadegh, Nader .

2011 AMERICAN CONTROL CONFERENCE, 2011, :3553-3558

[24] Two-dimensional reinforcement learning model-free fault-tolerant control for batch processes against multi- faults [J].

Wang, Limin ;

Jia, Linzhu ;

Zou, Tao ;

Zhang, Ridong ;

Gao, Furong .

COMPUTERS & CHEMICAL ENGINEERING, 2025, 192

[25] A novel model-free robust saturated reinforcement learning-based controller for quadrotors guaranteeing prescribed transient and steady state performance [J].

Elhaki, Omid ;

Shojaei, Khoshnam .

AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 119

[26] Data-driven model-free slip control of anti-lock braking systems using reinforcement Q-learning [J].

Radac, Mircea-Bogdan ;

Precup, Radu-Emil .

NEUROCOMPUTING, 2018, 275 :317-329

[27] Model-free Gradient Iterative Learning Control for Non-linear Systems [J].

Huo, B. ;

Freeman, C. T. ;

Liu, Y. .

IFAC PAPERSONLINE, 2019, 52 (29) :304-309

[28] A Model-Free Loop-Shaping Method based on Iterative Learning Control [J].

Shih, Li-Wei ;

Chen, Cheng-Wei .

IFAC PAPERSONLINE, 2020, 53 (02) :1403-1408

[29] Off-policy reinforcement learning-based novel model-free minmax fault-tolerant tracking control for industrial processes [J].

Li, Xueyu ;

Luo, Qiuwen ;

Wang, Limin ;

Zhang, Ridong ;

Gao, Furong .

JOURNAL OF PROCESS CONTROL, 2022, 115 :145-156

[30] Model-Free Deep Reinforcement Learning with Multiple Line-of-Sight Guidance Laws for Autonomous Underwater Vehicles Full-Attitude and Velocity Control [J].

Yuan, Chengren ;

Shuai, Changgeng ;

Zhang, Zhanshuo ;

Ma, Jianguo ;

Fang, Yuan ;

Sun, Yuchen .

ADVANCED INTELLIGENT SYSTEMS, 2025,

← 1 2 3 4 5 →