Recovering Robustness in Model-Free Reinforcement Learning

被引:12
作者
Venkataraman, Harish K.
Seiler, Peter J.
机构
来源
2019 AMERICAN CONTROL CONFERENCE (ACC) | 2019年
关键词
DESIGN;
D O I
10.23919/acc.2019.8815368
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) is used to directly design a control policy using data collected from the system. This paper considers the robustness of controllers trained via model-free RL. The discussion focuses on posing the (model-free) linear quadratic Gaussian (LQG) problem as a special instance of RL. A simple LQG example is used to demonstrate that RL with partial observations can lead to poor robustness margins. It is proposed to recover robustness by introducing random perturbations at the system input during the RL training. The perturbation magnitude can be used to trade off performance for increased robustness. Two simple examples are presented to demonstrate the proposed method for enhancing robustness during RL training.
引用
收藏
页码:4210 / 4216
页数:7
相关论文
共 50 条
[41]   Brake and velocity model-free control on an actual vehicle [J].
Polack, Philip ;
Delprat, Sebastien ;
D'Andrea-Novel, Brigitte .
CONTROL ENGINEERING PRACTICE, 2019, 92
[42]   Adaptive model-free disturbance rejection for continuum robots [J].
Yilmaz, Cemal Tugrul ;
Watson, Connor ;
Morimoto, Tania K. ;
Krstic, Miroslav .
AUTOMATICA, 2025, 171
[43]   A Model-Free Distributed Approach for Wind Plant Control [J].
Gebraad, Pieter M. O. ;
van Dam, Filip C. ;
van Wingerden, Jan-Willem .
2013 AMERICAN CONTROL CONFERENCE (ACC), 2013, :628-633
[44]   Random Learning Leads to Faster Convergence in 'Model-Free' ILC: With Application to MIMO Feedforward in Industrial Printing [J].
Aarnoudse, Leontine ;
Oomen, Tom .
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2025, 39 (07) :1521-1532
[45]   Model-free sliding mode control, theory and application [J].
Ebrahimi, Nahid ;
Ozgoli, Sadjaad ;
Ramezani, Amin .
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2018, 232 (10) :1292-1301
[46]   On Model-Free Adaptive Control and Its Stability Analysis [J].
Hou, Zhongsheng ;
Xiong, Shuangshuang .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (11) :4555-4569
[47]   EXPERIMENTAL COMPARISON OF MODEL-FREE VIBRATION CONTROL BASED ON VIRTUAL CONTROLLED OBJECT AND MODEL-BASED CONTROL: ROBUSTNESS TO CHARACTERISTIC CHANGES IN ACTUAL CONTROLLED OBJECT [J].
Yonezawa, Ansei ;
Yonezawa, Heisei ;
Kajiwara, Itsuro .
PROCEEDINGS OF ASME 2023 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2023, VOL 6, 2023,
[48]   Model-Free H∞ Optimal Tracking Control of Constrained Nonlinear Systems via an Iterative Adaptive Learning Algorithm [J].
Hou, Jiaxu ;
Wang, Ding ;
Liu, Derong ;
Zhang, Yun .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11) :4097-4108
[49]   Model-Free Control for High Pressure in a Direct Injection System [J].
Carvalho, Amauri Dias ;
Angelico, Bruno Augusto ;
Lagana, Armando Antonio Maria .
JOURNAL OF CONTROL AUTOMATION AND ELECTRICAL SYSTEMS, 2023, 34 (04) :689-699
[50]   Iterative Learning Model-Free Control for Networked Systems With Dual-Direction Data Dropouts and Actuator Faults [J].
Chen, Jiannan ;
Hua, Changchun ;
Guan, Xinping .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (11) :5232-5240