Reinforcement Learning Motivated Feedforward Control Approach for Disturbance Rejection and Tracking

Cited by: 0

Authors:

Faktorovich, I. [1]

Bohn, C. [2]

Vogelsang, J. [1]

Affiliations:

[1] Volkswagen AG, Berliner Ring 2, D-38440 Wolfsburg, Germany

[2] Tech Univ Clausthal, Leibnizstr 28, D-38678 Clausthal Zellerfeld, Germany

Source:

2021 European Control Conference (ECC) | 2021

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

This paper presents a new discrete-time feedforward control method for disturbance rejection and tracking that enhances the dynamic response of linear time-invariant (LTI) systems. The proposed approach uses optimization techniques prevalent in Reinforcement Learning (RL) and requires neither prior knowledge of the plant dynamics nor a disturbance model. In the usual RL or Adaptive Dynamic Programming frameworks, the learning process is driven by temporal differences that are used to approximate an optimal cost or value function; this paper instead proposes a learning scheme based on immediate rewards. The connection between reward-based RL, system identification, and adaptive control is thereby addressed. The paper further shows how classical adaptive feedforward control problems can be transformed into an RL setting, and it presents a sample-efficient algorithm together with some practical implementation advice.
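The idea of learning a feedforward law from immediate rewards can be illustrated with a minimal sketch. It is not the algorithm from the paper: it uses a normalized LMS update, a classical adaptive-filter rule, read as gradient ascent on the immediate reward r = -y². The static disturbance path `path`, the function name `run_feedforward_learning`, and all numeric values are assumptions made up for illustration.

```python
import random


def run_feedforward_learning(steps=2000, step_size=0.5, seed=0):
    """Learn FIR feedforward gains from the immediate reward r = -y**2.

    Illustrative sketch only (normalized LMS), not the paper's algorithm.
    """
    rng = random.Random(seed)
    path = [0.8, -0.3, 0.1]     # hypothetical disturbance path, unknown to the learner
    gains = [0.0] * len(path)   # feedforward gains to be learned
    window = [0.0] * len(path)  # most recent measured disturbance samples
    rewards = []

    for _ in range(steps):
        # Shift in a new measured disturbance sample.
        window = [rng.uniform(-1.0, 1.0)] + window[:-1]

        # Feedforward action and the resulting residual output.
        u = -sum(g * x for g, x in zip(gains, window))
        y = sum(p * x for p, x in zip(path, window)) + u

        # Immediate reward: penalize the residual output.
        rewards.append(-y * y)

        # Gradient ascent on the reward: d(-y^2)/dg_i = 2 * y * x_i.
        # The factor 2 is absorbed into step_size; the step is normalized
        # by the input energy for a scale-independent learning rate.
        norm = 1e-8 + sum(x * x for x in window)
        gains = [g + step_size * y * x / norm for g, x in zip(gains, window)]

    return gains, rewards


if __name__ == "__main__":
    learned, history = run_feedforward_learning()
    print("learned gains:", [round(g, 4) for g in learned])
    print("mean reward, last 100 steps:", sum(history[-100:]) / 100)
```

In this noise-free toy setting the learned gains approach the assumed disturbance path, so the feedforward action cancels the disturbance and the reward tends toward zero. No value function or temporal-difference target is involved; each update uses only the reward observed at the current step.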


Pages: 138-143

Page count: 6
