Adaptive predictive control of a differential drive robot tuned with reinforcement learning

Cited by: 18

Authors:

Jardine, P. Travis ^{[1]}

Kogan, Michael ^{[2]}

Givigi, Sidney N. ^{[2]}

Yousefi, Shahram ^{[1]}

Affiliations:

[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON, Canada

[2] Royal Mil Coll Canada, Dept Elect & Comp Engn, Kingston, ON, Canada

Source:

INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING | 2019 / Vol. 33 / Issue 02

Funding:

Natural Sciences and Engineering Research Council of Canada;

Keywords:

feedback linearization; machine learning; model predictive control; reinforcement learning;

DOI:

10.1002/acs.2882

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

One of the most important steps in designing a model predictive control strategy is selecting appropriate parameters for the relative weights of the objective function. Typically, these are selected through trial and error to meet the desired performance. In this paper, a reinforcement learning technique called learning automata is used to select appropriate parameters for the controller of a differential drive robot through a simulation process. Results of the simulation show that the parameters always converge, although to different values. A controller chosen by the learning process is then ported to a real platform. The selected controller is shown to control the robot better than a standard model predictive control.


Pages: 410-423

Page count: 14

22 references in total

[1] Autonomous Construction of Multiple Structures Using Learning Automata: Description and Experimental Validation [J].

Barros dos Santos, Sergio R. ;

Givigi, Sidney N., Jr. ;

Nascimento, Cairo L., Jr. .

IEEE SYSTEMS JOURNAL, 2015, 9 (04) :1376-1387

[2]

Camacho E.F., 2003, MODEL PREDICTIVE CON

[3] Model Predictive Control Tuning Methods: A Review [J].

Garriga, Jorge L. ;

Soroush, Masoud .

INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2010, 49 (08) :3505-3515

[4]

Han K, 2006, 2006 6 WORLD C INT C

[5]

He S, 2005, P 12 INT C ADV ROB I

[6]

Jardine P. T., 2017, 2017 ANN IEEE INT SY, P1

[7]

Jardine PT, 2017, 20 WORLD C INT FED A

[8] Vision-Based Model Predictive Control for Steering of a Nonholonomic Mobile Robot [J].

Li, Zhijun ;

Yang, Chenguang ;

Su, Chun-Yi ;

Deng, Jun ;

Zhang, Weidong .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2016, 24 (02) :553-564

[9] Optimal linear filtering for networked systems with communication constraints, fading measurements, and multiplicative noises [J].

Liu, Wei ;

Zhang, Hongwei ;

Yu, Kaijiang ;

Tan, Xingguo .

INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2017, 31 (07) :1019-1039

[10] PREDICTIVE CONTROLLER-DESIGN BY PRINCIPAL COMPONENTS-ANALYSIS [J].

MAURATH, PR ;

LAUB, AJ ;

SEBORG, DE ;

MELLICHAMP, DA .

INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 1988, 27 (07) :1204-1212
