Optimal control of a two-wheeled self-balancing robot by reinforcement learning

Cited by: 27

Authors:

Guo, Linyuan [1]

Rizvi, Syed Ali Asad [1]

Lin, Zongli [1]

Affiliations:

[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA

Source:

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL | 2021 / Vol. 31 / Issue 06

Keywords:

optimal control; Q-learning; reinforcement learning; robustness; two-wheeled self-balancing robot; MOBILE; MOTION;

DOI:

10.1002/rnc.5058

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

This article concerns optimal control of the linear motion, tilt motion, and yaw motion of a two-wheeled self-balancing robot (TWSBR). Traditional optimal control methods for the TWSBR usually require a precise model of the system, and other control methods exist that achieve stabilization in the face of parameter uncertainties. In practical applications, it is often desirable to realize optimal control in the absence of the precise knowledge of the system parameters. This article proposes to use a new feedback-based reinforcement learning method to solve the linear quadratic regulation (LQR) control problem for the TWSBR. The proposed control scheme is completely online and does not require any knowledge of the system parameters. The proposed input decoupling mechanism and pre-feedback law overcome the commonly encountered computational difficulties in implementing the learning algorithms. Both state feedback optimal control and output feedback optimal control are presented. Numerical simulation shows that the proposed optimal control scheme is capable of stabilizing the system and converging to the LQR solution obtained through solving the algebraic Riccati equation.
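The abstract's central claim, that a model-free Q-learning scheme can converge to the gain obtained from the algebraic Riccati equation, can be illustrated on a toy problem. The sketch below is not the article's algorithm and not a TWSBR model: it assumes a scalar discrete-time plant with made-up values (`A_TRUE`, `B_TRUE`), made-up cost weights (`Q_W`, `R_W`), and a hand-picked stabilizing initial gain, and it uses a generic least-squares Q-function policy iteration. All function names are hypothetical. The learner sees only sampled transitions and stage costs, never the plant parameters.

```python
import random

# Hypothetical scalar plant x_next = A_TRUE * x + B_TRUE * u (open-loop unstable).
# These values are illustration-only assumptions, hidden from the learner.
A_TRUE, B_TRUE = 1.1, 0.5
Q_W, R_W = 1.0, 1.0  # assumed stage cost: Q_W * x^2 + R_W * u^2


def solve3(mat, rhs):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    m = [row[:] + [b] for row, b in zip(mat, rhs)]
    for c in range(3):
        piv = max(range(c, 3), key=lambda r: abs(m[r][c]))
        m[c], m[piv] = m[piv], m[c]
        for r in range(c + 1, 3):
            f = m[r][c] / m[c][c]
            for j in range(c, 4):
                m[r][j] -= f * m[c][j]
    sol = [0.0, 0.0, 0.0]
    for i in reversed(range(3)):
        s = m[i][3] - sum(m[i][j] * sol[j] for j in range(i + 1, 3))
        sol[i] = s / m[i][i]
    return sol


def features(x, u):
    """Quadratic basis so that Q(x, u) = h_xx*x^2 + 2*h_xu*x*u + h_uu*u^2."""
    return [x * x, 2.0 * x * u, u * u]


def evaluate_policy(samples, k):
    """Least-squares fit of the Q-function of the policy u = -k * x.

    Uses the Bellman equation Q(x, u) - Q(x_next, -k * x_next) = stage cost,
    which needs only observed transitions, not A_TRUE or B_TRUE.
    """
    ata = [[0.0] * 3 for _ in range(3)]
    atb = [0.0] * 3
    for x, u, x_next in samples:
        now = features(x, u)
        nxt = features(x_next, -k * x_next)
        d = [fa - fb for fa, fb in zip(now, nxt)]
        cost = Q_W * x * x + R_W * u * u
        for i in range(3):
            atb[i] += d[i] * cost
            for j in range(3):
                ata[i][j] += d[i] * d[j]
    return solve3(ata, atb)


def learn_gain(k_init=1.0, iterations=10, n_samples=50, seed=0):
    """Policy iteration on the learned Q-function; returns the feedback gain."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n_samples):
        x, u = rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0)
        samples.append((x, u, A_TRUE * x + B_TRUE * u))  # the "measured" next state
    k = k_init  # must be stabilizing: |A_TRUE - B_TRUE * k| < 1
    for _ in range(iterations):
        _h_xx, h_xu, h_uu = evaluate_policy(samples, k)
        k = h_xu / h_uu  # greedy improvement: minimize Q(x, u) over u
    return k


def riccati_gain(iterations=1000):
    """Model-based reference: fixed-point iteration of the scalar Riccati equation."""
    p = Q_W
    for _ in range(iterations):
        p = Q_W + A_TRUE**2 * p - (A_TRUE * B_TRUE * p) ** 2 / (R_W + B_TRUE**2 * p)
    return A_TRUE * B_TRUE * p / (R_W + B_TRUE**2 * p)


if __name__ == "__main__":
    print("learned gain:", learn_gain())
    print("Riccati gain:", riccati_gain())
```

Because this toy plant is deterministic and the sampled inputs are sufficiently varied, the least-squares fit is exact at each step, so the two gains should agree to within numerical precision. The article's setting (three coupled motions, output feedback, input decoupling, and a pre-feedback law) is considerably richer than this scalar case.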

Pages: 1885-1904

Page count: 20

References (23 in total):

[1] [Anonymous], 1989, LEARNING DELAYED REW

[2] [Anonymous], 1995, Optimal Control

[3] [Anonymous], P 2009 IEEE INT C SY

[4] Bertsekas D. P., 1996, Neuro-dynamic Programming

[5] Bradtke SJ, 1994, P 1994 AM CONTR C 19

[6] Do, Khac Duc; Seet, Gerald. Motion Control of a Two-Wheeled Mobile Vehicle with an Inverted Pendulum [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2010, 60 (3-4): 577-605

[7] Grasser, F; D'Arrigo, A; Colombi, S; Rufer, AC. JOE: A mobile, inverted pendulum [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2002, 49 (01): 107-114

[8] Huang, Jian; Guan, Zhi-Hong; Matsuno, Takayuki; Fukuda, Toshio; Sekiyama, Kosuke. Sliding-Mode Velocity Control of Mobile-Wheeled Inverted-Pendulum Systems [J]. IEEE TRANSACTIONS ON ROBOTICS, 2010, 26 (04): 750-758

[9] Jeong, Seonghee; Takahashi, Takayuki. Wheeled inverted pendulum type assistant robot: design concept and mobile control [J]. INTELLIGENT SERVICE ROBOTICS, 2008, 1 (04): 313-320

[10] Landelius T., 1997, THESIS
