Optimal control of a two-wheeled self-balancing robot by reinforcement learning

被引:22
作者
Guo, Linyuan [1 ]
Rizvi, Syed Ali Asad [1 ]
Lin, Zongli [1 ]
机构
[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
关键词
optimal control; Q-learning; reinforcement learning; robustness; two-wheeled self-balancing robot; MOBILE; MOTION;
D O I
10.1002/rnc.5058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article concerns optimal control of the linear motion, tilt motion, and yaw motion of a two-wheeled self-balancing robot (TWSBR). Traditional optimal control methods for the TWSBR usually require a precise model of the system, and other control methods exist that achieve stabilization in the face of parameter uncertainties. In practical applications, it is often desirable to realize optimal control in the absence of the precise knowledge of the system parameters. This article proposes to use a new feedback-based reinforcement learning method to solve the linear quadratic regulation (LQR) control problem for the TWSBR. The proposed control scheme is completely online and does not require any knowledge of the system parameters. The proposed input decoupling mechanism and pre-feedback law overcome the commonly encountered computational difficulties in implementing the learning algorithms. Both state feedback optimal control and output feedback optimal control are presented. Numerical simulation shows that the proposed optimal control scheme is capable of stabilizing the system and converging to the LQR solution obtained through solving the algebraic Riccati equation.
引用
收藏
页码:1885 / 1904
页数:20
相关论文
共 23 条
  • [1] [Anonymous], 1998, INTRO REINFORCEMENT
  • [2] [Anonymous], P 2009 IEEE INT C SY
  • [3] Bertsekas Dimitri P., 1996, NEURODYNAMIC PROGRAM
  • [4] Bradtke SJ, 1994, P 1994 AM CONTR C 19
  • [5] Cornish Christopher John, 1989, (Ph.D. thesis
  • [6] Motion Control of a Two-Wheeled Mobile Vehicle with an Inverted Pendulum
    Do, Khac Duc
    Seet, Gerald
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2010, 60 (3-4) : 577 - 605
  • [7] JOE: A mobile, inverted pendulum
    Grasser, F
    D'Arrigo, A
    Colombi, S
    Rufer, AC
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2002, 49 (01) : 107 - 114
  • [8] Sliding-Mode Velocity Control of Mobile-Wheeled Inverted-Pendulum Systems
    Huang, Jian
    Guan, Zhi-Hong
    Matsuno, Takayuki
    Fukuda, Toshio
    Sekiyama, Kosuke
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2010, 26 (04) : 750 - 758
  • [9] Wheeled inverted pendulum type assistant robot: design concept and mobile control
    Jeong, Seonghee
    Takahashi, Takayuki
    [J]. INTELLIGENT SERVICE ROBOTICS, 2008, 1 (04) : 313 - 320
  • [10] Landelius T., 1997, THESIS