Robust Optimal Control of Point-Feet Biped Robots Using a Reinforcement Learning Approach

被引:3
作者
Hou, Yi-You [1 ]
Lin, Ming-Hung [2 ]
Anjidani, Majid [3 ]
Nik, Hassan Saberi [4 ]
机构
[1] Natl Kaohsiung Univ Sci & Technol, Dept Intelligent Commerce, Kaohsiung 807618, Taiwan
[2] Cheng Shiu Univ, Dept Elect Engn, Kaohsiung 83347, Taiwan
[3] Payame Noor Univ, Dept Comp, POB 19395-3697, Tehran, Iran
[4] Univ Neyshabur, Dept Math & Stat, Neyshabur, Iran
关键词
Legged robots; Reinforcement learning; Robust gait optimization; STABLE WALKING; PLANAR; LOCOMOTION; SYSTEMS;
D O I
10.1080/03772063.2024.2362343
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Gait design for walking biped robots, that can preserve stability against a known range of disturbances, is very important in real applications. Designing an exponentially stable walking gait with desired features for biped robots has been recently done by an online reinforcement learning method. However, the designed gait might not be robust enough against disturbances. In this paper, we extend a robust version of the method against modeling errors/disturbances. It is done by minimizing the costs of worst rollouts which are generated in the presence of different modeling errors/disturbances. The proposed method's ability to adapt the controller is studied for some robust applications. The simulation shows that the resulted gaits are exponentially stable and robust against modeling errors/disturbances in a feasible range.
引用
收藏
页码:7831 / 7846
页数:16
相关论文
共 29 条
[1]   A novel online gait optimization approach for biped robots with point-feet [J].
Anjidani, Majid ;
Motlagh, M. R. Jahed ;
Fathy, M. ;
Ahmadabadi, M. Nili .
ESAIM-CONTROL OPTIMISATION AND CALCULUS OF VARIATIONS, 2019, 25
[2]   Optimization of venture portfolio based on LSTM and dynamic programming [J].
Ban, Jiuchao ;
Wang, Yiran ;
Liu, Bingjie ;
Li, Hongjun .
AIMS MATHEMATICS, 2023, 8 (03) :5462-5483
[3]   RABBIT: A testbed for advanced control theory [J].
Chevallereau, C ;
Abba, G ;
Aoustin, Y ;
Plestan, F ;
Westervelt, ER ;
Canudas-de-Wit, C ;
Grizzle, JW .
IEEE CONTROL SYSTEMS MAGAZINE, 2003, 23 (05) :57-79
[4]   Asymptotically stable running for a five-link, four-actuator, planar, bipedal robot [J].
Chevallereau, C ;
Westervelt, ER ;
Grizzle, JW .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2005, 24 (06) :431-464
[5]   Asymptotically Stable Walking of a Five-Link Underactuated 3-D Bipedal Robot [J].
Chevallereau, Christine ;
Grizzle, J. W. ;
Shih, Ching-Long .
IEEE TRANSACTIONS ON ROBOTICS, 2009, 25 (01) :37-50
[6]  
Dai HK, 2012, IEEE DECIS CONTR P, P1207, DOI 10.1109/CDC.2012.6425971
[7]   An MPC-based two-dimensional push recovery of a quadruped robot in trotting gait using its reduced virtual model [J].
Dini, Navid ;
Majd, Vahid Johari .
MECHANISM AND MACHINE THEORY, 2020, 146
[8]   Reinforcement learning-based saturated adaptive robust neural-network control of underactuated autonomous underwater vehicles [J].
Elhaki, Omid ;
Shojaei, Khoshnam ;
Mehrmohammadi, Parisa .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 197
[9]   A novel model-free robust saturated reinforcement learning-based controller for quadrotors guaranteeing prescribed transient and steady state performance [J].
Elhaki, Omid ;
Shojaei, Khoshnam .
AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 119
[10]   Asymptotically Stable Gait Primitives for Planning Dynamic Bipedal Locomotion in Three Dimensions [J].
Gregg, Robert D. ;
Bretl, Timothy ;
Spong, Mark W. .
2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, :1695-1702