Deep-reinforcement-learning-based gait pattern controller on an uneven terrain for humanoid robots

被引：1

作者：

Kuo, Ping-Huan ^{[1
,2
]}

Pao, Chieh-Hsiu ^{[1
]}

Chang, En-Yi ^{[1
]}

Yau, Her-Terng ^{[1
,2
,3
,4
]}

机构：

[1] Natl Chung Cheng Univ, Dept Mech Engn, Chiayi, Taiwan

[2] Natl Chung Cheng Univ, Adv Inst Mfg High Tech Innovat AIM HI, Chiayi, Taiwan

[3] Natl Chung Cheng Univ, Dept Mech Engn, Chiayi 62102, Taiwan

[4] Natl Chung Cheng Univ, Adv Inst Mfg High Tech Innovat AIM HI, Chiayi 62102, Taiwan

来源：

INTERNATIONAL JOURNAL OF OPTOMECHATRONICS | 2023年 / 17卷 / 01期

关键词：

Gait pattern generator; humanoid robots; deep reinforcement learning; PPO2; WALKING; FUSION; SYSTEM;

D O I：

10.1080/15599612.2023.2222146

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Although conventional gait pattern control in humanoid robots is typically performed on flat terrains, the roads that people walk on every day have bumps and potholes. Therefore, to make humanoid robots more similar to humans, the movement parameters of these robots should be modified to allow them to adapt to uneven terrains. In this study, to solve this problem, reinforcement learning (RL) was used to allow humanoid robots to engage in self-training and automatically adjust their parameters for ultimate gait pattern control. However, RL has multiple types, and each type has its own benefits and shortcomings. Therefore, a series of experiments were performed, and the results indicated that proximal policy optimization (PPO), combining advantage actor-critic and trust region policy optimization, was the most suitable method. Hence, an improved version of PPO, called PPO2, was used, and the experimental results indicated that the combination of deep RL with data preprocessing methods, such as wavelet transform and fuzzification, facilitated the gait pattern control and balance of humanoid robots.

引用

页数：26

共 34 条

[1]

[Anonymous], EXP VID

[2] Semiclosed Greenhouse Climate Control Under Uncertainty via Machine Learning and Data-Driven Robust Model Predictive Control [J].

Chen, Wei-Han ;

You, Fengqi .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2022, 30 (03) :1186-1197

[3] On-Line Gait Adjustment for Humanoid Robot Robust Walking Based on Divergence Component of Motion [J].

Dong, Sheng ;

Yuan, Zhaohui ;

Yu, Xiaojun ;

Zhang, Jianrui ;

Sadiq, Muhammad Tariq ;

Zhang, Fuli .

IEEE ACCESS, 2019, 7 :159507-159518

[4] Learning to Adjust and Refine Gait Patterns for a Biped Robot [J].

Hwang, Kao-Shing ;

Lin, Jin-Ling ;

Yeh, Keng-Hao .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 45 (12) :1481-1490

[5] A SIMPLE AND NOVEL INTEGRATED OPTO-ELECTRONIC SYSTEM FOR BLOOD VOLUME PULSE SENSING AND HEART RATE MONITORING [J].

Jayasree, Vadakke Kadangote ;

Shaija, Palackappillil Jacob ;

Nampoori, Vadakkedathu Parameswaran Narayanan ;

Girijavallabhan, Chakkalakkal Pavothil ;

Radhakrishnan, Padmanabhan .

INTERNATIONAL JOURNAL OF OPTOMECHATRONICS, 2007, 1 (04) :392-403

[6] Wavelet Domain Multifractal Analysis for Static and Dynamic Texture Classification [J].

Ji, Hui ;

Yang, Xiong ;

Ling, Haibin ;

Xu, Yong .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (01) :286-299

[7] Natural Oscillation Gait in Humanoid Biped Locomotion [J].

Khan, Uzair Ijaz ;

Chen, Zhiyong .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2020, 28 (06) :2309-2321

[8] ON VISUO-INERTIAL FUSION FOR ROBOT POSE ESTIMATION USING HIERARCHICAL FUZZY SYSTEMS [J].

Kyriakoulis, Nikolaos ;

Gasteratos, Antonios .

INTERNATIONAL JOURNAL OF OPTOMECHATRONICS, 2012, 6 (01) :17-36

[9]

Li D., 2021, DDPG CAR FOLLOWING M

[10] Motion Prediction and Robust Tracking of a Dynamic and Temporarily-Occluded Target by an Unmanned Aerial Vehicle [J].

Li, Jun-Ming ;

Chen, Ching-Wen ;

Cheng, Teng-Hu .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2021, 29 (04) :1623-1635

← 1 2 3 4 →