A simple trajectory optimization method with Q-learning for biped gait

Times cited: 0

Authors:

Hu, LY [1]

Sun, ZQ [1]

Affiliation:

[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China

Source:

DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS | 2005, Vol. 1

Keywords:

DOI:

Not available

Chinese Library Classification:

O29 [Applied Mathematics];

Discipline classification code:

070104 ;

Abstract:

Generating stable and intelligent gait trajectories is an important research issue in biped robot walking. This paper proposes a simple new optimization approach, based on reinforcement learning, for obtaining a trajectory that is both stable and reasonable. For a given robot with a predefined rough gait, feasible actions are first taken on all joints at five key points in the gait to generate different kinds of trajectories, which are then clustered for learning according to the ZMP (zero moment point) stability criterion and the required torques. Q-learning is then used to produce the most stable trajectory with feasible torque. According to the simulation results, the learned trajectory has a clearly better motion curve merit than the trajectory before learning, and the corresponding ZMP trajectory moves progressively toward the middle of the stable region.
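The abstract gives no implementation details, so the following is only a minimal sketch of tabular Q-learning over adjustments at five key points, not the authors' method. Everything in it is an assumption made for illustration: the three-valued action set, the toy ZMP offsets, the toy torque model and limit, the reward shape, and the hyperparameters. The clustering step mentioned in the abstract is omitted.

```python
import random

# All numbers below are hypothetical toy values, not taken from the paper.
ACTIONS = (-1, 0, 1)                            # joint adjustment applied at a key point
ROUGH_ZMP_OFFSET = (0.5, -0.5, 0.0, 0.5, -0.5)  # ZMP distance from region centre, rough gait
BASE_TORQUE = (1.0, 4.0, 1.0, 1.0, 1.0)         # torque needed with no adjustment
TORQUE_LIMIT = 5.0


def reward(k, a):
    """Toy reward: penalise infeasible torque, otherwise ZMP deviation from centre."""
    torque = BASE_TORQUE[k] + 2.0 * abs(a)
    if torque > TORQUE_LIMIT:
        return -10.0
    return -abs(ROUGH_ZMP_OFFSET[k] + 0.5 * a)


def learn_adjustments(episodes=2000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Run epsilon-greedy tabular Q-learning; return the greedy action per key point."""
    rng = random.Random(seed)
    n = len(ROUGH_ZMP_OFFSET)
    q = [[0.0] * len(ACTIONS) for _ in range(n)]
    for _ in range(episodes):
        for k in range(n):                      # one episode visits key points in order
            if rng.random() < epsilon:
                i = rng.randrange(len(ACTIONS))
            else:
                i = max(range(len(ACTIONS)), key=lambda j: q[k][j])
            r = reward(k, ACTIONS[i])
            future = max(q[k + 1]) if k + 1 < n else 0.0  # last key point is terminal
            # Standard Q-learning update toward r + gamma * max_a' Q(s', a').
            q[k][i] += alpha * (r + gamma * future - q[k][i])
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda j: q[k][j])] for k in range(n)]


if __name__ == "__main__":
    print(learn_adjustments())
```

With these toy numbers, the greedy policy should leave the second key point unadjusted: any adjustment there exceeds the assumed torque limit, so the learner accepts a small ZMP deviation in exchange for a feasible torque.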

Pages: 329-332

Page count: 4
