A simple trajectory optimization method with Q-learning for biped gait

被引:0
|
作者
Hu, LY [1 ]
Sun, ZQ [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
关键词
D O I
暂无
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Stable and intelligent gait trajectory generation is one of the important research issues in biped robot walking. This paper proposed a new simple optimization approach based on reinforcement learning method to achieve a both stable and reasonable trajectory. For a given robot with predefined rough gait, feasible actions were firstly taken on all joints at five key points in the gait to generate different kinds of trajectories, which were clustered later according to the ZMP stability criterion and required torques for learning. The most stable trajectory with feasible torque will finally be produced by using Q-learning method. According to the simulation results, learned trajectory has an obviously better motion curve merit than that before learning. And the corresponding ZMP trajectory approaches continuously toward the middle part of the stable region.
引用
收藏
页码:329 / 332
页数:4
相关论文
共 50 条
  • [1] Estimating probability distribution with Q-learning for biped gait generation and optimization
    Hu, Lingyun
    Zhou, Changjiu
    Sun, Zengqi
    2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 362 - +
  • [2] Gait Balance and Acceleration of a Biped Robot Based on Q-Learning
    Lin, Jin-Ling
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Chen, Yu-Jen
    IEEE ACCESS, 2016, 4 : 2439 - 2449
  • [3] Estimating biped gait using spline-based probability distribution function with Q-learning
    Hu, Lingyun
    Zhou, Changjiu
    Sun, Zengqi
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2008, 55 (03) : 1444 - 1452
  • [4] Modeling and fuzzy Q-learning control of biped walking
    Meng Joo Er
    Yi Zhou
    Proceedings of the 24th Chinese Control Conference, Vols 1 and 2, 2005, : 641 - 646
  • [5] A Simple Trajectory Generation Method for Biped Walking
    Feng, Shuai
    Sun, Zengqi
    2008 10TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION: ICARV 2008, VOLS 1-4, 2008, : 2078 - 2082
  • [6] Route Optimization with Q-learning
    Demircan, Semiye
    Aydin, Musa
    Durduran, S. Savas
    PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE (ACS'08): RECENT ADVANCES ON APPLIED COMPUTER SCIENCE, 2008, : 416 - +
  • [7] Study on structural topology optimization of Q-learning cell method
    Song, Xuming
    Shi, Zheyu
    Bao, Shipeng
    Tang, Mian
    Journal of Railway Science and Engineering, 2024, 21 (08) : 3274 - 3285
  • [8] Trajectory Optimization of Flying Energy Sources using Q-Learning to Recharge Hotspot UAVs
    Hoseini, Sayed Amir
    Hassan, Jahan
    Bokani, Ayub
    Kanhere, Salil S.
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 683 - 688
  • [9] Footstep planning for biped robot based on fuzzy Q-learning approach
    Sabourin, Christophe
    Madani, Kurosh
    Yu, Weiwei
    Yan, Jie
    ICINCO 2008: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL RA-1: ROBOTICS AND AUTOMATION, VOL 1, 2008, : 183 - +
  • [10] Robotic Arm Assistance System Based on Simple Stereo Matching and Q-Learning Optimization
    Hsieh, Yi-Zeng
    Lin, Shih-Syun
    IEEE SENSORS JOURNAL, 2020, 20 (18) : 10945 - 10954