Incorporation of perception-based information in robot learning using fuzzy reinforcement learning agents

被引:0
|
作者
Zhou Changjiu
Meng Qingchun
Guo Zhongwen
Qu Wiefen
Yin Bo
机构
[1] Singapore Polytechnic,School of Electrical and Electronic Engineering
[2] Ocean University of Qingdao,Computer Science Department
[3] State Key Laboratory of Intelligent Systems and Technologies in Tsinghua University,undefined
关键词
Robot learning; reinforcement learning agents; neural-fuzzy systems; genetic algorithms; biped robot;
D O I
10.1007/s11802-002-0038-0
中图分类号
学科分类号
摘要
Robot learning in unstructured environments has been proved to be an extremely challenging problem, mainly because of many uncertainties always present in the real world. Human beings, on the other hand, seem to cope very well with uncertain and unpredictable environments, often relying on perception-based information. Furthermore, humans beings can also utilize perceptions to guide their learning on those parts of the perception-action space that are actually relevant to the task. Therefore, we conduct a research aimed at improving robot learning through the incorporation of both perception-based and measurement-based information. For this reason, a fuzzy reinforcement learning (FRL) agent is proposed in this paper. Based on a neural-fuzzy architecture, different kinds of information can be incorporated into the FRL agent to initialise its action network, critic network and evaluation feedback module so as to accelerate its learning. By making use of the global optimisation capability of GAs (genetic algorithms), a GA-based FRL (GAFRL) agent is presented to solve the local minima problem in traditional actor-critic reinforcement learning. On the other hand, with the prediction capability of the critic network, GAs can perform a more effective global search. Different GAFRL agents are constructed and verified by using the simulation model of a physical biped robot. The simulation analysis shows that the biped learning rate for dynamic balance can be improved by incorporating perception-based information on biped balancing and walking evaluation.
引用
收藏
页码:93 / 100
页数:7
相关论文
共 50 条
  • [2] Robot learning with GA-based fuzzy reinforcement learning agents
    Zhou, CJ
    INFORMATION SCIENCES, 2002, 145 (1-2) : 45 - 68
  • [3] Robot learning assisted by perception-based information: A computing with words approach
    Zhou, CJ
    IROS 2001: PROCEEDINGS OF THE 2001 IEEE/RJS INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4: EXPANDING THE SOCIETAL ROLE OF ROBOTICS IN THE NEXT MILLENNIUM, 2001, : 272 - 277
  • [4] Learning Robust Perception-Based Controller for Quadruped Robot
    Muhamad, Fikih
    Kim, Jung-Su
    Park, Jae-Han
    IEEE ACCESS, 2023, 11 : 94497 - 94505
  • [5] Dynamic balance of a biped robot using fuzzy reinforcement learning agents
    Zhou, CJ
    Meng, QC
    FUZZY SETS AND SYSTEMS, 2003, 134 (01) : 169 - 187
  • [6] Learning of mobile robots using perception-based genetic
    Kubota, N
    Morioka, T
    Kojima, F
    Fukuda, T
    MEASUREMENT, 2001, 29 (03) : 237 - 248
  • [7] Fuzzy situation based navigation of autonomous mobile robot using reinforcement learning
    Guanlao, R
    Musilek, P
    Ahmed, F
    Kaboli, A
    NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004, : 820 - 825
  • [8] Learning integrated perception-based speed control
    Leang, P
    Bhanu, B
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, 2004, : 813 - 816
  • [9] Design of robot fuzzy logic controller based on reinforcement learning
    Duan, Yong
    Liu, Xin-Gang
    Xu, Xin-He
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2006, 18 (06): : 1597 - 1600
  • [10] Co-evolutionary perception-based reinforcement learning for sensor allocation in autonomous vehicles
    Berenji, HR
    Vengerov, D
    Ametha, J
    PROCEEDINGS OF THE 12TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, 2003, : 125 - 130