Deep Reinforcement Learning for Humanoid Robot Behaviors

被引:16
作者
Muzio, Alexandre F. V. [1 ]
Maximo, Marcos R. O. A. [1 ]
Yoneyama, Takashi [2 ]
机构
[1] Aeronaut Inst Technol, Autonomous Computat Syst Lab LAB SCA, Comp Sci Div, Praca Marechal Eduardo Gomes 50, BR-12228900 Sao Jose Dos Campos, SP, Brazil
[2] Aeronaut Inst Technol, Elect Engn Div, Praca Marechal Eduardo Gomes 50, BR-12228900 Sao Jose Dos Campos, SP, Brazil
关键词
Deep reinforcement learning; Robot soccer; Humanoid robots; Robotics;
D O I
10.1007/s10846-022-01619-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RoboCup 3D Soccer Simulation is a robot soccer competition based on a high-fidelity simulator with autonomous humanoid agents, making it an interesting testbed for robotics and artificial intelligence. Due to the recent success of Deep Reinforcement Learning (DRL) in continuous control tasks, many teams have been using this technique to develop motions in Soccer 3D. This article focuses on learning humanoid robot behaviors: completing a racing track as fast as possible and dribbling against a single opponent. Our approach uses a hierarchical controller where a model-free policy learns to interact model-based walking algorithm. Then, we use DRL algorithms for an agent to learn how to perform these behaviors. Finally, the learned dribble policy was evaluated in the Soccer 3D environment. Simulated experiments show that the DRL agent wins against the hand-coded behavior used by the ITAndroids robotics team in 68.2% of dribble attempts.
引用
收藏
页数:16
相关论文
共 65 条
  • [1] Abdolmaleki A., 2017, LEARNING HUMANOID KI
  • [2] Abrel M., 2019, P 2019 ROBOCUP S ROB
  • [3] Al-Shedivat M., 2017, CORR
  • [4] Alcaraz-Jimenez J., 2014, CLOSED LOOP DRIBBLIN
  • [5] [Anonymous], 2018, LECT NOTES ARTIFICIA
  • [6] [Anonymous], 2017, arXiv
  • [7] [Anonymous], 2012, P 26 AAAI C ART INT
  • [8] [Anonymous], 2013, P 12 INT C AUT AG MU
  • [9] [Anonymous], 2017, Protocol Buffers
  • [10] [Anonymous], 2018, NAO ROBOT