Reinforcement Learning with Adaptive Networks

被引:0
作者
Sasaki, Tomoki [1 ]
Yamada, Satoshi [1 ]
机构
[1] Okayama Univ Sci, Dept Intelligent Mech Engn, Okayama, Japan
来源
2017 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION SCIENCES (ICRAS) | 2017年
关键词
reinforcement learning; INGnet; state list; garage parking; stilt-type biped robot; Khepera robot;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The reinforcement learning (RL) with the incremental normalized Gaussian networks (INGnet) using the state list was proposed in order to improve the learning efficiency. This learning system was applied to the garage parking control, the walking control of the stilt-type biped robot on the slope, and the wall avoidance control of Khepera robot. Since in the ordinary condition to add the new processing units they were not set in the central region of the garage parking field or around the states during the normal walking, RL with INGnet did not learn the control effectively. RL with INGnet using the state list set the processing units at the necessary positions and learned the garage parking control and the walking control of the stilt-type biped robot on the slope. However, since it is easy to learn the wall avoidance control, the state list was not needed for RL with INGnet: RL with INGnet without the state list was able to learn it.
引用
收藏
页码:1 / 5
页数:5
相关论文
共 12 条
[1]  
Albus J. S., 1975, Transactions of the ASME. Series G, Journal of Dynamic Systems, Measurement and Control, V97, P220, DOI 10.1115/1.3426922
[2]  
Albus J. S., 1975, Transactions of the ASME. Series G, Journal of Dynamic Systems, Measurement and Control, V97, P228, DOI 10.1115/1.3426923
[3]  
Asano Tsubasa, 2012, International Journal of Advanced Mechatronic Systems, V4, P94, DOI 10.1504/IJAMECHS.2012.048394
[4]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[5]  
Michel O., 1996, KHEPERA SIMULATOR VE
[6]   Fast Learning in Networks of Locally-Tuned Processing Units [J].
Moody, John ;
Darken, Christian J. .
NEURAL COMPUTATION, 1989, 1 (02) :281-294
[7]  
Morimoto J., 1999, Transactions of the Institute of Electronics, Information and Communication Engineers D-II, VJ82D-II, P2118
[8]  
Nakama Hayato, 2011, International Journal of Advanced Mechatronic Systems, V3, P251, DOI 10.1504/IJAMECHS.2011.043373
[9]  
Nishiyama R., 2015, LECT NOTES ELECT ENG, V1, P207
[10]  
Sutton R., 1998, Introduction to reinforcement learning