An improved Q-learning algorithm using experience sharing for multi-robot system

被引:0
作者
Ma, Jiachen [1 ,2 ]
Liu, Qiang [1 ]
Xie, Wei [2 ]
机构
[1] School of Astronautics, Harbin Institute of Technology, Harbin
[2] School of Information and Electrical Engineering, Harbin Institute of Technology (Weihai), Weihai
来源
Journal of Computational Information Systems | 2015年 / 11卷 / 09期
关键词
Experience sharing; Multi-robot system; Q-learning; Reinforcement learning;
D O I
10.12733/jcis14331
中图分类号
学科分类号
摘要
This paper proposes an improved Q-learning algorithm using experience sharing to improve the learning efficiency of traditional Q-learning. Traditional Q-learning as a classic reinforcement learning (RL) has simple operation and small size of state-action space, and can be applied in multi-robot system (MRS). But compared with multiagent reinforcement learning algorithm, traditional Q-learning lacks information exchange with other agents. Experience sharing which imitates human thinking is a good way for solving this problem. By experience sharing each robot can share with other robots'Q values through a gradual learning process using ε-greedy policy to get learning experience with probability 1-ε. Robot soccer is adopted as test platform and simulation result shows that the improved Q-learning algorithm outperforms the traditional Q-learning algorithm. ©, 2015, Binary Information Press. All right reserved.
引用
收藏
页码:3387 / 3394
页数:7
相关论文
共 12 条
[1]  
Kapetanakis S., Kudenko D., Reinforcement learning of coordination in cooperative multi-agent systems, pp. 326-331, (2002)
[2]  
Arai T., Pagello E., Parker L.E., Editorial: Advances in multi-robot systems, IEEE Transactions on robotics and automation, 18, 5, pp. 655-661, (2002)
[3]  
Kim J.H., Vadakkepat P., Multi-agent systems: A survey from the robot-soccer perspective, Intelligent Automation & Soft Computing, 6, 1, pp. 3-17, (2000)
[4]  
Kaelbling L.P., Littman M.L., Moore A.W., Reinforcement learning: A survey, (1996)
[5]  
Barto A.G., Reinforcement Learning: An Introduction, (1998)
[6]  
Watkins C.J.C.H., Dayan P., Q-learning, Machine learning, 8, 3-4, pp. 279-292, (1992)
[7]  
Shoham Y., Powers R., Grenager T., Multi-agent reinforcement learning: A critical survey, (2003)
[8]  
Arai S., Sycara K., Payne T.R., Experience-based reinforcement learning to acquire effective behavior in a multi-agent domain, PRICAI 2000 Topics in Artificial Intelligence, pp. 125-135, (2000)
[9]  
Bowling M., Multiagent learning in the presence of agents with limitations, (2003)
[10]  
Yang E., Gu D., Multiagent reinforcement learning for multi-robot systems: A survey, (2004)