An improved Q-learning algorithm using experience sharing for multi-robot system

被引：0

作者：

Ma, Jiachen ^{[1
,2
]}

Liu, Qiang ^{[1
]}

Xie, Wei ^{[2
]}

机构：

[1] School of Astronautics, Harbin Institute of Technology, Harbin

[2] School of Information and Electrical Engineering, Harbin Institute of Technology (Weihai), Weihai

来源：

Journal of Computational Information Systems | 2015年 / 11卷 / 09期

关键词：

Experience sharing; Multi-robot system; Q-learning; Reinforcement learning;

D O I：

10.12733/jcis14331

中图分类号：

学科分类号：

摘要：

This paper proposes an improved Q-learning algorithm using experience sharing to improve the learning efficiency of traditional Q-learning. Traditional Q-learning as a classic reinforcement learning (RL) has simple operation and small size of state-action space, and can be applied in multi-robot system (MRS). But compared with multiagent reinforcement learning algorithm, traditional Q-learning lacks information exchange with other agents. Experience sharing which imitates human thinking is a good way for solving this problem. By experience sharing each robot can share with other robots'Q values through a gradual learning process using ε-greedy policy to get learning experience with probability 1-ε. Robot soccer is adopted as test platform and simulation result shows that the improved Q-learning algorithm outperforms the traditional Q-learning algorithm. ©, 2015, Binary Information Press. All right reserved.

引用

页码：3387 / 3394

页数：7

共 12 条

[1]

Kapetanakis S., Kudenko D., Reinforcement learning of coordination in cooperative multi-agent systems, pp. 326-331, (2002)

[2]

Arai T., Pagello E., Parker L.E., Editorial: Advances in multi-robot systems, IEEE Transactions on robotics and automation, 18, 5, pp. 655-661, (2002)

[3]

Kim J.H., Vadakkepat P., Multi-agent systems: A survey from the robot-soccer perspective, Intelligent Automation & Soft Computing, 6, 1, pp. 3-17, (2000)

[4]

Kaelbling L.P., Littman M.L., Moore A.W., Reinforcement learning: A survey, (1996)

[5]

Barto A.G., Reinforcement Learning: An Introduction, (1998)

[6]

Watkins C.J.C.H., Dayan P., Q-learning, Machine learning, 8, 3-4, pp. 279-292, (1992)

[7]

Shoham Y., Powers R., Grenager T., Multi-agent reinforcement learning: A critical survey, (2003)

[8]

Arai S., Sycara K., Payne T.R., Experience-based reinforcement learning to acquire effective behavior in a multi-agent domain, PRICAI 2000 Topics in Artificial Intelligence, pp. 125-135, (2000)

[9]

Bowling M., Multiagent learning in the presence of agents with limitations, (2003)

[10]

Yang E., Gu D., Multiagent reinforcement learning for multi-robot systems: A survey, (2004)

← 1 2 →