A Shaped-Q Learning for Multi-Agents Systems

Cited by: 0

Authors:

Hwang, Kao-Shing [1]

Jiang, Wei-Cheng [1]

Affiliations:

[1] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung, Taiwan

Source:

2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC) | 2017

Keywords:

Reinforcement learning; Multi-agents System; Cooperation;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes an architecture in which each agent maintains a cooperative tendency table (CTT). During learning, agents need not communicate with each other; instead, each observes its partners' actions while taking its own. If one agent meets a bad situation after taking an action, such as bumping into an obstacle, the agents receive a bad reward from the environment. Similarly, if one agent reaches a goal after taking an action, the agents obtain a good reward. Rewards are used to update the policy and to adjust the cooperative tendency values recorded in each agent's CTT. When an agent perceives a state, the corresponding cooperative tendency value and the Q-value are merged into a Shaped-Q value, and the action with the maximal Shaped-Q value in that state is selected. After the agents take actions and receive a reward, they update their own CTTs. Agents can therefore use this method to reach a consensus more quickly, which enhances learning efficiency and reduces the occurrence of stagnation. The simulation results demonstrate that the proposed method can speed up the learning process and, to some degree, alleviate the problem of huge memory consumption. It also enables the agents to complete the task together more efficiently.
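
The selection and update loop described in the abstract can be sketched roughly as follows. The abstract does not state how the Q-value and the cooperative tendency value are merged, how the CTT is indexed, or how it is updated, so everything specific here is an assumption made for illustration: the class name `ShapedQAgent`, the additive merge with a `weight` factor, the CTT keyed by (state, own action), and the running-average CTT update with step size `beta`. It should not be read as the authors' formulation.

```python
from collections import defaultdict


class ShapedQAgent:
    """Hypothetical agent; merge and CTT update rules are assumptions."""

    def __init__(self, actions, alpha=0.1, gamma=0.9, beta=0.1, weight=1.0):
        self.actions = list(actions)
        self.alpha = alpha    # Q-learning step size
        self.gamma = gamma    # discount factor
        self.beta = beta      # CTT step size (assumed)
        self.weight = weight  # influence of the CTT on the shaped value (assumed)
        self.q = defaultdict(float)    # (state, action) -> Q-value
        self.ctt = defaultdict(float)  # (state, action) -> cooperative tendency

    def shaped_q(self, state, action):
        # Assumed merge: Q-value plus a weighted cooperative tendency value.
        key = (state, action)
        return self.q[key] + self.weight * self.ctt[key]

    def select_action(self, state):
        # Pick the action with the maximal Shaped-Q value in this state.
        return max(self.actions, key=lambda a: self.shaped_q(state, a))

    def update(self, state, action, reward, next_state):
        key = (state, action)
        # Standard one-step Q-learning update on the unshaped Q-value.
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        self.q[key] += self.alpha * (reward + self.gamma * best_next - self.q[key])
        # Assumed CTT update: move the tendency toward the shared reward.
        self.ctt[key] += self.beta * (reward - self.ctt[key])


agent = ShapedQAgent(["left", "right"])
agent.update("s0", "right", 1.0, "s1")   # shared good reward
agent.update("s0", "left", -1.0, "s1")   # shared bad reward
chosen = agent.select_action("s0")
```

In this sketch every agent would hold its own `q` and `ctt` tables and receive the same environment reward, which mirrors the abstract's point that agents coordinate without explicit communication.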


Pages: 2024-2027

Page count: 4
