A Shaped-Q Learning for Multi-Agents Systems

被引:0
|
作者
Hwang, Kao-Shing [1 ]
Jiang, Wei-Cheng [1 ]
机构
[1] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung, Taiwan
来源
2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC) | 2017年
关键词
Reinforcement learning; Multi-agents System; Cooperation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an architecture where each agent maintains a cooperative tendency table (CTT). In the process of learning, agents need not communicate with each other but observe partners' actions while taking actions. If one of the agents meets a bad situation, such as bumping onto obstacles after taking an action. In such a case, agents will receive a bad reward from the environment. Similarly, if one agent reaches a goal after taking an action, agents obtain a good reward instead. Rewards are used to update the policy and to adjust cooperative tendency values which are recorded in the individual CTT. When an agent perceives a state, the corresponding cooperative tendency value, and the Q-value are merged to a Shaped-Q value. The action with maximal Shaped-Q value in this state will be selected. After agents take actions and receive a reward, agents update their own CTTs. Therefore, agents could use this method to reach a consensus more quickly to enhance learning efficiency and reduce the occurrence of stagnation. The simulation results demonstrate that the proposed method can speed up the learning process and solve the problem of huge memory space consumption to some degrees. As well, it can make agents complete the task together more efficiently.
引用
收藏
页码:2024 / 2027
页数:4
相关论文
共 50 条
  • [1] The use of multi-agents' systems in e-learning platforms
    Orzechowski, Tomasz
    SIBCON-2007: IEEE INTERNATIONAL SIBERIAN CONFERENCE ON CONTROL AND COMMUNICATION, 2007, : 64 - 71
  • [2] Robust Collaborative Learning by Multi-Agents
    Balasingam, B.
    Pattipati, K.
    Levchuck, G.
    Romano, J. C.
    2015 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR SECURITY AND DEFENSE APPLICATIONS (CISDA), 2015, : 183 - 187
  • [3] Learning Styles Multi-agents Simulation
    Juliana Hernandez, Emilcy
    Felipe Londono, Luis
    Giraldo, Mauricio
    Tabares, Valentina
    Dario Duque, Nestor
    ADVANCES IN PRACTICAL APPLICATIONS OF CYBER-PHYSICAL MULTI-AGENT SYSTEMS: THE PAAMS COLLECTION, PAAMS 2017, 2017, 10349 : 325 - 328
  • [4] Evaluating reputation in multi-agents systems
    Mui, L
    Halberstadt, A
    Mohtashemi, M
    TRUST, REPUTATION, AND SECURITY: THEORIES AND PRACTICE, 2003, 2631 : 123 - 137
  • [5] Multi-agents and learning: Implications for Webusage mining
    Lotfy, Hewayda M. S.
    Khamis, Soheir M. S.
    Aboghazalah, Maie M.
    JOURNAL OF ADVANCED RESEARCH, 2016, 7 (02) : 285 - 295
  • [6] A study on mathematical modeling for learning multi-agents
    Furukawa, Masashi
    Watanabe, Michiko
    Ohkura, Kazuhiro
    Kakazu, Yukinori
    Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2003, 69 (02): : 200 - 204
  • [7] Dynamic Applications Using Multi-Agents Systems
    Khazab, Mohammad
    Tweedale, Jeffrey
    Jain, Lakhmi
    INTELLIGENT SYSTEMS AND TECHNOLOGIES: METHODS AND APPLICATIONS, 2009, 217 : 67 - 79
  • [8] Combination of Interaction Models for Multi-Agents Systems
    Ribeiro, Richardson
    Guisi, Douglas M.
    Teixeira, Marcelo
    Dosciatti, Eden R.
    Borges, Andre P.
    Enembreck, Fabricio
    ENTERPRISE INFORMATION SYSTEMS, ICEIS 2016, 2017, 291 : 107 - 121
  • [9] A contribution to the formal checking of multi-agents systems
    Belala, F.
    Boucherit, A.
    2006 IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2006, : 9 - +
  • [10] Development of intelligent systems and multi-agents systems with Amine platform
    Kabbaj, Adil
    CONCEPTUAL STRUCTURES: INSPIRATION AND APPLICATION, 2006, 4068 : 286 - 299