Expertness framework in multi-agent systems and its application in credit assignment problem

被引:6
作者
Rahaie, Zahra [1 ]
Beigy, Hamid [1 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Intelligent Syst Lab, Tehran, Iran
关键词
Credit assignment; expertness framework; critic learning; multi-agent systems; cooperative learning; noise;
D O I
10.3233/IDA-140654
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the challenging problems in artificial intelligence is credit assignment which simply means distributing the credit among a group, such as a group of agents. We made an attempt to meet this problem with the aid of the reinforcement learning paradigm. In this paper, expertness framework is defined and applied to the multi-agent credit assignment problem. In the expertness framework, the critic agent, who is responsible for distributing credit among agents, is equipped with learning capability, and the proposed credit assignment solution is based on the critic to learn to assign a proportion of the credit to each agent, and the used proportion should be learned by reinforcement learning. The paper also reports the degree of expertness framework robustness and the amount of performance decline in noisy environments. Experimental results show the superiority of the method over the common methods of credit assignment used in lots of different domains and also show that performance reduction with respect to the quantity of the noise is tolerable and the system ultimately converges to the stable and correct behavior, therefore the agents are still capable of efficiently performing in the noisy environments.
引用
收藏
页码:511 / 528
页数:18
相关论文
共 36 条
  • [1] Agogino A., 2005, GEN EV COMP C
  • [2] Agognio A.K., 2004, AAMAS, V2, P980
  • [3] Expertness based cooperative Q-learning
    Ahmadabadi, MN
    Asadpour, M
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2002, 32 (01): : 66 - 76
  • [4] A study on expertise of agents and its effects on cooperative Q-learning
    Araabi, Babak Nadjar
    Mastoureshgh, Sahar
    Ahmadabadi, Majid Nili
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02): : 398 - 409
  • [5] Balch T., 1999, IJCAI 99 WORKSH AG L
  • [6] Bednar Jenna., 2007, SUP CT EC REV, V15, P285
  • [7] Bianchi D., 1996, P 1 ONL WORKSH SOFT, P113
  • [8] A comprehensive survey of multiagent reinforcement learning
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02): : 156 - 172
  • [9] CHANG YH, 2003, P NEUR INF PROC SYST
  • [10] Spatio-Temporal Credit Assignment in Neuronal Population Learning
    Friedrich, Johannes
    Urbanczik, Robert
    Senn, Walter
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (06)