Expertness framework in multi-agent systems and its application in credit assignment problem

被引：6

作者：

Rahaie, Zahra ^{[1
]}

Beigy, Hamid ^{[1
]}

机构：

[1] Sharif Univ Technol, Dept Comp Engn, Intelligent Syst Lab, Tehran, Iran

来源：

INTELLIGENT DATA ANALYSIS | 2014年 / 18卷 / 03期

关键词：

Credit assignment; expertness framework; critic learning; multi-agent systems; cooperative learning; noise;

D O I：

10.3233/IDA-140654

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

One of the challenging problems in artificial intelligence is credit assignment which simply means distributing the credit among a group, such as a group of agents. We made an attempt to meet this problem with the aid of the reinforcement learning paradigm. In this paper, expertness framework is defined and applied to the multi-agent credit assignment problem. In the expertness framework, the critic agent, who is responsible for distributing credit among agents, is equipped with learning capability, and the proposed credit assignment solution is based on the critic to learn to assign a proportion of the credit to each agent, and the used proportion should be learned by reinforcement learning. The paper also reports the degree of expertness framework robustness and the amount of performance decline in noisy environments. Experimental results show the superiority of the method over the common methods of credit assignment used in lots of different domains and also show that performance reduction with respect to the quantity of the noise is tolerable and the system ultimately converges to the stable and correct behavior, therefore the agents are still capable of efficiently performing in the noisy environments.

引用

页码：511 / 528

页数：18

共 36 条

[1] Agogino A., 2005, GEN EV COMP C
[2] Agognio A.K., 2004, AAMAS, V2, P980
[3] Expertness based cooperative Q-learning
Ahmadabadi, MN
Asadpour, M
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2002, 32 (01): : 66 - 76
[4] A study on expertise of agents and its effects on cooperative Q-learning
Araabi, Babak Nadjar
Mastoureshgh, Sahar
Ahmadabadi, Majid Nili
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02): : 398 - 409
[5] Balch T., 1999, IJCAI 99 WORKSH AG L
[6] Bednar Jenna., 2007, SUP CT EC REV, V15, P285
[7] Bianchi D., 1996, P 1 ONL WORKSH SOFT, P113
[8] A comprehensive survey of multiagent reinforcement learning
Busoniu, Lucian
Babuska, Robert
De Schutter, Bart
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02): : 156 - 172
[9] CHANG YH, 2003, P NEUR INF PROC SYST
[10] Spatio-Temporal Credit Assignment in Neuronal Population Learning
Friedrich, Johannes
Urbanczik, Robert
Senn, Walter
[J]. PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (06)

← 1 2 3 4 →