Fuzzy Reinforcement Learning Control for Decentralized Partially Observable Markov Decision Processes

被引：0

作者：

Sharma, Rajneesh ^{[1
]}

Spaan, Matthijs T. J. ^{[2
]}

机构：

[1] Netaji Subhas Inst Technol, Instrumentat & Control Div, New Delhi, India

[2] Inst Super Tecn, Inst Syst & Robot, Lisbon, Portugal

来源：

IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011) | 2011年

关键词：

Reinforcement learning; Fuzzy systems; Cooperative multiagent systems; Decentralized POMDPs;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs) offer a powerful platform for optimizing sequential decision making in partially observable stochastic environments. However, finding optimal solutions for Dec-POMDPs is known to be intractable, necessitating approximate/suboptimal approaches. To address this problem, this work proposes a novel fuzzy reinforcement learning (RL) based game theoretic controller for Dec-POMDPs. The proposed controller implements fuzzy RL on Dec-POMDPs, which are modeled as a sequence of Bayesian games (BG). The main contributions of the work are the introduction of a game based RL paradigm in a Dec-POMDP settings, and the use of fuzzy inference systems to effectively generalize the underlying belief space. We apply the proposed technique on two benchmark problems and compare results against state-of-the-art Dec-POMDP control approach. The results validate the feasibility and effectiveness of using game theoretic RL based fuzzy control for addressing intractability of Dec-POMDPs, thus opening up a new research direction.

引用

页码：1422 / 1429

页数：8

共 26 条

[1]

Amato C., 2009, AUTONOMOUS AGENTS MU

[2]

[Anonymous], 2004, Fuzzy Logic with Engineering Applications

[3]

Bernstein D. S., 2005, P INT JOINT C ART IN

[4] The complexity of decentralized control of Markov decision processes [J].

Bernstein, DS ;

Givan, R ;

Immerman, N ;

Zilberstein, S .

MATHEMATICS OF OPERATIONS RESEARCH, 2002, 27 (04) :819-840

[5]

Boutilier C., 1996, THEORETICAL ASPECTS

[6] A comprehensive survey of multiagent reinforcement learning [J].

Busoniu, Lucian ;

Babuska, Robert ;

De Schutter, Bart .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172

[7]

Duman E., 2005, LECT NOTES COMPUTER, P306, DOI [10.1007/ 11559221_ 31, DOI 10.1007/11559221_31]

[8]

Emery-Montemerlo R., 2004, P INT C AUT AG MULT

[9] Fuzzy inference system learning by reinforcement methods [J].

Jouffe, L .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 1998, 28 (03) :338-355

[10] Planning and acting in partially observable stochastic domains [J].

Kaelbling, LP ;

Littman, ML ;

Cassandra, AR .

ARTIFICIAL INTELLIGENCE, 1998, 101 (1-2) :99-134

← 1 2 3 →