Self-organizing cognitive agents and reinforcement learning in multi-agent environment

被引：0

作者：

Tan, AH ^{[1
]}

Xiao, D ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore

来源：

2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS | 2005年

关键词：

ARCHITECTURE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value functions of the state-action space estimated through a temporal difference (TD) method. The learned value functions are then used to determine the optimal actions based on an action selection policy. We present a specific instance of TD-FALCON based on an e-greedy action policy and a Q-learning value estimation formula. Experiments based on a minefield navigation task and a minefield pursuit task show that TD-FALCON systems are able to adapt and function well in a multi-agent environment without an explicit mechanism for collaboration.

引用

页码：351 / 357

页数：7

共 15 条

[1] BRENDA M, 1986, BCSG201028 BOEING AD
[2] FUZZY ART - FAST STABLE LEARNING AND CATEGORIZATION OF ANALOG PATTERNS BY AN ADAPTIVE RESONANCE SYSTEM
CARPENTER, GA
GROSSBERG, S
ROSEN, DB
[J]. NEURAL NETWORKS, 1991, 4 (06) : 759 - 771
[3] FUZZY ARTMAP - A NEURAL NETWORK ARCHITECTURE FOR INCREMENTAL SUPERVISED LEARNING OF ANALOG MULTIDIMENSIONAL MAPS
CARPENTER, GA
GROSSBERG, S
MARKUZON, N
REYNOLDS, JH
ROSEN, DB
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (05): : 698 - 713
[4] A MASSIVELY PARALLEL ARCHITECTURE FOR A SELF-ORGANIZING NEURAL PATTERN-RECOGNITION MACHINE
CARPENTER, GA
GROSSBERG, S
[J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1987, 37 (01): : 54 - 115
[5] GORDON D, 1997, 19 ANN C COGN SCI SO
[6] KORG RE, 1992, 11 INT WORKSH DISTR, P183
[7] Levy R., 1992, 11th International Workshop on Distributed Artificial Intelligence, P195
[8] PANAIT L, 2003, GMUCSTR20031
[9] PEREZURIBE A, 2002, THESIS SWISS FEDERAL
[10] Multiagent systems: A survey from a machine learning perspective
Stone, P
Veloso, M
[J]. AUTONOMOUS ROBOTS, 2000, 8 (03) : 345 - 383

← 1 2 →