Self-organizing cognitive agents and reinforcement learning in multi-agent environment

被引:0
作者
Tan, AH [1 ]
Xiao, D [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
来源
2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS | 2005年
关键词
ARCHITECTURE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value functions of the state-action space estimated through a temporal difference (TD) method. The learned value functions are then used to determine the optimal actions based on an action selection policy. We present a specific instance of TD-FALCON based on an e-greedy action policy and a Q-learning value estimation formula. Experiments based on a minefield navigation task and a minefield pursuit task show that TD-FALCON systems are able to adapt and function well in a multi-agent environment without an explicit mechanism for collaboration.
引用
收藏
页码:351 / 357
页数:7
相关论文
共 15 条
  • [1] BRENDA M, 1986, BCSG201028 BOEING AD
  • [2] FUZZY ART - FAST STABLE LEARNING AND CATEGORIZATION OF ANALOG PATTERNS BY AN ADAPTIVE RESONANCE SYSTEM
    CARPENTER, GA
    GROSSBERG, S
    ROSEN, DB
    [J]. NEURAL NETWORKS, 1991, 4 (06) : 759 - 771
  • [3] FUZZY ARTMAP - A NEURAL NETWORK ARCHITECTURE FOR INCREMENTAL SUPERVISED LEARNING OF ANALOG MULTIDIMENSIONAL MAPS
    CARPENTER, GA
    GROSSBERG, S
    MARKUZON, N
    REYNOLDS, JH
    ROSEN, DB
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (05): : 698 - 713
  • [4] A MASSIVELY PARALLEL ARCHITECTURE FOR A SELF-ORGANIZING NEURAL PATTERN-RECOGNITION MACHINE
    CARPENTER, GA
    GROSSBERG, S
    [J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1987, 37 (01): : 54 - 115
  • [5] GORDON D, 1997, 19 ANN C COGN SCI SO
  • [6] KORG RE, 1992, 11 INT WORKSH DISTR, P183
  • [7] Levy R., 1992, 11th International Workshop on Distributed Artificial Intelligence, P195
  • [8] PANAIT L, 2003, GMUCSTR20031
  • [9] PEREZURIBE A, 2002, THESIS SWISS FEDERAL
  • [10] Multiagent systems: A survey from a machine learning perspective
    Stone, P
    Veloso, M
    [J]. AUTONOMOUS ROBOTS, 2000, 8 (03) : 345 - 383