Rapid behavior learning in multi-agent environment based on state value estimation of others

被引:0
|
作者
Takahashi, Yasutake [1 ]
Noma, Kentaro [1 ]
Asada, Minoru [1 ]
机构
[1] Osaka Univ, Dept Adapt Machine Syst, Suita, Osaka 5650871, Japan
来源
2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9 | 2007年
关键词
D O I
10.1109/IROS.2007.4399294
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The existing reinforcement learning approaches have been suffering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical examples is a case of RoboCup competitions since other agents and their behaviors easily cause state and action space explosion. This paper presents a method of modular learning in a multiagent environment by which the learning agent can acquire cooperative behaviors with its team mates and competitive ones against its opponents. The key ideas to resolve the issue are as follows. First, a two-layer hierarchical system with multi learning modules is adopted to reduce the size of the sensor and action spaces. The state space of the top layer consists of the state values from the lower level, and the macro actions are used to reduce the size of the physical action space. Second, the state of the other to what extent it is close to its own goal is estimated by observation and used as a state value in the top layer state space to realize the cooperative/competitive behaviors. The method is applied to 4 (defense team) on 5 (offense team) game task, and the learning agent successfully acquired the teamwork plays (pass and shoot) within much shorter learning time (30 times quicker than the earlier work).
引用
收藏
页码:76 / 81
页数:6
相关论文
共 50 条
  • [21] Modeling Others using Oneself in Multi-Agent Reinforcement Learning
    Raileanu, Roberta
    Denton, Emily
    Szlam, Arthur
    Fergus, Rob
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [22] A context model based on multi-agent in U-learning environment
    Jun, SooJin
    Han, SeonKwan
    Kim, SooHwan
    Kim, HyeonCheol
    Lee, WonGyu
    TECHNOLOGIES FOR E-LEARNING AND DIGITAL ENTERTAINMENT, PROCEEDINGS, 2007, 4469 : 274 - +
  • [23] Case-based student modeling in multi-agent learning environment
    González, C
    Burguillo, JC
    Llamas, M
    MULTI-AGENT SYSTEMS AND APPLICATIONS IV, PROCEEDINGS, 2005, 3690 : 72 - 81
  • [24] Multi-agent Q-learning Based Navigation in an Unknown Environment
    Nath, Amar
    Niyogi, Rajdeep
    Singh, Tajinder
    Kumar, Virendra
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 1, 2022, 449 : 330 - 340
  • [25] Constructing adaptive individual learning environment based on multi-agent system
    Chen, Peng
    Meng, Anbo
    Zhao, Chunhua
    CIS WORKSHOPS 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY WORKSHOPS, 2007, : 374 - +
  • [26] Distributed State Estimation for Multi-Agent based Active Distribution Networks
    Nguyen, P. H.
    Kling, W. L.
    IEEE POWER AND ENERGY SOCIETY GENERAL MEETING 2010, 2010,
  • [27] Cooperative Multi-Agent Learning: The State of the Art
    Liviu Panait
    Sean Luke
    Autonomous Agents and Multi-Agent Systems, 2005, 11 : 387 - 434
  • [28] Simulation for behavior learning of multi-agent robot
    Maeda, Y
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 1998, 6 (01) : 53 - 64
  • [29] Cooperative multi-agent learning: The state of the art
    Panait, L
    Luke, S
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2005, 11 (03) : 387 - 434
  • [30] State-based episodic memory for multi-agent reinforcement learning
    Xiao Ma
    Wu-Jun Li
    Machine Learning, 2023, 112 : 5163 - 5190