Reinforcement learning in a continuum of agents

被引:0
作者
Adrian Šošić
Abdelhak M. Zoubir
Heinz Koeppl
机构
[1] Technische Universität Darmstadt,Department of Electrical Engineering and Information Technology
来源
Swarm Intelligence | 2018年 / 12卷
关键词
Reinforcement learning; Multi-agent systems; Decentralized control; Collective behavior; Swarm intelligence; Active particles; Continuum mechanics;
D O I
暂无
中图分类号
学科分类号
摘要
We present a decision-making framework for modeling the collective behavior of large groups of cooperatively interacting agents based on a continuum description of the agents’ joint state. The continuum model is derived from an agent-based system of locally coupled stochastic differential equations, taking into account that each agent in the group is only partially informed about the global system state. The usefulness of the proposed framework is twofold: (i) for multi-agent scenarios, it provides a computational approach to handling large-scale distributed decision-making problems and learning decentralized control policies. (ii) For single-agent systems, it offers an alternative approximation scheme for evaluating expectations of state distributions. We demonstrate our framework on a variant of the Kuramoto model using a variety of distributed control tasks, such as positioning and aggregation. As part of our experiments, we compare the effectiveness of the controllers learned by the continuum model and agent-based systems of different sizes, and we analyze how the degree of observability in the system affects the learning process.
引用
收藏
页码:23 / 51
页数:28
相关论文
共 50 条
[41]   Anticheat System Based on Reinforcement Learning Agents in Unity [J].
Lukas, Mihael ;
Tomicic, Igor ;
Bernik, Andrija .
INFORMATION, 2022, 13 (04)
[42]   Reinforcement Learning Strategy for Solving the MRCPSP by a Team of Agents [J].
Jedrzejowicz, Piotr ;
Ratajczak-Ropel, Ewa .
INTELLIGENT DECISION TECHNOLOGIES, 2015, 39 :537-548
[43]   Mitigating Cowardice for Reinforcement Learning Agents in Combat Scenarios [J].
Bakos, Steve ;
Davoudi, Heidar .
2022 IEEE CONFERENCE ON GAMES, COG, 2022, :377-384
[44]   A generic architecture for adaptive agents based on reinforcement learning [J].
Preux, P ;
Delepoulle, S ;
Darcheville, JC .
INFORMATION SCIENCES, 2004, 161 (1-2) :37-55
[45]   Reinforcement learning through interaction among multiple agents [J].
Iima, Hitoshi ;
Kuroe, Yasuaki .
2006 SICE-ICASE INTERNATIONAL JOINT CONFERENCE, VOLS 1-13, 2006, :100-+
[46]   Analysis of a Method Improving Reinforcement Learning Agents'Policies [J].
Kitakoshi, Daisuke ;
Shioya, Hiroyuki ;
Kurihara, Masahito .
2003, Fuji Technology Press (07) :276-282
[47]   Composing Synergistic Macro Actions for Reinforcement Learning Agents [J].
Chen, Yu-Ming ;
Chang, Kaun-Yu ;
Liu, Chien ;
Hsiao, Tsu-Ching ;
Hong, Zhang-Wei ;
Lee, Chun-Yi .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) :7251-7258
[48]   Validation of Reinforcement Learning Agents and Safety Shields with ProB [J].
Vu, Fabian ;
Dunkelau, Jannik ;
Leuschel, Michael .
NASA FORMAL METHODS, NFM 2024, 2024, 14627 :279-297
[49]   Testing of Deep Reinforcement Learning Agents with Surrogate Models [J].
Biagiola, Matteo ;
Tonella, Paolo .
ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (03)
[50]   Hammers for Robots: Designing Tools for Reinforcement Learning Agents [J].
Law, Matthew, V ;
Li, Zhilong ;
Rajesh, Amit ;
Dhawan, Nikhil ;
Kwatra, Amritansh ;
Hoffman, Guy .
PROCEEDINGS OF THE 2021 ACM DESIGNING INTERACTIVE SYSTEMS CONFERENCE (DIS 2021), 2021, :1638-1653