Reinforcement learning in a continuum of agents

被引：0

作者：

Adrian Šošić

Abdelhak M. Zoubir

Heinz Koeppl

机构：

[1] Technische Universität Darmstadt,Department of Electrical Engineering and Information Technology

来源：

Swarm Intelligence | 2018年 / 12卷

关键词：

Reinforcement learning; Multi-agent systems; Decentralized control; Collective behavior; Swarm intelligence; Active particles; Continuum mechanics;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We present a decision-making framework for modeling the collective behavior of large groups of cooperatively interacting agents based on a continuum description of the agents’ joint state. The continuum model is derived from an agent-based system of locally coupled stochastic differential equations, taking into account that each agent in the group is only partially informed about the global system state. The usefulness of the proposed framework is twofold: (i) for multi-agent scenarios, it provides a computational approach to handling large-scale distributed decision-making problems and learning decentralized control policies. (ii) For single-agent systems, it offers an alternative approximation scheme for evaluating expectations of state distributions. We demonstrate our framework on a variant of the Kuramoto model using a variety of distributed control tasks, such as positioning and aggregation. As part of our experiments, we compare the effectiveness of the controllers learned by the continuum model and agent-based systems of different sizes, and we analyze how the degree of observability in the system affects the learning process.

引用

页码：23 / 51

页数：28