Reinforcement learning in a continuum of agents

被引:0
作者
Adrian Šošić
Abdelhak M. Zoubir
Heinz Koeppl
机构
[1] Technische Universität Darmstadt,Department of Electrical Engineering and Information Technology
来源
Swarm Intelligence | 2018年 / 12卷
关键词
Reinforcement learning; Multi-agent systems; Decentralized control; Collective behavior; Swarm intelligence; Active particles; Continuum mechanics;
D O I
暂无
中图分类号
学科分类号
摘要
We present a decision-making framework for modeling the collective behavior of large groups of cooperatively interacting agents based on a continuum description of the agents’ joint state. The continuum model is derived from an agent-based system of locally coupled stochastic differential equations, taking into account that each agent in the group is only partially informed about the global system state. The usefulness of the proposed framework is twofold: (i) for multi-agent scenarios, it provides a computational approach to handling large-scale distributed decision-making problems and learning decentralized control policies. (ii) For single-agent systems, it offers an alternative approximation scheme for evaluating expectations of state distributions. We demonstrate our framework on a variant of the Kuramoto model using a variety of distributed control tasks, such as positioning and aggregation. As part of our experiments, we compare the effectiveness of the controllers learned by the continuum model and agent-based systems of different sizes, and we analyze how the degree of observability in the system affects the learning process.
引用
收藏
页码:23 / 51
页数:28
相关论文
共 50 条
[21]   Reinforcement learning of competitive skills with soccer agents [J].
Leng, Jinsong ;
Fyfe, Colin ;
Jain, Lakhmi .
KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT I, PROCEEDINGS, 2007, 4692 :572-+
[22]   Coordinating multiple agents via reinforcement learning [J].
Chen, G ;
Yang, ZH ;
He, H ;
Goh, KM .
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2005, 10 (03) :273-328
[23]   Optimizing the Neural Architecture of Reinforcement Learning Agents [J].
Mazyavkina, N. ;
Moustafa, S. ;
Trofimov, I ;
Burnaev, E. .
INTELLIGENT COMPUTING, VOL 2, 2021, 284 :591-606
[24]   Emotion in reinforcement learning agents and robots: a survey [J].
Moerland, Thomas M. ;
Broekens, Joost ;
Jonker, Catholijn M. .
MACHINE LEARNING, 2018, 107 (02) :443-480
[25]   Coordinating Multiple Agents via Reinforcement Learning [J].
Gang Chen ;
Zhonghua Yang ;
Hao He ;
Kiah Mok Goh .
Autonomous Agents and Multi-Agent Systems, 2005, 10 :273-328
[26]   Enabling Motivated Believable Agents with Reinforcement Learning [J].
Forgette, Jacquelyne ;
Katchabaw, Michael .
2014 IEEE GAMES, MEDIA, ENTERTAINMENT (GEM), 2014,
[27]   Learning Task Decomposition and Exploration Shaping for Reinforcement Learning Agents [J].
Djurdjevic, Predrag ;
Huber, Manfred .
2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, :365-372
[28]   Learning key steps to attack deep reinforcement learning agents [J].
Chien-Min Yu ;
Ming-Hsin Chen ;
Hsuan-Tien Lin .
Machine Learning, 2023, 112 :1499-1522
[29]   Learning key steps to attack deep reinforcement learning agents [J].
Yu, Chien-Min ;
Chen, Ming-Hsin ;
Lin, Hsuan-Tien .
MACHINE LEARNING, 2023, 112 (05) :1499-1522
[30]   Task Offloading in Computing Continuum Using Collaborative Reinforcement Learning [J].
Robles-Enciso, Alberto ;
Skarmeta, Antonio F. .
INTERNET OF THINGS, GIOTS 2022, 2022, 13533 :82-95