Reinforcement learning in a continuum of agents

Cited by: 0

Authors:

Adrian Šošić

Abdelhak M. Zoubir

Heinz Koeppl

Affiliation:

[1] Technische Universität Darmstadt, Department of Electrical Engineering and Information Technology

Source:

Swarm Intelligence | 2018 / Volume 12

Keywords:

Reinforcement learning; Multi-agent systems; Decentralized control; Collective behavior; Swarm intelligence; Active particles; Continuum mechanics

DOI:

Not available

Abstract:

We present a decision-making framework for modeling the collective behavior of large groups of cooperatively interacting agents based on a continuum description of the agents’ joint state. The continuum model is derived from an agent-based system of locally coupled stochastic differential equations, taking into account that each agent in the group is only partially informed about the global system state. The usefulness of the proposed framework is twofold: (i) for multi-agent scenarios, it provides a computational approach to handling large-scale distributed decision-making problems and learning decentralized control policies; (ii) for single-agent systems, it offers an alternative approximation scheme for evaluating expectations of state distributions. We demonstrate our framework on a variant of the Kuramoto model using a variety of distributed control tasks, such as positioning and aggregation. As part of our experiments, we compare the effectiveness of the controllers learned with the continuum model and with agent-based systems of different sizes, and we analyze how the degree of observability in the system affects the learning process.

Pages: 23-51

Page count: 28
