Reinforcement learning in a continuum of agents

Cited by: 0
Authors
Adrian Šošić
Abdelhak M. Zoubir
Heinz Koeppl
Affiliation
[1] Technische Universität Darmstadt, Department of Electrical Engineering and Information Technology
Source
Swarm Intelligence | 2018 / Vol. 12
Keywords
Reinforcement learning; Multi-agent systems; Decentralized control; Collective behavior; Swarm intelligence; Active particles; Continuum mechanics
Abstract
We present a decision-making framework for modeling the collective behavior of large groups of cooperatively interacting agents based on a continuum description of the agents’ joint state. The continuum model is derived from an agent-based system of locally coupled stochastic differential equations, taking into account that each agent in the group is only partially informed about the global system state. The usefulness of the proposed framework is twofold: (i) for multi-agent scenarios, it provides a computational approach to handling large-scale distributed decision-making problems and learning decentralized control policies; (ii) for single-agent systems, it offers an alternative approximation scheme for evaluating expectations of state distributions. We demonstrate our framework on a variant of the Kuramoto model using a variety of distributed control tasks, such as positioning and aggregation. As part of our experiments, we compare the effectiveness of the controllers learned by the continuum model and by agent-based systems of different sizes, and we analyze how the degree of observability in the system affects the learning process.
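For readers unfamiliar with the kind of agent-based system the abstract refers to, the following is a minimal sketch of a locally coupled, noisy Kuramoto model integrated with the Euler-Maruyama scheme. It is the textbook Kuramoto model, not the variant or the control tasks studied in the paper. The ring topology, the index-based neighborhood `radius` used as a stand-in for partial observability, the identical natural frequencies, and all parameter values and function names are assumptions made purely for illustration.

```python
import numpy as np


def simulate_kuramoto(n_agents=50, coupling=2.0, noise=0.1, radius=5,
                      dt=0.01, n_steps=2000, seed=0):
    """Euler-Maruyama simulation of noisy Kuramoto oscillators on a ring.

    Each agent only interacts with agents whose index lies within `radius`
    on the ring (an assumed, simplified notion of local coupling).
    Requires radius >= 1 so that every agent has at least one neighbor.
    """
    rng = np.random.default_rng(seed)
    theta = rng.uniform(0.0, 2.0 * np.pi, size=n_agents)  # initial phases
    omega = np.zeros(n_agents)  # identical natural frequencies (assumption)

    # Ring distance between agent indices defines who can "see" whom.
    idx = np.arange(n_agents)
    dist = np.abs(idx[:, None] - idx[None, :])
    dist = np.minimum(dist, n_agents - dist)
    neighbors = (dist <= radius) & (dist > 0)
    degree = neighbors.sum(axis=1)

    for _ in range(n_steps):
        # diff[i, j] = theta_j - theta_i
        diff = theta[None, :] - theta[:, None]
        drift = omega + coupling * (neighbors * np.sin(diff)).sum(axis=1) / degree
        # Euler-Maruyama step: deterministic drift plus scaled Gaussian noise.
        theta = theta + drift * dt + noise * np.sqrt(dt) * rng.standard_normal(n_agents)
        theta = np.mod(theta, 2.0 * np.pi)
    return theta


def order_parameter(theta):
    """Magnitude of the mean phase vector: 0 = incoherent, 1 = fully aligned."""
    return float(np.abs(np.exp(1j * theta).mean()))


if __name__ == "__main__":
    final_phases = simulate_kuramoto()
    print("order parameter:", order_parameter(final_phases))
```

The order parameter gives a single scalar summary of how aggregated the phases are. A continuum description such as the one proposed in the abstract would replace the per-agent phase vector with a density over phases.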
Pages: 23-51
Page count: 28