A Heterogeneous Acceleration System for Attention-Based Multi-Agent Reinforcement Learning

Citations: 0

Authors:

Wiggins, Samuel [1]

Meng, Yuan [1]

Iyer, Mahesh A. [2]

Prasanna, Viktor [1]

Affiliations:

[1] University of Southern California, Ming Hsieh Department of Electrical and Computer Engineering, Los Angeles, CA 90007 USA

[2] Intel Corp, Santa Clara, CA USA

Source:

2024 34TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL 2024 | 2024

Funding:

National Science Foundation (NSF), USA;

Keywords:

Multi-Agent Reinforcement Learning; Hardware Accelerator; Heterogeneous Computing;

DOI:

10.1109/FPL64840.2024.00040

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Multi-Agent Reinforcement Learning (MARL) is an emerging technology that has seen success in many AI applications. Multi-Actor-Attention-Critic (MAAC) is a state-of-the-art MARL algorithm that uses a Multi-Head Attention (MHA) mechanism to learn messages communicated among agents during the training process. Current implementations of MAAC using CPU and CPU-GPU platforms lack fine-grained parallelism among agents, sequentially executing each stage of the training loop, and their performance suffers from costly data movement involved in MHA communication learning. In this work, we develop the first high-throughput accelerator for MARL with attention-based communication on a CPU-FPGA heterogeneous system. We alleviate the limitations of existing implementations through a combination of data- and pipeline-parallel modules in our accelerator design and enable fine-grained system scheduling for exploiting concurrency among heterogeneous resources. Our design increases the overall system throughput by 4.6x and 4.1x compared to CPU and CPU-GPU implementations, respectively.

Pages: 236-242

Page count: 7

