Massively High-Throughput Reinforcement Learning for Classic Control on GPUs

Cited by: 0

Authors:

Sha, Xuan [1,2]

Lan, Tian [3]

Affiliations:

[1] Southeast Univ, Chengxian Coll, Sch Civil & Transportat Engn, Nanjing 210088, Jiangsu, Peoples R China

[2] Hohai Univ, Coll Mech & Engn Sci, Nanjing 211100, Jiangsu, Peoples R China

[3] Salesforce AI Res, Palo Alto, CA 94301 USA

Source:

IEEE ACCESS | 2024, Vol. 12

Keywords:

Graphics processing units; Reinforcement learning; Training; Instruction sets; Computer architecture; Trajectory; Throughput; Control systems; Classic control; GPU acceleration; high-throughput; reinforcement learning;

DOI:

10.1109/ACCESS.2024.3441242

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

This study presents a novel massively high-throughput reinforcement learning (RL) framework specifically designed for addressing classic control problems, leveraging our proposed architecture and algorithms optimized for efficient concurrent computations on GPUs. Our research demonstrates the effectiveness of our methods in efficiently training RL agents across various classic control problems, encompassing both discrete and continuous domains, while achieving rapid and stable performance up to 10K concurrent environment instances. Furthermore, we observe that RL exploration with a large number of parallel instances significantly enhances the stability of updating a shared model. For instance, we show that the stability of Deep Deterministic Policy Gradient (DDPG) training can be achieved without requiring experience replay, as evidenced in our study.


Pages: 117737-117744

Number of pages: 8

50 items in total

[1] Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Hou, Jing
Chen, Guang
Zhang, Ruiqi
Li, Zhijun
Gu, Shangding
Jiang, Changjun
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 36 (02) : 282 - 292
[2] Anchored Hybrid Enrichment for Massively High-Throughput Phylogenomics
Lemmon, Alan R.
Emme, Sandra A.
Lemmon, Emily Moriarty
SYSTEMATIC BIOLOGY, 2012, 61 (05) : 727 - 744
[3] Agents that Listen: High-Throughput Reinforcement Learning with Multiple Sensory Systems
Hegde, Shashank
Kanervisto, Anssi
Petrenko, Aleksei
2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 1006 - 1010
[4] High-Throughput Transistor-Level Fault Simulation on GPUs
Schneider, Eric
Wunderlich, Hans-Joachim
2016 IEEE 25TH ASIAN TEST SYMPOSIUM (ATS), 2016, : 150 - 155
[5] Optimizing High-Throughput Capabilities by Leveraging Reinforcement Learning Methods with the Bluesky Suite
Olds, Daniel
Allan, Daniel B.
Caswell, Thomas A.
Lynch, Joshua
Maffettone, Phillip M.
Campbell, Stuart, I
PROCEEDINGS OF XLOOP 2021: THE 3RD ANNUAL WORKSHOP ON EXTREME-SCALE EXPERIMENT-IN-THE-LOOP COMPUTING, 2021, : 36 - 42
[6] Microbial experimental evolution in a massively multiplexed and high-throughput era
Jagdish, Tanush
Ba, Alex N. Nguyen
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2022, 75
[7] High-throughput tumor genomic profiling by massively parallel sequencing
Wagle, Nikhil
Davis, Matt
Berger, Michael F.
Blumenstiel, Brendan
Defelice, Matthew
Hahn, William
Meyerson, Matthew
Gabriel, Stacey B.
MacConaill, Laura
Garraway, Levi A.
CANCER RESEARCH, 2010, 70
[8] Implementation of a high-throughput low-latency polyphase channelizer on GPUs
Kim, Scott C.
Bhattacharyya, Shuvra S.
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2014,
[9] Implementation of a high-throughput low-latency polyphase channelizer on GPUs
Scott C Kim
Shuvra S Bhattacharyya
EURASIP Journal on Advances in Signal Processing, 2014 (1)
[10] Mckeycutter: A High-throughput Key Generator of Classic McEliece on Hardware
Zhu, Yihong
Zhu, Wenping
Chen, Chen
Zhu, Min
Li, Zhengdong
Wei, Shaojun
Liu, Leibo
2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
