Massively High-Throughput Reinforcement Learning for Classic Control on GPUs

被引：0

作者：

Sha, Xuan ^{[1
,2
]}

Lan, Tian ^{[3
]}

机构：

[1] Southeast Univ, Chengxian Coll, Sch Civil & Transportat Engn, Nanjing 210088, Jiangsu, Peoples R China

[2] Hohai Univ, Coll Mech & Engn Sci, Nanjing 211100, Jiangsu, Peoples R China

[3] Salesforce AI Res, Palo Alto, CA 94301 USA

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Graphics processing units; Reinforcement learning; Training; Instruction sets; Computer architecture; Trajectory; Throughput; Control systems; Classic control; GPU acceleration; high-throughput; reinforcement learning;

D O I：

10.1109/ACCESS.2024.3441242

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This study presents a novel massively high-throughput reinforcement learning (RL) framework specifically designed for addressing classic control problems, leveraging our proposed architecture and algorithms optimized for efficient concurrent computations on GPUs. Our research demonstrates the effectiveness of our methods in efficiently training RL agents across various classic control problems, encompassing both discrete and continuous domains, while achieving rapid and stable performance up to 10K concurrent environment instances. Furthermore, we observe that RL exploration with a large number of parallel instances significantly enhances the stability of updating a shared model. For instance, we show that the stability of Deep Deterministic Policy Gradient (DDPG) training can be achieved without requiring experience replay, as evidenced in our study.

引用

页码：117737 / 117744

页数：8

共 50 条

[21] Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem
Zhang, Chengwei
Jin, Shan
Xue, Wanli
Xie, Xiaofei
Chen, Shengyong
Chen, Rong
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (08) : 7426 - 7436
[22] High-Throughput Venomics
Slagboom, Julien
Derks, Rico J. E.
Sadighi, Raya
Somsen, Govert W.
Ulens, Chris
Casewell, Nicholas R.
Kool, Jeroen
JOURNAL OF PROTEOME RESEARCH, 2023, 22 (06) : 1734 - 1746
[23] Categorical Matrix Completion With Active Learning for High-Throughput Screening
Chen, Junyi
Hou, Junhui
Wong, Ka-Chun
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2261 - 2270
[24] High-throughput to success
Winder, Robert
Chemistry and Industry (London), 2003, (01):
[25] Trajectory Design and Access Control for Air-Ground Coordinated Communications System With Multiagent Deep Reinforcement Learning
Ding, Ruijin
Xu, Yadong
Gao, Feifei
Shen, Xuemin
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (08) : 5785 - 5798
[26] Accelerating materials science with high-throughput computations and machine learning
Ong, Shyue Ping
COMPUTATIONAL MATERIALS SCIENCE, 2019, 161 : 143 - 150
[27] An ultra high-throughput, massively multiplexable, single-cell RNA-seq platform in yeasts
Brettner, Leandra
Eder, Rachel
Schmidlin, Kara
Geiler-Samerotte, Kerry
YEAST, 2024, 41 (04) : 242 - 255
[28] Safe Reinforcement Learning via Episodic Control
Li, Zhuo
Zhu, Derui
Grossklags, Jens
IEEE ACCESS, 2025, 13 : 35270 - 35280
[29] High-Throughput LDPC-CC Decoders Based on Storage, Arithmetic, and Control Improvements
Chen, Yuxing
Cui, Hangxuan
Wang, Zhongfeng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1069 - 1073
[30] Optimizing Data Transfers for Improved Performance on Shared GPUs Using Reinforcement Learning
Luley, Ryan S.
Qiu, Qinru
2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 378 - 381

← 1 2 3 4 5 →