Massively High-Throughput Reinforcement Learning for Classic Control on GPUs

被引:0
|
作者
Sha, Xuan [1 ,2 ]
Lan, Tian [3 ]
机构
[1] Southeast Univ, Chengxian Coll, Sch Civil & Transportat Engn, Nanjing 210088, Jiangsu, Peoples R China
[2] Hohai Univ, Coll Mech & Engn Sci, Nanjing 211100, Jiangsu, Peoples R China
[3] Salesforce AI Res, Palo Alto, CA 94301 USA
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Graphics processing units; Reinforcement learning; Training; Instruction sets; Computer architecture; Trajectory; Throughput; Control systems; Classic control; GPU acceleration; high-throughput; reinforcement learning;
D O I
10.1109/ACCESS.2024.3441242
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study presents a novel massively high-throughput reinforcement learning (RL) framework specifically designed for addressing classic control problems, leveraging our proposed architecture and algorithms optimized for efficient concurrent computations on GPUs. Our research demonstrates the effectiveness of our methods in efficiently training RL agents across various classic control problems, encompassing both discrete and continuous domains, while achieving rapid and stable performance up to 10K concurrent environment instances. Furthermore, we observe that RL exploration with a large number of parallel instances significantly enhances the stability of updating a shared model. For instance, we show that the stability of Deep Deterministic Policy Gradient (DDPG) training can be achieved without requiring experience replay, as evidenced in our study.
引用
收藏
页码:117737 / 117744
页数:8
相关论文
共 50 条
  • [21] Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem
    Zhang, Chengwei
    Jin, Shan
    Xue, Wanli
    Xie, Xiaofei
    Chen, Shengyong
    Chen, Rong
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (08) : 7426 - 7436
  • [22] High-Throughput Venomics
    Slagboom, Julien
    Derks, Rico J. E.
    Sadighi, Raya
    Somsen, Govert W.
    Ulens, Chris
    Casewell, Nicholas R.
    Kool, Jeroen
    JOURNAL OF PROTEOME RESEARCH, 2023, 22 (06) : 1734 - 1746
  • [23] Categorical Matrix Completion With Active Learning for High-Throughput Screening
    Chen, Junyi
    Hou, Junhui
    Wong, Ka-Chun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2261 - 2270
  • [24] High-throughput to success
    Winder, Robert
    Chemistry and Industry (London), 2003, (01):
  • [25] Trajectory Design and Access Control for Air-Ground Coordinated Communications System With Multiagent Deep Reinforcement Learning
    Ding, Ruijin
    Xu, Yadong
    Gao, Feifei
    Shen, Xuemin
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (08) : 5785 - 5798
  • [26] Accelerating materials science with high-throughput computations and machine learning
    Ong, Shyue Ping
    COMPUTATIONAL MATERIALS SCIENCE, 2019, 161 : 143 - 150
  • [27] An ultra high-throughput, massively multiplexable, single-cell RNA-seq platform in yeasts
    Brettner, Leandra
    Eder, Rachel
    Schmidlin, Kara
    Geiler-Samerotte, Kerry
    YEAST, 2024, 41 (04) : 242 - 255
  • [28] Safe Reinforcement Learning via Episodic Control
    Li, Zhuo
    Zhu, Derui
    Grossklags, Jens
    IEEE ACCESS, 2025, 13 : 35270 - 35280
  • [29] High-Throughput LDPC-CC Decoders Based on Storage, Arithmetic, and Control Improvements
    Chen, Yuxing
    Cui, Hangxuan
    Wang, Zhongfeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1069 - 1073
  • [30] Optimizing Data Transfers for Improved Performance on Shared GPUs Using Reinforcement Learning
    Luley, Ryan S.
    Qiu, Qinru
    2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 378 - 381