Massively High-Throughput Reinforcement Learning for Classic Control on GPUs

被引:0
|
作者
Sha, Xuan [1 ,2 ]
Lan, Tian [3 ]
机构
[1] Southeast Univ, Chengxian Coll, Sch Civil & Transportat Engn, Nanjing 210088, Jiangsu, Peoples R China
[2] Hohai Univ, Coll Mech & Engn Sci, Nanjing 211100, Jiangsu, Peoples R China
[3] Salesforce AI Res, Palo Alto, CA 94301 USA
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Graphics processing units; Reinforcement learning; Training; Instruction sets; Computer architecture; Trajectory; Throughput; Control systems; Classic control; GPU acceleration; high-throughput; reinforcement learning;
D O I
10.1109/ACCESS.2024.3441242
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study presents a novel massively high-throughput reinforcement learning (RL) framework specifically designed for addressing classic control problems, leveraging our proposed architecture and algorithms optimized for efficient concurrent computations on GPUs. Our research demonstrates the effectiveness of our methods in efficiently training RL agents across various classic control problems, encompassing both discrete and continuous domains, while achieving rapid and stable performance up to 10K concurrent environment instances. Furthermore, we observe that RL exploration with a large number of parallel instances significantly enhances the stability of updating a shared model. For instance, we show that the stability of Deep Deterministic Policy Gradient (DDPG) training can be achieved without requiring experience replay, as evidenced in our study.
引用
收藏
页码:117737 / 117744
页数:8
相关论文
共 50 条
  • [1] Spreeze: High-Throughput Parallel Reinforcement Learning Framework
    Hou, Jing
    Chen, Guang
    Zhang, Ruiqi
    Li, Zhijun
    Gu, Shangding
    Jiang, Changjun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 36 (02) : 282 - 292
  • [2] Anchored Hybrid Enrichment for Massively High-Throughput Phylogenomics
    Lemmon, Alan R.
    Emme, Sandra A.
    Lemmon, Emily Moriarty
    SYSTEMATIC BIOLOGY, 2012, 61 (05) : 727 - 744
  • [3] Agents that Listen: High-Throughput Reinforcement Learning with Multiple Sensory Systems
    Hegde, Shashank
    Kanervisto, Anssi
    Petrenko, Aleksei
    2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 1006 - 1010
  • [4] High-Throughput Transistor-Level Fault Simulation on GPUs
    Schneider, Eric
    Wunderlich, Hans-Joachim
    2016 IEEE 25TH ASIAN TEST SYMPOSIUM (ATS), 2016, : 150 - 155
  • [5] Optimizing High-Throughput Capabilities by Leveraging Reinforcement Learning Methods with the Bluesky Suite
    Olds, Daniel
    Allan, Daniel B.
    Caswell, Thomas A.
    Lynch, Joshua
    Maffettone, Phillip M.
    Campbell, Stuart, I
    PROCEEDINGS OF XLOOP 2021: THE 3RD ANNUAL WORKSHOP ON EXTREME-SCALE EXPERIMENT-IN-THE-LOOP COMPUTING, 2021, : 36 - 42
  • [6] Microbial experimental evolution in a massively multiplexed and high-throughput era
    Jagdish, Tanush
    Ba, Alex N. Nguyen
    CURRENT OPINION IN GENETICS & DEVELOPMENT, 2022, 75
  • [7] High-throughput tumor genomic profiling by massively parallel sequencing
    Wagle, Nikhil
    Davis, Matt
    Berger, Michael F.
    Blumenstiel, Brendan
    Defelice, Matthew
    Hahn, VVilliam
    Meyerson, Matthew
    Gabriel, Stacey B.
    MacConaill, Laura
    Garraway, Levi A.
    CANCER RESEARCH, 2010, 70
  • [8] Implementation of a high-throughput low-latency polyphase channelizer on GPUs
    Kim, Scott C.
    Bhattacharyya, Shuvra S.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2014,
  • [9] Implementation of a high-throughput low-latency polyphase channelizer on GPUs
    Scott C Kim
    Shuvra S Bhattacharyya
    EURASIP Journal on Advances in Signal Processing, 2014 (1)
  • [10] Mckeycutter: A High-throughput Key Generator of Classic McEliece on Hardware
    Zhu, Yihong
    Zhu, Wenping
    Chen, Chen
    Zhu, Min
    Li, Zhengdong
    Wei, Shaojun
    Liu, Leibo
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,