Massively High-Throughput Reinforcement Learning for Classic Control on GPUs

Cited by: 0
Authors
Sha, Xuan [1, 2]
Lan, Tian [3]
Affiliations
[1] Southeast Univ, Chengxian Coll, Sch Civil & Transportat Engn, Nanjing 210088, Jiangsu, Peoples R China
[2] Hohai Univ, Coll Mech & Engn Sci, Nanjing 211100, Jiangsu, Peoples R China
[3] Salesforce AI Res, Palo Alto, CA 94301 USA
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Graphics processing units; Reinforcement learning; Training; Instruction sets; Computer architecture; Trajectory; Throughput; Control systems; Classic control; GPU acceleration; high-throughput
DOI
10.1109/ACCESS.2024.3441242
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
This study presents a massively high-throughput reinforcement learning (RL) framework for classic control problems, built on our proposed architecture and algorithms, which are optimized for efficient concurrent computation on GPUs. Our methods train RL agents efficiently across a range of classic control problems in both discrete and continuous domains, with rapid and stable performance at up to 10K concurrent environment instances. Furthermore, we observe that RL exploration with a large number of parallel instances significantly enhances the stability of updates to a shared model. For instance, we show that Deep Deterministic Policy Gradient (DDPG) training can be stabilized without experience replay.
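The idea of stepping many environment instances concurrently can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the cart-pole task, the constants, and the names `reset` and `step` are hypothetical choices made here, and the loop runs on the CPU. The point is the data layout: one array per state variable and one slot per instance, so that each index could be handled by its own GPU thread.

```python
import math
import random

# Textbook cart-pole constants (illustrative; not taken from the paper).
GRAVITY, CART_MASS, POLE_MASS = 9.8, 1.0, 0.1
HALF_POLE_LENGTH, FORCE_MAG, DT = 0.5, 10.0, 0.02
X_LIMIT, THETA_LIMIT = 2.4, 12.0 * math.pi / 180.0


def reset(n, rng):
    """Return a struct-of-arrays state: one list per variable, one slot per instance."""
    return {k: [rng.uniform(-0.05, 0.05) for _ in range(n)]
            for k in ("x", "x_dot", "theta", "theta_dot")}


def step(state, actions):
    """Advance every instance by one Euler step and return (rewards, dones).

    Each iteration reads and writes only index i, so the iterations are
    independent and could be mapped to separate GPU threads.
    """
    total_mass = CART_MASS + POLE_MASS
    pole_ml = POLE_MASS * HALF_POLE_LENGTH
    rewards, dones = [], []
    for i, a in enumerate(actions):
        x, x_dot = state["x"][i], state["x_dot"][i]
        th, th_dot = state["theta"][i], state["theta_dot"][i]
        force = FORCE_MAG if a == 1 else -FORCE_MAG
        cos_t, sin_t = math.cos(th), math.sin(th)
        temp = (force + pole_ml * th_dot ** 2 * sin_t) / total_mass
        th_acc = (GRAVITY * sin_t - cos_t * temp) / (
            HALF_POLE_LENGTH * (4.0 / 3.0 - POLE_MASS * cos_t ** 2 / total_mass))
        x_acc = temp - pole_ml * th_acc * cos_t / total_mass
        state["x"][i] = x + DT * x_dot
        state["x_dot"][i] = x_dot + DT * x_acc
        state["theta"][i] = th + DT * th_dot
        state["theta_dot"][i] = th_dot + DT * th_acc
        done = abs(state["x"][i]) > X_LIMIT or abs(state["theta"][i]) > THETA_LIMIT
        rewards.append(1.0)  # +1 for every step taken, as in the usual cart-pole task
        dones.append(done)
    return rewards, dones


rng = random.Random(0)
num_envs = 10_000  # matches the "10K concurrent instances" scale in the abstract
state = reset(num_envs, rng)
actions = [rng.randint(0, 1) for _ in range(num_envs)]  # stand-in for a shared policy
rewards, dones = step(state, actions)
```

In a setup like this, a single shared policy would choose all 10,000 actions and be updated from the whole batch of transitions at once, which is one plausible reading of why a large number of parallel instances could reduce the need for experience replay.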
Pages: 117737-117744
Page count: 8