Massively High-Throughput Reinforcement Learning for Classic Control on GPUs

Cited by: 0

Authors:

Sha, Xuan [1,2]

Lan, Tian [3]

Affiliations:

[1] Southeast Univ, Chengxian Coll, Sch Civil & Transportat Engn, Nanjing 210088, Jiangsu, Peoples R China

[2] Hohai Univ, Coll Mech & Engn Sci, Nanjing 211100, Jiangsu, Peoples R China

[3] Salesforce AI Res, Palo Alto, CA 94301 USA

Source:

IEEE Access | 2024 / Vol. 12

Keywords:

Graphics processing units; Reinforcement learning; Training; Instruction sets; Computer architecture; Trajectory; Throughput; Control systems; Classic control; GPU acceleration; high-throughput; reinforcement learning;

DOI:

10.1109/ACCESS.2024.3441242

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

This study presents a novel massively high-throughput reinforcement learning (RL) framework specifically designed for addressing classic control problems, leveraging our proposed architecture and algorithms optimized for efficient concurrent computations on GPUs. Our research demonstrates the effectiveness of our methods in efficiently training RL agents across various classic control problems, encompassing both discrete and continuous domains, while achieving rapid and stable performance up to 10K concurrent environment instances. Furthermore, we observe that RL exploration with a large number of parallel instances significantly enhances the stability of updating a shared model. For instance, we show that the stability of Deep Deterministic Policy Gradient (DDPG) training can be achieved without requiring experience replay, as evidenced in our study.


Pages: 117737-117744

Number of pages: 8

