Developing Real-Time Scheduling Policy by Deep Reinforcement Learning

Cited by: 8
Authors
Bo, Zitong [1, 2]
Qiao, Ying [1]
Leng, Chang [1]
Wang, Hongan [1]
Guo, Chaoping [1]
Zhang, Shaohui [3]
Affiliations
[1] Chinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Beijing Natl Speed Skating Oval Operat Co Ltd, Beijing, Peoples R China
Source
2021 IEEE 27TH REAL-TIME AND EMBEDDED TECHNOLOGY AND APPLICATIONS SYMPOSIUM (RTAS 2021) | 2021
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
real-time scheduling; reinforcement learning; multiprocessor system; deep neural network
DOI
10.1109/RTAS52030.2021.00019
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Designing scheduling policies for multiprocessor real-time systems is challenging because the multiprocessor scheduling problem is NP-complete. Existing heuristics are customized policies that may perform poorly under certain task loads. A new design approach is therefore needed so that multiprocessor scheduling policies perform well across a variety of task loads. In this paper, we investigate a new real-time scheduling policy based on reinforcement learning. For any given real-time task set, our policy automatically learns to achieve high performance through online learning. Specifically, we model the real-time scheduling process as a multi-agent cooperative game and propose multi-agent self-cooperative learning, which overcomes the curse of dimensionality and the credit assignment problem. Simulation results show that our approach can learn high-performance policies for various task and system models.
Pages: 131-142
Number of pages: 12