Task Scheduling Based on Adaptive Priority Experience Replay on Cloud Platforms

被引:1
作者
Li, Cuixia [1 ,2 ]
Gao, Wenlong [2 ]
Shi, Li [1 ,3 ]
Shang, Zhiquan [2 ]
Zhang, Shuyan [2 ]
机构
[1] Zhengzhou Univ, Sch Elect Engn, Zhengzhou 450001, Peoples R China
[2] Zhengzhou Univ, Sch Cyber Sci & Engn, Zhengzhou 450001, Peoples R China
[3] Tsinghua Univ, Dept Automation, Beijing 100084, Peoples R China
关键词
reinforce learning; adaptive priority experience replay (APER); task scheduling; cloud platform; SCIENTIFIC WORKFLOWS; REINFORCEMENT; SCHEME;
D O I
10.3390/electronics12061358
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Task scheduling algorithms based on reinforce learning (RL) have been important methods with which to improve the performance of cloud platforms; however, due to the dynamics and complexity of the cloud environment, the action space has a very high dimension. This not only makes agent training difficult but also affects scheduling performance. In order to guide an agent's behavior and reduce the number of episodes by using historical records, a task scheduling algorithm based on adaptive priority experience replay (APER) is proposed. APER uses performance metrics as scheduling and sampling optimization objectives with which to improve network accuracy. Combined with prioritized experience replay (PER), an agent can decide how to use experiences. Moreover, this algorithm also considers whether a subtask is executed in a workflow to improve scheduling efficiency. Experimental results on Tpc-h, Alibaba cluster data, and scientific workflows show that a model with APER has significant benefits in terms of convergence and performance.
引用
收藏
页数:20
相关论文
共 69 条
[1]   Efficient Task Scheduling for Applications on Clouds [J].
Al-Zoubi, Hussein .
2019 6TH IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND CLOUD COMPUTING (IEEE CSCLOUD 2019) / 2019 5TH IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING AND SCALABLE CLOUD (IEEE EDGECOM 2019), 2019, :10-13
[2]  
[Anonymous], 1998, Reinforcement Learning: An Introduction
[3]   Memory trace replay: the shaping of memory consolidation by neuromodulation [J].
Atherton, Laura A. ;
Dupret, David ;
Mellor, Jack R. .
TRENDS IN NEUROSCIENCES, 2015, 38 (09) :560-570
[4]   Deep Learning-Based Job Placement in Distributed Machine Learning Clusters With Heterogeneous Workloads [J].
Bao, Yixin ;
Peng, Yanghua ;
Wu, Chuan .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (02) :634-647
[5]  
Bengio Y, 2009, P 26 ANN INT C MACH, P41, DOI [10.1145/1553374.1553380, DOI 10.1145/1553374.1553380]
[6]  
Bharathi S, 2008, 2008 THIRD WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE (WORKS 2008), P11
[7]  
Bondy J.A., 1976, SIAM Review, V21, P429, DOI [10.1137/1021086, DOI 10.1137/1021086]
[8]   Uncertainty-Aware Online Scheduling for Real-Time Workflows in Cloud Service Environment [J].
Chen, Huangke ;
Zhu, Xiaomin ;
Liu, Guipeng ;
Pedrycz, Witold .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2021, 14 (04) :1167-1178
[9]   Scheduling Jobs across Geo-Distributed Datacenters with Max-Min Fairness [J].
Chen, Li ;
Liu, Shuhao ;
Li, Baochun ;
Li, Bo .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2019, 6 (03) :488-500
[10]   HD Live Maps for Automated Driving: An AI Approach [J].
Chen, Xin .
26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, :1-1