Fregata: A Low-Latency and Resource-Efficient Scheduling for Heterogeneous Jobs in Clouds

Cited by: 1
Author
Liu, Jinwei [1]
Affiliation
[1] Florida A&M Univ, Dept Comp & Informat Sci, Tallahassee, FL 32307 USA
Source
2022 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (IEEE BIGCOMP 2022) | 2022
Keywords
scheduling; task dependency; resource utilization; latency; machine learning;
DOI
10.1109/BigComp54360.2022.00013
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
An increasing number of large-scale data analytics frameworks are moving towards higher degrees of parallelism to provide low-latency guarantees. Designing a scheduler that achieves both low latency and high resource utilization is challenging because of task dependency and job heterogeneity. State-of-the-art schedulers in clouds and datacenters cannot handle heterogeneous jobs with dependency constraints (e.g., dependency among tasks of a job) well enough to achieve low latency and high resource utilization simultaneously. The key issues are the limited scalability of centralized schedulers and the ineffective and inefficient probing and resource sharing in both distributed and hybrid schedulers. To address this challenge, we propose Fregata, a low-latency and resource-efficient scheduling scheme for heterogeneous jobs with constraints (e.g., dependency constraints among tasks of a job) in clouds. Fregata first uses a machine learning algorithm to classify jobs into two categories (high-priority jobs and low-priority jobs) based on extracted features. Next, Fregata splits the jobs into tasks and distributes the tasks to the master nodes based on task dependency and the load of the master nodes. Then, Fregata uses the dependency information of tasks to determine task priority (tasks with more dependent tasks have higher priority), and packs tasks by leveraging task dependency and the complementarity of tasks' requirements on different resource types. Finally, the master nodes distribute tasks to workers based on the priority of tasks and workers, the resource demands of tasks, and the available resources of workers. To test the performance of Fregata, we conduct trace-driven experiments. Extensive experimental results based on a real cluster and the Amazon EC2 cloud service show that Fregata achieves lower latency and higher resource utilization than existing schedulers.
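The priority rule and the packing step described in the abstract can be illustrated with a small sketch. This is not Fregata's implementation: the function names `count_dependents` and `dispatch`, the two-resource (CPU, memory) demand vectors, and the "most balanced leftover" placement heuristic are all assumptions made for illustration. The sketch only shows how ranking tasks by their number of dependent tasks and pairing tasks with complementary resource demands might fit together.

```python
from collections import defaultdict


def count_dependents(deps):
    """Return task -> number of tasks that directly or transitively depend on it.

    deps maps every task to the set of tasks it depends on (hypothetical format).
    """
    children = defaultdict(set)
    for task, parents in deps.items():
        for parent in parents:
            children[parent].add(task)

    def descendants(task, seen):
        for child in children[task]:
            if child not in seen:
                seen.add(child)
                descendants(child, seen)
        return seen

    return {task: len(descendants(task, set())) for task in deps}


def dispatch(deps, demand, capacity):
    """Greedy placement sketch (an assumed heuristic, not the paper's algorithm).

    Tasks are visited in decreasing number of dependents. In a DAG a task always
    has more dependents than any task that depends on it, so this order also
    respects the dependencies. Each task goes to the worker that fits it and
    whose leftover CPU and memory fractions are closest to each other, which
    tends to pair CPU-heavy tasks with memory-heavy ones.
    """
    priority = count_dependents(deps)
    free = {worker: list(cap) for worker, cap in capacity.items()}
    placement = {}
    for task in sorted(deps, key=lambda t: (-priority[t], t)):
        cpu, mem = demand[task]
        candidates = []
        for worker in sorted(free):
            free_cpu, free_mem = free[worker]
            if free_cpu >= cpu and free_mem >= mem:
                cap_cpu, cap_mem = capacity[worker]
                imbalance = abs((free_cpu - cpu) / cap_cpu - (free_mem - mem) / cap_mem)
                candidates.append((imbalance, worker))
        if candidates:  # tasks that fit nowhere are left unplaced
            _, best = min(candidates)
            free[best][0] -= cpu
            free[best][1] -= mem
            placement[task] = best
    return placement


# Hypothetical job: b and c depend on a, d depends on b and c.
deps = {"a": set(), "b": {"a"}, "c": {"a"}, "d": {"b", "c"}}
demand = {"a": (2, 1), "b": (1, 2), "c": (3, 3), "d": (1, 1)}  # (cpu, mem)
capacity = {"w1": (4, 4), "w2": (4, 4)}
placement = dispatch(deps, demand, capacity)
```

In this example the CPU-heavy task `a` and the memory-heavy task `b` end up on the same worker, because together they leave that worker's remaining CPU and memory balanced.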
Pages: 15-22
Page count: 8