Optimization of Big Data Parallel Scheduling Based on Dynamic Clustering Scheduling Algorithm

被引:3
作者
Liu, Fang [1 ,2 ]
He, Yanxiang [1 ]
He, Jing [4 ]
Gao, Xing [5 ]
Huang, Feihu [3 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Bayi Rd, Wuhan 430072, Hubei, Peoples R China
[2] Wuhan Inst City, Informat Engn Dept, Huangjia Dawan Rd, Wuhan 430083, Hubei, Peoples R China
[3] Wuhan Univ Sci & Technol, Coll Comp Sci, Huangjiahu West Rd, Wuhan 430065, Hubei, Peoples R China
[4] Kennesaw State Univ, Dept Comp Sci, 1100 South Pkwy, Marietta, GA 30060 USA
[5] Wuhan Inst City, Admiss Off, Huangjia Dawan Rd, Wuhan 430083, Hubei, Peoples R China
来源
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2022年 / 94卷 / 11期
基金
中国国家自然科学基金;
关键词
Big data environment; Dynamic data; Clustering algorithm; Scheduling optimization;
D O I
10.1007/s11265-022-01765-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In today's data age, the big data processing analysis framework plays an important role in mass information processing, along with the increasing of massive data. "Sharing Data" is proposed to enhance the performance of data processing through structured data scheduling. However, such approach makes the higher communication cost and buffer cost for the extra data copy and buffering. Hence, in the big data analysis environment, this paper uses based on the correlation of data, Dynamic Cluster Scheduling Algorithm(DCSA) is proposed for parallel optimization of big data tasks. Firstly, a dynamic data queue based on the server's request database is generated. The priority of data item and size of data item are as the considerations of dynamic data queue for data clustering association. And then the weights are introduced, the dynamic data item is made equalization to provide the basis for the multi-channel optimal scheduling. Secondly, according to the relevance of the data items, the mechanism of data optimized placement is used to make the data which are aggregated in the same frame. After the placement is completed, the dynamic data is uniformly scheduled to minimize the cost at the time of migration, with the local characteristics of the data item as constraints. Through the target iteration, the optimal scheduling scheme is adjusted, and finally to achieve multi-channel optimal scheduling. Experiments show that the proposed method enables dynamic data to achieve optimal scheduling.
引用
收藏
页码:1243 / 1251
页数:9
相关论文
共 29 条
[1]   A unified view of parallel machine scheduling with interdependent processing rates [J].
Alidaee, Bahram ;
Wang, Haibo ;
Kethley, R. Bryan ;
Landram, Frank .
JOURNAL OF SCHEDULING, 2019, 22 (05) :499-515
[2]   Big data medical behavior analysis based on machine learning and wireless sensors [J].
Cui, Moyang .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (12) :9413-9427
[3]   A Data-Driven Parallel Scheduling Approach for Multiple Agile Earth Observation Satellites [J].
Du, Yonghao ;
Wang, Tao ;
Xin, Bin ;
Wang, Ling ;
Chen, Yingguo ;
Xing, Lining .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2020, 24 (04) :679-693
[4]  
Goossens S, 2016, EMBED SYST, P1, DOI 10.1007/978-3-319-32094-6
[5]   Improved approximation algorithms for the combination problem of parallel machine scheduling and path [J].
Guan, Li ;
Li, Jianping ;
Li, Weidong ;
Lichen, Junran .
JOURNAL OF COMBINATORIAL OPTIMIZATION, 2019, 38 (03) :689-697
[6]   Big data based stock trend prediction using deep CNN with reinforcement-LSTM model [J].
Ishwarappa ;
Anuradha, J. .
INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2021,
[7]   A new method for a class of parallel batch machine scheduling problem [J].
Jiang, Wei ;
Shen, Yilan ;
Liu, Lingxuan ;
Zhao, Xiancong ;
Shi, Leyuan .
FLEXIBLE SERVICES AND MANUFACTURING JOURNAL, 2022, 34 (02) :518-550
[8]  
Kordon A. M., 2020, DISCRETE APPL MATH
[9]   Memory-Aware Scheduling Parallel Real-Time Tasks for Multicore Systems [J].
Lei, Zhenyang ;
Lei, Xiangdong ;
Long, Jun .
INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2021, 31 (04) :613-634
[10]   Intelligent Fault Diagnosis by Fusing Domain Adversarial Training and Maximum Mean Discrepancy via Ensemble Learning [J].
Li, Yibin ;
Song, Yan ;
Jia, Lei ;
Gao, Shengyao ;
Li, Qiqiang ;
Qiu, Meikang .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (04) :2833-2841