An Enhanced Data-Locality-Aware Task Scheduling Algorithm for Hadoop Applications

被引:13
|
作者
Choi, Dongjoo [1 ]
Jeon, Myunghoon [1 ]
Kim, Namgi [1 ]
Lee, Byoung-Dai [1 ]
机构
[1] Kyonggi Univ, Comp Sci Dept, Suwon 443760, South Korea
来源
IEEE SYSTEMS JOURNAL | 2018年 / 12卷 / 04期
基金
新加坡国家研究基金会;
关键词
Data locality; Hadoop distributed file system (HDFS); MapReduce; task scheduling;
D O I
10.1109/JSYST.2017.2764481
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In general, Hadoop improves the task scheduling performance by determining data locality based on the location in which the input splits and MapTask are executed. However, if an input split consists of multiple data blocks that are distributed and stored in different nodes, this data location method fails to cope with the degradation in processing performance due to the increased frequency of data block copying. We propose a task scheduling algorithm that solves this issue by defining a method to classify data locality taking into account the location of all data blocks that comprise an input split, categorizing tasks based on the defined method, and sequentially assigning tasks according to a given priority. This study measures the performance of the proposed algorithm through a comparison of the total processing time, MapTask performance time, and data block copying frequency between the proposed algorithm and Hadoop's default task scheduling algorithm. The test results show that the proposed algorithm improved the total processing time by up to 25% and the data block copying frequency by up to 28%, when compared to the default algorithm.
引用
收藏
页码:3346 / 3357
页数:12
相关论文
共 50 条
  • [41] Performance optimization of computing task scheduling based on the Hadoop big data platform
    Li, Yang
    Hei, Xinhong
    NEURAL COMPUTING & APPLICATIONS, 2022, 37 (13): : 8181 - 8192
  • [42] Prediction-Based and Locality-Aware Task Scheduling for Parallelizing Video Transcoding Over Heterogeneous MapReduce Cluster
    Zhao, Hui
    Zheng, Qinghua
    Zhang, Weizhan
    Wang, Jing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (04) : 1009 - 1020
  • [43] An Enhanced Task Scheduling Algorithm on Cloud Computing Environment
    Alkhashai, Hussin M.
    Omara, Fatma A.
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (07): : 91 - 100
  • [44] QoE Enhancement of Task Scheduling Algorithm for VANET Applications
    Ding, Nan
    Nie, Shuaihang
    Si, Huaiwei
    Gao, Huanbo
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2017, 2017, 10251 : 338 - 343
  • [45] Load Balance Aware Genetic Algorithm for Task Scheduling in Cloud Computing
    Zhan, Zhi-Hui
    Zhang, Ge-Yi
    Ying-Lin
    Gong, Yue-Jiao
    Zhang, Jun
    SIMULATED EVOLUTION AND LEARNING (SEAL 2014), 2014, 8886 : 644 - 655
  • [46] Proposal of a Context-aware Task Scheduling Algorithm for the Fog Paradigm
    Barros, Celestino
    Rocio, Vitor
    Sousa, Andre
    Paredes, Hugo
    Teixeira, Olavo
    2022 SEVENTH INTERNATIONAL CONFERENCE ON FOG AND MOBILE EDGE COMPUTING, FMEC, 2022, : 63 - 70
  • [47] A novel battery-aware task scheduling algorithm for multiprocessor systems
    Xie Yufeng
    Liu Leibo
    Dai Rui
    Wei Shaojin
    CHINESE JOURNAL OF ELECTRONICS, 2008, 17 (03): : 421 - 426
  • [48] Uncertainty-aware task scheduling algorithm in edge computing environments
    Yin L.
    Zhou J.-L.
    Sun J.
    Wu Z.-B.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (07): : 2405 - 2413
  • [49] A Resource-Aware Task Scheduling Algorithm on Mobile Computational Grid
    Chang, Yue-Shan
    Chang, Hung-Hsiang
    Sheu, Ruey-Kai
    Tsai, Ching-Tsorng
    JOURNAL OF INTERNET TECHNOLOGY, 2011, 12 (02): : 279 - 291
  • [50] Energy-Aware Marine Predators Algorithm for Task Scheduling in IoT-Based Fog Computing Applications
    Abdel-Basset, Mohamed
    Mohamed, Reda
    Elhoseny, Mohamed
    Bashir, Ali Kashif
    Jolfaei, Alireza
    Kumar, Neeraj
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (07) : 5068 - 5076