SPO: A Secure and Performance-aware Optimization for MapReduce Scheduling

被引:10
作者
Maleki, Neda [1 ]
Rahmani, Amir Masoud [1 ]
Conti, Mauro [2 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Sci & Res Branch, Tehran, Iran
[2] Univ Padua, Dept Math, Padua, Italy
关键词
Bigdata; Hadoop; MapReduce; Scheduling; Makespan; Security; Optimization model; Heterogeneity; LOCALITY-AWARE; CLOUD; ALGORITHMS; MAKESPAN; TIME; SYSTEMS;
D O I
10.1016/j.jnca.2020.102944
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
MapReduce is a common framework that effectively processes multi-petabyte data in a distributed manner. Therefore, MapReduce is widely used in heterogeneous environments, such as cloud, to provide performance adequate for system needs. Despite the MapReduce benefits, tweaking the system configuration to achieve the maximum performance is still challenging and needs deep expertise. Besides, some new MapReduce security issues, which has not been well-addressed yet, are recently raised. In this paper, we present a performance-aware and secure framework, named SPO, to minimize the makespan of the tasks while considering task security constraints. Inspired by the HEFT algorithm, first, we introduce SPO, which proposes a two-stage static scheduler in Map and Reduce phases, respectively, to minimize makespan while considering network traffic. Plus, SPO* introduces a mathematical optimization model of the proposed scheduler aiming to estimate the system performance while considering security constraints with an error of less than 2%. The experimental results demonstrate that SPO outperforms Hadoop-stock in terms of makespan and network traffic by 29% and 31%, respectively, for the tasks running in heterogeneous environments.
引用
收藏
页数:24
相关论文
共 85 条
[1]  
Ahmad S, 2018, INT BHURBAN C APPL S, P495, DOI 10.1109/IBCAST.2018.8312270
[2]  
Alapati S.R., 2016, Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN
[3]  
Alrokayan M, 2014, 2014 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING IN EMERGING MARKETS (CCEM), P49
[4]  
[Anonymous], 2015, FIELD GUIDE HADOOP I
[5]  
[Anonymous], 2020, APACHE SPARK TUTORIA
[6]  
[Anonymous], 2010, NSDI
[7]  
[Anonymous], 2012, Hadoop: The Definitive Guide
[8]  
[Anonymous], 2010, NSDI
[9]  
Apache Software Foundation, 2010, HAD
[10]   Log files Analysis Using MapReduce to Improve Security [J].
Azizi, Yassine ;
Azizi, Mostafa ;
Elboukhari, Mohamed .
SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2018), 2019, 148 :37-44