Toward Scheduling I/O Request of Mapreduce Tasks Based on Markov Model

被引:1
作者
Ikken, Sonia [1 ,2 ]
Renault, Eric [1 ,2 ]
Kechadi, M. Tahar [3 ]
Tari, Abdelkamel [4 ]
机构
[1] Telecom SudParis, Inst Mines Telecom, Evry, France
[2] CNRS, Lab Samovar, UMR 5157, Evry, France
[3] UCD Sch Comp Sci & Informat, Dublin, Ireland
[4] Univ Abdarahmane Mira, Bejaia, Algeria
来源
MOBILE, SECURE, AND PROGRAMMABLE NETWORKING, MSPN 2015 | 2015年 / 9395卷
关键词
Mapreduce; Cloud storage; Disk I/O; Markov model; Scheduling algorithm;
D O I
10.1007/978-3-319-25744-0_7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In Cloud storage of multiple CPU cores, many Mapreduce applications may run in parallel on each compute node and collocate with local Disks storage. These Disks storage are shared by multiple applications that use full CPU power of the node. Each application tends to issue contiguous I/O requests in parallel to the same Disk; however if large number of Mapreduce tasks enters the I/O phase at the same time, the requests from the same task may be interrupted by the requests of other tasks. Then, the I/O nodes receive these requests as non-contiguous way under I/O contention. This interleaved access pattern causes performance degradation for Mapreduce application, this is particularly important when writing intermediate files by multiple tasks in parallel to the shared Disk storage. In order to overcome this problem, we have proposed approach for optimizing write access for Mapreduce application. The contributions of this paper are: (1) analyze the open issues on scheduling access request of Mapreduce workload; (2) propose framework for scheduling and predicting I/O request of Mapreduce application; (3) describe each role of component that intervenes in the scheduling theses I/O request on Block-level of storage server to provide contiguous access.
引用
收藏
页码:78 / 89
页数:12
相关论文
共 20 条
[1]  
[Anonymous], ACM S OP SYST PRINC
[2]  
Celis J.R., 2014, INT J COMPUT SCI ISS, V11, P74
[3]  
Chiang R.C., 2011, Proceedings of the 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC'11, P1
[4]  
Ching WK, 2006, INT SER OPER RES MAN, P1
[5]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[6]  
Filip B., 2013, P HOTCLOUD 2013 5 US
[7]  
Gulati A., 2011, ACM Symposium on Cloud Computing, P19, DOI DOI 10.1145/2038916.2038935
[8]  
Herodotou H, 2010, TECHNICAL REPORT
[9]  
Huai Y., 2011, P SOCC, P4
[10]   Automatic Optimization for MapReduce Programs [J].
Jahani, Eaman ;
Cafarella, Michael J. ;
Re, Christopher .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (06) :385-396