Dynamic Token Based Improving MapReduce Performance in Cloud Computing

被引:0
作者
Zhou, Mosong [1 ]
Chen, Heng [1 ]
Dong, Xiaoshe [1 ]
Zhu, Zhengdong [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian, Peoples R China
来源
PROCEEDINGS 2015 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING BDCLOUD 2015 | 2015年
关键词
cloud computing; MapRduce; Hadoop; resource allocation; dynamic scheduling;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In recent years, Hadoop, the open-source implementation of Google's MapReduce, is widely used and has become the de facto standard of big data processing. A typical running environment of Hadoop is cloud computing in which resource heterogeneity is very common due to varied factors including the different hardware of nodes and the different workload on the nodes and etc. The slot-based scheduling in Hadoop causes the inefficient utilization of computing resources in cloud computing environment, which lead to the performance degradation. To solve the problem mentioned above, we propose a dynamic token based method which dynamically controls the number of tasks running on each node according to the available computing resources on a node and the resource requirement of a task. The results of evaluations show that the completion times of single jobs with the proposed method are approaching to the static optimum in the dedicated environment and better than the static optimums in the other two competitive environments. Moreover, the proposed method significantly improves the throughput of mixed workloads in all computing environments and performance in real cloud computing environment have been improved by 45.9% on average.
引用
收藏
页码:180 / 186
页数:7
相关论文
共 12 条
[1]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[2]  
Herodotou H., 2011, CIDR, V11, P261
[3]  
Huang SS, 2010, I C DATA ENGIN WORKS, P41, DOI 10.1109/ICDEW.2010.5452747
[4]  
Kwon Y., 2012, SIGMOD 12, P25
[5]   Workload Characteristic Oriented Scheduler for MapReduce [J].
Lu, Peng ;
Lee, Young Choon ;
Wang, Chen ;
Zhou, Bing Bing ;
Chen, Junliang ;
Zomaya, Albert Y. .
PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012), 2012, :156-163
[6]  
Polo J, 2011, LECT NOTES COMPUT SC, V7049, P187
[7]   A Hybrid Scheduling Approach for Scalable Heterogeneous Hadoop Systems [J].
Rasooli, Aysan ;
Down, Douglas G. .
2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, :1284-1291
[8]  
Reiss C., 2012, P 3 ACM S CLOUD COMP, P1, DOI 10.1145/2391229.2391236
[9]   A Dynamic MapReduce Scheduler for Heterogeneous Workloads [J].
Tian, Chao ;
Zhou, Haojie ;
He, Yongqiang ;
Zha, Li .
2009 EIGHTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2009, :218-224
[10]   MapReduce Workload Modeling with Statistical Approach [J].
Yang, Hailong ;
Luan, Zhongzhi ;
Li, Wenjun ;
Qian, Depei .
JOURNAL OF GRID COMPUTING, 2012, 10 (02) :279-310