Modified MapReduce Framework for Enhancing Performance of Graph Based Algorithms by Fast Convergence in Distributed Environment

被引:0
作者
Singhal, Hitesh [1 ]
Guddeti, Ram Mohana Reddy [1 ]
机构
[1] Natl Inst Technol Kamataka, Dept Informat Technol, Mangalore, India
来源
2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI) | 2014年
关键词
MapReduce; Graph algorithms; Iterative Computations;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The amount of data which is produced is huge in current world and more importantly it is increasing exponentially. Traditional data storage and processing techniques are ineffective in handling such huge data [10]. Many real life applications require iterative computations in general and in particular used in most of machine learning and data mining algorithms over large datasets, such as web link structures and social network graphs. MapReduce is a software framework for easily writing applications which process large amount of data (multi-terabyte) in parallel on large clusters (thousands of nodes) of commodity hardware. However, because of batch oriented processing of MapReduce we are unable to utilize the benefits of MapReduce in iterative computations. Our proposed work is mainly focused on optimizing three factors resulting in performance improvement of iterative algorithms in MapReduce environment. In this paper, we address the key issues based on execution of tasks, the unnecessary creation of new task in each iteration and excessive shuffling of data in each iteration. Our preliminary experiments have shown promising results over the basic MapReduce framework. The comparative study with existing solutions based on MapReduce framework like HaLoop, has also shown better performance w.r.t algorithm run time and amount of data traffic over Hadoop Cluster.
引用
收藏
页码:1240 / 1245
页数:6
相关论文
共 14 条
[1]  
[Anonymous], 2010, P 19 ACM INT S HIGH, DOI DOI 10.1145/1851476.1851593
[2]  
[Anonymous], 2013, 46 HAW INT C SYST SC
[3]  
[Anonymous], 2010, HOTCLOUD 10
[4]  
Backstrom L., 2006, GROUP FORMATION LARG
[5]  
Condie Tyson, YAH RES ACM NSDI 10
[6]  
Elnikety Eslam, 2011, 3 IEEE INT C CLOUD C
[7]  
Hobi Livio, SQL VERSUS MAPREDUCE
[8]  
Humbetov S., 2012, APPL INFORM COMMUNIC, P1
[9]  
Khambatla Kartik, 2010, P 2010 IEEE INT C CL
[10]   Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters [J].
Leskovec, Jure ;
Lang, Kevin J. ;
Dasgupta, Anirban ;
Mahoney, Michael W. .
INTERNET MATHEMATICS, 2009, 6 (01) :29-123