Parallel Processing of Big Data using Power Iteration Clustering over MapReduce

被引:2
作者
Jayalatchumy, D. [1 ]
Thambidurai, P. [1 ]
Alamelu, A. Vasumathi [1 ]
机构
[1] PKIET, CSE, Karaikal, India
来源
2014 WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT 2014) | 2014年
关键词
p-PIC; Hadoop; Fault tolerance; GBC;
D O I
10.1109/WCCCT.2014.16
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Extracting useful information from dataset measuring in gigabytes and tetrabytes is a real challenge for data miners. Clustering algorithm have the problem of scalability while dealing with big data. The problem can be handled using parallel algorithm by executing them along with input data on high performance computer. The problem with graph based application requires much time for computation. PIC is an algorithm that is simple, fast, relatively scalable which requires the data and its associated matrix to fit in memory and this becomes infeasible for big data applications. Scalability has been increased using p-PIC and this paper focus on exploring different parallelization strategies for minimizing and compelling communication cost. The algorithm works on with a parallel framework MapReduce. p-PIC algorithm deals with Hadoop cloud a parallel store and computing platform implementing p-PIC using Hadoop framework.
引用
收藏
页码:176 / 178
页数:3
相关论文
共 14 条
[1]   pPOP: Fast yet accurate parallel hierarchical clustering using partitioning [J].
Dash, Manoranjan ;
Petrutiu, Simona ;
Scheuermann, Peter .
DATA & KNOWLEDGE ENGINEERING, 2007, 61 (03) :563-578
[2]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[3]   Mapreduce: Simplified data processing on large clusters [J].
Dean, Jeffrey ;
Ghemawat, Sanjay .
COMMUNICATIONS OF THE ACM, 2008, 51 (01) :107-113
[4]  
Donatella Firmani G., MINIMUM SPANNING TRE
[5]  
Lin frank Frank, 2013, POWER ITERATION CLUS
[6]  
Mentor Deepika Mohan, 2011, HADOOP 0 20 2 GRAPH
[7]  
Omran M. G. H., 2005, WORLD ENF C, P332
[8]  
Philippe Biela HEI - LAGIS Philippe, INTR SPECTR CLUST
[9]  
Potter MA, 2010, LECT NOTES COMPUT SC, V6238, P374, DOI 10.1007/978-3-642-15844-5_38
[10]  
Shao J., 2012, IEEE T KNOWLEDGE DAT