<bold>Data mining in Cloud Computing </bold>

被引:0
作者
Geng, Xia [1 ]
Yang, Zhi [2 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang, Jiangsu, Peoples R China
[2] Jiangsu Univ, Sch Management, Zhenjiang, Jiangsu, Peoples R China
来源
PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND COMPUTER APPLICATIONS (ICSA 2013) | 2013年 / 92卷
关键词
Data Mining; Cloud Computing; Map-Reduce; Hadoop; MAPREDUCE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
,Data. Mining is a process of extracting potentially useful information from raw so as to improve the quality of the information service. With the rapid development of the Internet, the size of the data has increased from KB level to TB even PB level; The object of data mining is also more and more complicated, so the data mining algorithm need to be more efficient. Cloud computing can provide infrastructure to massive and complex data of data mining, as well as new challenging issues for data mining of cloud computing research are emerged. This paper introduces the basic concept of cloud computing and data mining firstly, and sketches out how data mining is used in cloud computing; Then summarizes the research of parallel programming mode especially analyses the Map-reduce programming model and it's development platform-Hadoop; finally, overviews efficient mass data mining algorithm based on parallel programming model and mass data mining service based on the cloud computing.
引用
收藏
页码:1 / 7
页数:7
相关论文
共 10 条
[1]  
[Anonymous], 1995, IEEE STANDARD INFO 1
[2]  
[Anonymous], P 2010 ACM SIGMOD IN, DOI [DOI 10.1145/1807167.1807184, 10.1145/1807167.1807184]
[3]  
BHADURI K, 2011, DISTRIBUTED DATA MIN
[4]  
BRADSKI G., 2007, NIPS, P281
[5]   Mapreduce: Simplified data processing on large clusters [J].
Dean, Jeffrey ;
Ghemawat, Sanjay .
COMMUNICATIONS OF THE ACM, 2008, 51 (01) :107-113
[6]  
Gropp W.D., 1999, Using MPI: Portable parallel programming with the message-passing interface
[7]  
Ranger C, 2007, INT S HIGH PERF COMP, P13
[8]   The Weka4WS framework for distributed data mining in service-oriented Grids [J].
Talia, Domenico ;
Trunfio, Paolo ;
Verta, Oreste .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2008, 20 (16) :1933-1951
[9]   How Distributed Data Mining Tasks can Thrive as Knowledge Services [J].
Talia, Domenico ;
Trunfio, Paolo .
COMMUNICATIONS OF THE ACM, 2010, 53 (07) :132-137
[10]  
Yu L., 2012, P 18 ACM SIGKDD INT, P1496, DOI DOI 10.1145/2339530.2339764