PARADIGMS FOR REALIZING MACHINE LEARNING ALGORITHMS

Cited by: 13
Authors
Agneeswaran, Vijay Srinivas [1]
Tonpay, Pranay [1]
Tiwary, Jayati [1]
Affiliation
[1] Impetus Infotech India Private Ltd, Bangalore 560103, Karnataka, India
Keywords
DOI
10.1089/big.2013.0006
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
The article explains the three generations of machine learning algorithms, all of which try to operate on big data. The first-generation tools are SAS, SPSS, etc.; second-generation realizations include Mahout and RapidMiner (which work over Hadoop); and third-generation paradigms include Spark and GraphLab, among others. The essence of the article is that, for a number of machine learning algorithms, it is important to look beyond Hadoop's MapReduce paradigm in order to make them work on big data. A number of promising contenders have emerged in the third generation that can be exploited to realize deep analytics on big data.
Pages: BD207-BD214
Page count: 8