PredictionIO: A Distributed Machine Learning Server for Practical Software Development

被引:18
作者
Chan, Simon [1 ]
Stone, Thomas [1 ]
Szeto, Kit Pang [2 ]
Chan, Ka Hou [2 ]
机构
[1] UCL, Dept Comp Sci, London, England
[2] TappingStone Inc, Walnut, CA USA
来源
PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13) | 2013年
关键词
Machine Learning Server; Algorithm Selection;
D O I
10.1145/2505515.2508198
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the biggest challenges for software developers to build real-world predictive applications with machine learning is the steep learning curve of data processing frameworks, learning algorithms and scalable system infrastructure. We present PredictionIO, an open source machine learning server that comes with a step-by-step graphical user interface for developers to (i) evaluate, compare and deploy scalable learning algorithms, (ii) tune hyperparameters of algorithms manually or automatically and (iii) evaluate model training status. The system also comes with an Application Programming Interface (API) to communicate with software applications for data collection and prediction retrieval. The whole infrastructure of PredictionIO is horizontally scalable with a distributed computing component based on Hadoop. The demonstration shows a live example and workflows of building real-world predictive applications with the graphical user interface of PredictionIO, from data collection, algorithm tuning and selection, model training and re-training to real-time prediction querying.
引用
收藏
页数:3
相关论文
共 7 条
[1]  
[Anonymous], 2010, Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence (UAI)
[2]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[3]  
Fielding RaRT, 2000, PhD Dissertation
[4]   Evaluating collaborative filtering recommender systems [J].
Herlocker, JL ;
Konstan, JA ;
Terveen, K ;
Riedl, JT .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2004, 22 (01) :5-53
[5]  
Paulson LD, 2005, COMPUTER, V38, P14, DOI 10.1109/MC.2005.330
[6]  
Shvachko K, 2010, IEEE S MASS STOR SYS
[7]  
Zaharia Matei, 2010, P 2 USENIX C HOT TOP, V10, P95