Performance improvement of data mining in Weka through multi-core and GPU acceleration: opportunities and pitfalls

被引:10
作者
Engel, Tiago Augusto [1 ]
Charao, Andrea Schwertner [1 ]
Kirsch-Pinheiro, Manuele [2 ]
Steffenel, Luiz-Angelo [3 ]
机构
[1] Univ Fed Santa Maria, Lab Sistemas Computacao, BR-97119900 Santa Maria, RS, Brazil
[2] Univ Paris 01, Ctr Rech Informat, F-75231 Paris 05, France
[3] Univ Reims, Equipe SysCom, Lab CReSTIC, Reims, France
关键词
TOOLKIT;
D O I
10.1007/s12652-015-0292-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining tools may be computationally demanding, which leads to an increasing interest on parallel computing strategies in order to improve their performance. While multi-core processors and Graphics Processing Units (GPUs) accelerators increased the computing power of current desktop computers, we observe that desktop-based data mining tools do not take full advantage of these architectures yet. This paper investigates strategies to improve the performance of Weka, a popular data mining tool, through multi-core and GPU acceleration. Using performance profiling of Weka, we identify operations that could improve the data mining performance when parallelized. We selected two of these operations, and analyze the impact of their parallel execution on Weka's performance. These experiments demonstrate that while significant speedups can be achieved, all operations are not prone to be parallelized, which reinforces the need for a careful and well-studied selection of the candidates.
引用
收藏
页码:377 / 390
页数:14
相关论文
共 46 条
[11]  
Dotzler G., 2010, IWMSE '10: Proceedings of the 3rd International Workshop on Multicore Software Engineering, P10, DOI [10.1145/1808954.1808959, DOI 10.1145/1808954.1808959]
[12]   Performance improvement of data mining in Weka through GPU acceleration [J].
Engel, Tiago Augusto ;
Charao, Andrea Schwertner ;
Kirsch-Pinheiro, Manuele ;
Steffenel, Luiz-Angelo .
5TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2014), THE 4TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2014), 2014, 32 :93-100
[13]   Friend or foe? Fake profile identification in online social networks [J].
Fire, Michael ;
Kagan, Dima ;
Elyashar, Aviad ;
Elovici, Yuval .
SOCIAL NETWORK ANALYSIS AND MINING, 2014, 4 (01) :1-23
[14]  
Ghoting A., 2011, Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD' 11, P334
[15]  
Graf F, 2011, PROCEEDINGS OF THE 2
[16]  
Graf F, 2011, LECT NOTES COMPUT SC, V6892, P607, DOI 10.1007/978-3-642-23629-7_74
[17]  
Hailemariam G., 2012, P INT C MAN EM DIG E, P183, DOI DOI 10.1145/2457276.2457310
[18]  
Hall M., 2009, SIGKDD Explorations, V11, P10, DOI DOI 10.1145/1656274.1656278
[19]  
JCublas, 2013, JAVA BINDINGS FOR CU
[20]  
JCuda, 2013, JCUDA JAVA BINDINGS