Algorithms and software for data mining and machine learning: a critical comparative view from a systematic review of the literature

被引:10
作者
Taranto-Vera, Gilda [1 ]
Galindo-Villardon, Purificacion [1 ]
Merchan-Sanchez-Jara, Javier [1 ]
Salazar-Pozo, Julio [1 ]
Moreno-Salazar, Alex [1 ,2 ]
Salazar-Villalva, Vanessa [1 ,2 ]
机构
[1] Univ Salamanca, Salamanca, Spain
[2] Escuela Super Politecn Litoral, Guayaquil, Ecuador
关键词
Data mining; Machine learning techniques; Algorithms; Systematic literature review; Software tools; Performance evaluation;
D O I
10.1007/s11227-021-03708-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today, a greater generation of information is produced as a consequence of the technological development of society. The Internet has facilitated the access and extraction of this information, thus pursuing the automatic discovery of the knowledge contained within. In this context, data mining aims to discover patterns, profiles and trends of a large volume of data, for which multiple learning techniques are available. The selection of which technique to use depends on the type of result desired to obtain and the data that are available, considering that the algorithms for these tasks date mostly from the early twentieth century and are now the basis of these new technologies. The aim of this study is to show the development of these techniques in the field of scientific research and to present the evolution of algorithms and software for data mining in recent years. To this end, the systematic literature review methodology was applied, as it is considered a systematic process that identifies, evaluates, and interprets the work of researchers in a chosen field. As a result, we present a comparative analysis of the most outstanding software: Alteryx, TIBCO Data Science, RapidMiner and WEKA, their capacities for data mining processes and a description of the algorithms and techniques of machine learning that are currently on the rise.
引用
收藏
页码:11481 / 11513
页数:33
相关论文
共 58 条
[1]  
Abd El-Jawad MH, 2018, INT COMPUT ENG CONF, P174, DOI 10.1109/ICENCO.2018.8636124
[2]   Evolutionary data mining and applications: A revision on the most cited papers from the last 10 years (2007-2017) [J].
Alcala, Rafael ;
Jose Gacto, Maria ;
Alcala-Fdez, Jesus .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (02)
[3]  
[Anonymous], 2014, RapidMiner Studio Manual
[4]  
Azevedo AnaIsabel Rojao Lourenco., 2008, KDD SEMMA CRISP DM P
[5]  
Babi C, MINING FREQUENT PATT
[6]  
Bengio Y., 2007, Large scale kernel machines
[7]  
Bermudez JAG, 2010, THESIS U TECNOLOGICA
[8]  
Blei D. M., 2006, Proceedings of the 23rd international conference on Machine learning, P113, DOI DOI 10.1145/1143844.1143859
[9]  
Bucheli H, 2014, INS AN 2014 C
[10]   Active deep Q-learning with demonstration [J].
Chen, Si-An ;
Tangkaratt, Voot ;
Lin, Hsuan-Tien ;
Sugiyama, Masashi .
MACHINE LEARNING, 2020, 109 (9-10) :1699-1725