Data mining tools

被引:13
作者
Bartschat, Andreas [1 ]
Reischl, Markus [1 ]
Mikut, Ralf [1 ]
机构
[1] Karlsruhe Inst Technol, Inst Automat & Appl Informat, D-76344 Eggenstein Leopoldshafen, Germany
关键词
big data; data analytics; data mining; software tools; KNOWLEDGE DISCOVERY; MACHINE; ANALYTICS; SOFTWARE;
D O I
10.1002/widm.1309
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The development and application of data mining algorithms requires the use of powerful software tools. As the number of available tools continues to grow, the choice of the most suitable tool becomes increasingly difficult. This paper attempts to support the decision-making process by discussing the historical development and presenting a range of existing state-of-the-art data mining and related tools. Furthermore, we propose criteria for the tool categorization based on different user groups, data structures, data mining tasks and methods, visualization and interaction styles, import and export options for data and models, platforms, and license policies. These criteria are then used to classify data mining tools into nine different types. The typical characteristics of these types are explained and a selection of the most important tools is categorized. This paper is organized as follows: the first section Historical Development and State-of-the-Art highlights the historical development of data mining software until present; the criteria to compare data mining software are explained in the second section Criteria for Comparing Data Mining Software. The last section Categorization of Data Mining Software into Different Types proposes a categorization of data mining software and introduces typical software tools for the different types. (C)0 2011 John Wiley & Sons, Inc.
引用
收藏
页数:14
相关论文
共 87 条
  • [1] Abadi M., 2015, TensorFlow: Large-scale machine learning on heterogeneous systems
  • [2] Experience, Results and Lessons Learned from Automated Driving on Germany's Highways
    Aeberhard, Michael
    Rauch, Sebastian
    Bahram, Mohammad
    Tanzmeister, Georg
    Thomas, Julian
    Pilat, Yves
    Homm, Florian
    Huber, Werner
    Kaempchen, Nico
    [J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2015, 7 (01) : 42 - 57
  • [3] KEEL: a software tool to assess evolutionary algorithms for data mining problems
    Alcala-Fdez, J.
    Sanchez, L.
    Garcia, S.
    del Jesus, M. J.
    Ventura, S.
    Garrell, J. M.
    Otero, J.
    Romero, C.
    Bacardit, J.
    Rivas, V. M.
    Fernandez, J. C.
    Herrera, F.
    [J]. SOFT COMPUTING, 2009, 13 (03) : 307 - 318
  • [4] Evaluation and comparison of open source software suites for data mining and knowledge discovery
    Altalhi, Abdulrahman H.
    Luna, J. M.
    Vallejo, M. A.
    Ventura, S.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 7 (03)
  • [5] Knowledge discovery standards
    Anand, Sarabjot Singh
    Grobelnik, Marko
    Herrmann, Frank
    Hornick, Mark
    Lingenfelder, Christoph
    Rooney, Niall
    Wettschereck, Dietrich
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2007, 27 (01) : 21 - 56
  • [6] [Anonymous], 2013, UNDEFINED DATA SURVE
  • [7] [Anonymous], 2016, 2015 DATA SCI SURVEY
  • [8] [Anonymous], G00171189 RAS
  • [9] [Anonymous], ARXIV14026013
  • [10] [Anonymous], 2000, J DATA WAREHOUSING