Adaptations of data mining methodologies: A systematic literature review

被引:0
|
作者
Plotnikova V. [1 ]
Dumas M. [1 ]
Milani F. [1 ]
机构
[1] Institute of Computer Science, University of Tartu, Tartu
来源
Plotnikova, Veronika (veronika.plotnikova@ut.ee) | 1600年 / PeerJ Inc.卷 / 06期
关键词
CRISP-DM; Data mining; Data mining methodology; Literature review;
D O I
10.7717/PEERJ-CS.267
中图分类号
学科分类号
摘要
The use of end-to-end data mining methodologies such as CRISP-DM, KDD process, and SEMMA has grown substantially over the past decade. However, little is known as to how these methodologies are used in practice. In particular, the question of whether data mining methodologies are used 'as-is' or adapted for specific purposes, has not been thoroughly investigated. This article addresses this gap via a systematic literature review focused on the context in which data mining methodologies are used and the adaptations they undergo. The literature review covers 207 peerreviewed and 'grey' publications. We find that data mining methodologies are primarily applied 'as-is'. At the same time, we also identify various adaptations of data mining methodologies and we note that their number is growing rapidly. The dominant adaptations pattern is related to methodology adjustments at a granular level (modifications) followed by extensions of existing methodologies with additional elements. Further, we identify two recurrent purposes for adaptation: (1) adaptations to handle Big Data technologies, tools and environments (technological adaptations); and (2) adaptations for context-awareness and for integrating data mining solutions into business processes and IT systems (organizational adaptations). The study suggests that standard data mining methodologies do not pay sufficient attention to deployment issues, which play a prominent role when turning data mining models into software products that are integrated into the IT architectures and business processes of organizations. We conclude that refinements of existing methodologies aimed at combining data, technological, and organizational aspects, could help to mitigate these gaps. © 2020 Plotnikova et al.
引用
收藏
页码:1 / 43
页数:42
相关论文
共 50 条
  • [1] Adaptations of data mining methodologies: a systematic literature review
    Plotnikova, Veronika
    Dumas, Marlon
    Milani, Fredrik
    PEERJ COMPUTER SCIENCE, 2020,
  • [2] Data Mining Methodologies in the Banking Domain: A Systematic Literature Review
    Plotnikova, Veronika
    Dumas, Marlon
    Milani, Fredrik P.
    PERSPECTIVES IN BUSINESS INFORMATICS RESEARCH, BIR 2019, 2019, 365 : 104 - 118
  • [3] Regression Method in Data Mining: A Systematic Literature Review
    Sebt, Mohammad Vahid
    Sadati-Keneti, Yaser
    Rahbari, Misagh
    Gholipour, Zohreh
    Mehri, Hamid
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2024, 31 (06) : 3515 - 3534
  • [4] A Systematic Literature Review of Data Mining Applications in Healthcare
    Niaksu, Olegas
    Skinulyte, Jolita
    Duhaze, Hermine Grubinger
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013 WORKSHOPS, 2014, 8182 : 313 - 324
  • [5] Applications, Methodologies, and Technologies for Linked Open Data: A Systematic Literature Review
    Avila-Garzon, Cecilia
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2020, 16 (03) : 53 - 69
  • [6] Applying Data Mining in Graduates? Employability: A Systematic Literature Review
    Mpia, Heritier Nsenge
    Mburu, Lucy Waruguru
    Mwendia, Simon Nyaga
    INTERNATIONAL JOURNAL OF ENGINEERING PEDAGOGY, 2023, 13 (02): : 86 - 108
  • [7] Prescriptive Analytics-based Methodologies for Healthcare Data: A Systematic Literature Review
    Mendoza-Olguin, Gustavo Emilio
    Somodevilla-Garcia, Maria Josefa
    Perez-de-Celis, Concepcion
    Chavarri-Guerra, Yanin
    COMPUTACION Y SISTEMAS, 2024, 28 (04): : 2369 - 2383
  • [9] Systematic Review of Methodologies in Data Science
    Ruiz-Lopez, Francisco
    Perez-Ortega, Joaquin
    Ortiz-Hernandez, Javier
    Hernandez-Perez, Yasmin
    Saenz-Sanchez, Socorro
    2021 MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE (ENC 2021), 2021,
  • [10] Scanning the horizon: a systematic literature review of methodologies
    Hines, Philip
    Yu, Li Hiu
    Guy, Richard H.
    Brand, Angela
    Papaluca-Amati, Marisa
    BMJ OPEN, 2019, 9 (05):