Machine Learning and Data Mining Methods in Diabetes Research

被引:606
|
作者
Kavakiotis, Ioannis [1 ,2 ]
Tsave, Olga [3 ]
Salifoglou, Athanasios [3 ]
Maglaveras, Nicos [2 ,4 ]
Vlahavas, Ioannis [1 ]
Chouvarda, Ioanna [2 ,4 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 54124, Greece
[2] CERTH, Inst Appl Biosci, Thessaloniki, Greece
[3] Aristotle Univ Thessaloniki, Inorgan Chem Lab, Dept Chem Engn, Thessaloniki 54124, Greece
[4] Aristotle Univ Thessaloniki, Lab Comp & Med Informat, Sch Med, Thessaloniki 54124, Greece
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2017年 / 15卷
关键词
Machine learning; Data mining; Diabetes mellitus; Diabetic complications; Disease prediction models; Biomarker(s) identification; PREDICTIVE MODELS; RISK-ASSESSMENT; RETINOPATHY; MELLITUS; DISEASE; DIAGNOSIS; CLASSIFICATION; OPTIMIZATION; ASSOCIATION; EXTRACTION;
D O I
10.1016/j.csbj.2016.12.005
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM. (C) 2017 The Authors. Published by Elsevier B.V.
引用
收藏
页码:104 / 116
页数:13
相关论文
共 50 条
  • [41] Research on Radiation Damage Characteristics of Optical Fiber Materials Based on Data Mining and Machine Learning
    Li, Ang
    Wang, Tian-hui
    ADVANCED HYBRID INFORMATION PROCESSING, ADHIP 2019, PT I, 2019, 301 : 410 - 418
  • [42] Diabetes Detection by Data Mining Methods
    Ambikavathi, V.
    Arumugam, P.
    Jose, P.
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 133 (04) : 2087 - 2104
  • [43] A Syllabus on Data Mining and Machine Learning with Applications to Cybersecurity
    Epishkina, Anna
    Zapechnikov, Sergey
    2016 THIRD INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION PROCESSING, DATA MINING, AND WIRELESS COMMUNICATIONS (DIPDMWC), 2016, : 194 - 199
  • [44] Diabetes Detection by Data Mining Methods
    V. Ambikavathi
    P. Arumugam
    P. Jose
    Wireless Personal Communications, 2023, 133 : 2087 - 2104
  • [45] Predictive Machine Learning Approach for Complex Problem Solving Process Data Mining
    Pejic, Aleksandar
    Molcer, Piroska Stanic
    ACTA POLYTECHNICA HUNGARICA, 2021, 18 (01) : 45 - 63
  • [46] A data mining approach based on machine learning techniques to classify biological sequences
    Maddouri, M
    Elloumi, M
    KNOWLEDGE-BASED SYSTEMS, 2002, 15 (04) : 217 - 223
  • [47] Optimisation of Machine Learning Based Data Mining Methods for Network Intrusion Detection
    Li, Mingxiao
    Li, Ziqing
    Liu, Chenlong
    Chen, Wanqi
    Ma, Chaojie
    2024 6TH INTERNATIONAL CONFERENCE ON BIG-DATA SERVICE AND INTELLIGENT COMPUTATION, BDSIC 2024, 2024, : 17 - 25
  • [48] A Review: Machine Learning and Data Mining Approaches for Cardiovascular Disease Diagnosis and Prediction
    Rao G.S.
    Muneeswari G.
    EAI Endorsed Transactions on Pervasive Health and Technology, 2024, 10
  • [49] On Machine Learning with Imbalanced Data and Research Quality Evaluation Methodologies
    Lipitakis, Anastasia-Dimitra
    Lipitakis, Evangelia A. E. C.
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 1, 2014, : 451 - 457
  • [50] Fuzzy machine learning and data mining
    Huellermeier, Eyke
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (04) : 269 - 283