Machine Learning and Data Mining Methods in Diabetes Research

被引:606
|
作者
Kavakiotis, Ioannis [1 ,2 ]
Tsave, Olga [3 ]
Salifoglou, Athanasios [3 ]
Maglaveras, Nicos [2 ,4 ]
Vlahavas, Ioannis [1 ]
Chouvarda, Ioanna [2 ,4 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 54124, Greece
[2] CERTH, Inst Appl Biosci, Thessaloniki, Greece
[3] Aristotle Univ Thessaloniki, Inorgan Chem Lab, Dept Chem Engn, Thessaloniki 54124, Greece
[4] Aristotle Univ Thessaloniki, Lab Comp & Med Informat, Sch Med, Thessaloniki 54124, Greece
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2017年 / 15卷
关键词
Machine learning; Data mining; Diabetes mellitus; Diabetic complications; Disease prediction models; Biomarker(s) identification; PREDICTIVE MODELS; RISK-ASSESSMENT; RETINOPATHY; MELLITUS; DISEASE; DIAGNOSIS; CLASSIFICATION; OPTIMIZATION; ASSOCIATION; EXTRACTION;
D O I
10.1016/j.csbj.2016.12.005
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM. (C) 2017 The Authors. Published by Elsevier B.V.
引用
收藏
页码:104 / 116
页数:13
相关论文
共 50 条
  • [31] A survey on data mining and machine learning techniques for diagnosing hepatitis disease
    Tasneem, Tabeen
    Kabir, Mir Md. Jahangir
    Xu, Shuxiang
    Tasneem, Tazeen
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2023, 41 (04) : 340 - 375
  • [32] Machine Learning Methods for BIM Data
    Slusarczyk, Grazyna
    Strug, Barbara
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT II, 2022, 13758 : 230 - 240
  • [33] Applications of data mining and machine learning framework in aquaculture and fisheries: A review
    Gladju, J.
    Kamalam, Biju Sam
    Kanagaraj, A.
    SMART AGRICULTURAL TECHNOLOGY, 2022, 2
  • [34] A Data Mining Framework for Glaucoma Decision Support Based on Optic Nerve Image Analysis Using Machine Learning Methods
    Abidi S.S.R.
    Roy P.C.
    Shah M.S.
    Yu J.
    Yan S.
    Journal of Healthcare Informatics Research, 2018, 2 (4) : 370 - 401
  • [35] DIAGNOSIS OF DIABETES MELLITUS USING STATISTICAL METHODS AND MACHINE LEARNING ALGORITHMS
    Pekel, Ebru
    Ozcan, Tuncay
    SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2018, 36 (04): : 1263 - 1280
  • [36] Machine Learning Refutes Loss of Smell as a Risk Indicator of Diabetes Mellitus
    Loetsch, Joern
    Haehner, Antje
    Schwarz, Peter E. H.
    Tselmin, Sergey
    Hummel, Thomas
    JOURNAL OF CLINICAL MEDICINE, 2021, 10 (21)
  • [37] A review and analysis on data mining methods to predict diabetes
    Ladha, Girdhar Gopal
    Pippal, Ravi Kumar Singh
    2017 7TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT), 2017, : 334 - 337
  • [38] Artificial intelligence and machine learning in diabetes research
    Phong Nguyen
    Ohnmacht, Alexander J.
    Galhoz, Ana
    Buettner, Maren
    Theis, Fabian
    Menden, Michael P.
    DIABETOLOGE, 2021, 17 (08): : 788 - 798
  • [39] An Overview of Recent Machine Learning Strategies in Data Mining
    Battula, Bhanu Prakash
    Prasad, R. Satya
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (03) : 50 - 54
  • [40] Editorial: Machine Learning and Data Mining in Materials Science
    Huber, Norbert
    Kalidindi, Surya R.
    Klusemann, Benjamin
    Cyron, Christian J.
    FRONTIERS IN MATERIALS, 2020, 7