Potential value and impact of data mining and machine learning in clinical diagnostics

被引:66
作者
Saberi-Karimian, Maryam [1 ,2 ]
Khorasanchi, Zahra [3 ]
Ghazizadeh, Hamideh [1 ,2 ]
Tayefi, Maryam [4 ]
Saffar, Sara [1 ]
Ferns, Gordon A. [5 ]
Ghayour-Mobarhan, Majid [1 ]
机构
[1] Mashhad Univ Med Sci, Int UNESCO Ctr Hlth Related Basic Sci & Human Nut, Mashhad, Razavi Khorasan, Iran
[2] Mashhad Univ Med Sci, Student Res Comm, Mashhad, Razavi Khorasan, Iran
[3] Mashhad Univ Med Sci, Fac Med, Dept Nutr, Mashhad, Razavi Khorasan, Iran
[4] Univ Hosp North Norway, Norwegian Ctr Hlth Res, Tromso, Norway
[5] Brighton & Sussex Med Sch, Div Med Educ, Falmer, England
关键词
Data mining; machine learning; decision tree; REFERENCE INTERVALS; LOGISTIC-REGRESSION; PREDICTION MODEL; RISK-FACTORS; CLASSIFICATION; BIOMARKERS; MEDICINE; CARE;
D O I
10.1080/10408363.2020.1857681
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
Data mining involves the use of mathematical sciences, statistics, artificial intelligence, and machine learning to determine the relationships between variables from a large sample of data. It has previously been shown that data mining can improve the prediction and diagnostic precision of type 2 diabetes mellitus. A few studies have applied machine learning to assess hypertension and metabolic syndrome-related biomarkers, as well as refine the assessment of cardiovascular disease risk. Machine learning methods have also been applied to assess new biomarkers and survival outcomes in patients with renal diseases to predict the development of chronic kidney disease, disease progression, and renal graft survival. In the latter, random forest methods were found to be the best for the prediction of chronic kidney disease. Some studies have investigated the prognosis of nonalcoholic fatty liver disease and acute liver failure, as well as therapy response prediction in patients with viral disorders, using decision tree models. Machine learning techniques, such as Sparse High-Order Interaction Model with Rejection Option, have been used for diagnosing Alzheimer's disease. Data mining techniques have also been applied to identify the risk factors for serious mental illness, such as depression and dementia, and help to diagnose and predict the quality of life of such patients. In relation to child health, some studies have determined the best algorithms for predicting obesity and malnutrition. Machine learning has determined the important risk factors for preterm birth and low birth weight. Published studies of patients with cancer and bacterial diseases are limited and should perhaps be addressed more comprehensively in future studies. Herein, we provide an in-depth review of studies in which biochemical biomarker data were analyzed using machine learning methods to assess the risk of several common diseases, in order to summarize the potential applications of data mining methods in clinical diagnosis. Data mining techniques have now been increasingly applied to clinical diagnostics, and they have the potential to support this field.
引用
收藏
页码:275 / 296
页数:22
相关论文
共 113 条
  • [1] Establishment of Reference Intervals for Alkaline Phosphatase in Pakistani Children Using a Data Mining Approach
    Ahmed, Sibtain
    Zierk, Jakob
    Khan, Aysha Habib
    [J]. LABORATORY MEDICINE, 2020, 51 (05) : 484 - 490
  • [2] Physical Activity Dimensions Associated with Impaired Glucose Metabolism
    Amadid, Hanan
    Johansen, Nanna B.
    Bjerregaard, Anne-Louise
    Vistisen, Dorte
    Faerch, Kristine
    Brage, Soren
    Lauritzen, Torsten
    Witte, Daniel R.
    Sandbaek, Annelli
    Jorgensen, Marit E.
    [J]. MEDICINE AND SCIENCE IN SPORTS AND EXERCISE, 2017, 49 (11) : 2176 - 2184
  • [3] Cardiovascular Event Prediction by Machine Learning The Multi-Ethnic Study of Atherosclerosis
    Ambale-Venkatesh, Bharath
    Yang, Xiaoying
    Wu, Colin O.
    Liu, Kiang
    Hundley, W. Gregory
    McClelland, Robyn
    Gomes, Antoinette S.
    Folsom, Aaron R.
    Shea, Steven
    Guallar, Eliseo
    Bluemke, David A.
    Lima, Joao A. C.
    [J]. CIRCULATION RESEARCH, 2017, 121 (09) : 1092 - +
  • [4] Aqlan F, 2017, P 67 ANN C EXP I IND
  • [5] Blood Glucose Level Prediction with Minimal Inputs Using Feedforward Neural Network for Diabetic Type 1 Patients
    Asad, Muhammad
    Qamar, Usman
    Zeb, Babar
    Khan, Aimal
    Khan, Younas
    [J]. ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 182 - 185
  • [6] Machine learning algorithms enhance the specificity of cancer biomarker detection using SERS-based immunoassays in microfluidic chips
    Banaei, Nariman
    Moshfegh, Javad
    Mohseni-Kabir, Arman
    Houghton, Jean Marie
    Sun, Yubing
    Kim, Byung
    [J]. RSC ADVANCES, 2019, 9 (04) : 1859 - 1868
  • [7] Quad-phased data mining modeling for dementia diagnosis
    Bang, Sunjoo
    Son, Sangjoon
    Roh, Hyunwoong
    Lee, Jihye
    Bae, Sungyun
    Lee, Kyungwon
    Hong, Changhyung
    Shin, Hyunjung
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2017, 17
  • [8] Characteristics Associated With Decreased or Increased Mortality Risk From Glycemic Therapy Among Patients With Type 2 Diabetes and High Cardiovascular Risk: Machine Learning Analysis of the ACCORD Trial
    Basu, Sanjay
    Raghavan, Sridharan
    Wexler, Deborah J.
    Berkowitz, Seth A.
    [J]. DIABETES CARE, 2018, 41 (03) : 604 - 612
  • [9] A systematic review of data mining and machine learning for air pollution epidemiology
    Bellinger, Colin
    Jabbar, Mohomed Shazan Mohomed
    Zaiane, Osmar
    Osornio-Vargas, Alvaro
    [J]. BMC PUBLIC HEALTH, 2017, 17
  • [10] Artificial Intelligence and Big Data in Public Health
    Benke, Kurt
    Benke, Geza
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2018, 15 (12)