Identification of risk factors for patients with diabetes: diabetic polyneuropathy case study

被引:11
作者
Metsker, Oleg [1 ]
Magoev, Kirill [2 ,3 ]
Yakovlev, Alexey [1 ,2 ]
Yanishevskiy, Stanislav [1 ]
Kopanitsa, Georgy [2 ]
Kovalchuk, Sergey [2 ]
Krzhizhanovskaya, Valeria V. [2 ,3 ]
机构
[1] Almazov Natl Med Res Ctr, St Petersburg, Russia
[2] ITMO Univ, Birzhevaya 4, St Petersburg, Russia
[3] Univ Amsterdam, Amsterdam, Netherlands
基金
俄罗斯科学基金会;
关键词
Polyneuropathy; Machine learning; Risk factors; Clinical decision support; PERIPHERAL NEUROPATHY; VALIDATION;
D O I
10.1186/s12911-020-01215-w
中图分类号
R-058 [];
学科分类号
摘要
Background Methods of data mining and analytics can be efficiently applied in medicine to develop models that use patient-specific data to predict the development of diabetic polyneuropathy. However, there is room for improvement in the accuracy of predictive models. Existing studies of diabetes polyneuropathy considered a limited number of predictors in one study to enable a comparison of efficiency of different machine learning methods with different predictors to find the most efficient one. The purpose of this study is the implementation of machine learning methods for identifying the risk of diabetes polyneuropathy based on structured electronic medical records collected in databases of medical information systems. Methods For the purposes of our study, we developed a structured procedure for predictive modelling, which includes data extraction and preprocessing, model adjustment and performance assessment, selection of the best models and interpretation of results. The dataset contained a total number of 238,590 laboratory records. Each record 27 laboratory tests, age, gender and presence of retinopathy or nephropathy). The records included information about 5846 patients with diabetes. Diagnosis served as a source of information about the target class values for classification. Results It was discovered that inclusion of two expressions, namely "nephropathy" and "retinopathy" allows to increase the performance, achieving up to 79.82% precision, 81.52% recall, 80.64% F1 score, 82.61% accuracy, and 89.88% AUC using the neural network classifier. Additionally, different models showed different results in terms of interpretation significance: random forest confirmed that the most important risk factor for polyneuropathy is the increased neutrophil level, meaning the presence of inflammation in the body. Linear models showed linear dependencies of the presence of polyneuropathy on blood glucose levels, which is confirmed by the clinical interpretation of the importance of blood glucose control. Conclusion Depending on whether one needs to identify pathophysiological mechanisms for one's prospective study or identify early or late predictors, the choice of model will vary. In comparison with the previous studies, our research makes a comprehensive comparison of different decisions using a large and well-structured dataset applied to different decision support tasks.
引用
收藏
页数:15
相关论文
共 35 条
  • [1] Treating Pain in Diabetic Neuropathy: Current and Developmental Drugs
    Alam, Uazman
    Sloan, Gordon
    Tesfaye, Solomon
    [J]. DRUGS, 2020, 80 (04) : 363 - 384
  • [2] [Anonymous], 2017, ENCY MACHINE LEARNIN, P65
  • [3] IntelliHealth: A medical decision support application using a novel weighted multi-layer classifier ensemble framework
    Bashir, Saba
    Qamar, Usman
    Khan, Farhan Hassan
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 59 : 185 - 200
  • [4] The mean platelet volume in subjects with impaired fasting glucose
    Coban, E
    Bostan, F
    Ozdogan, M
    [J]. PLATELETS, 2006, 17 (01) : 67 - 69
  • [5] Dagliati Arianna, 2018, J Diabetes Sci Technol, V12, P295, DOI 10.1177/1932296817706375
  • [6] The relationship between glycemic control and platelet activity in type 2 diabetes mellitus
    Demirtunc, Refik
    Duman, Dursun
    Basar, Melih
    Bilgi, Mustafa
    Teomete, Mehmet
    Garip, Tayfun
    [J]. JOURNAL OF DIABETES AND ITS COMPLICATIONS, 2009, 23 (02) : 89 - 94
  • [7] Fitri Aida, 2019, Open Access Maced J Med Sci, V7, P2626, DOI 10.3889/oamjms.2019.454
  • [8] Furnkranz J., 2017, Encyclopedia of machine learning and data mining, P330
  • [9] Understanding Diabetic Neuropathy-From Subclinical Nerve Lesions to Severe Nerve Fiber Deficits: A Cross-Sectional Study in Patients With Type 2 Diabetes and Healthy Control Subjects
    Groener, Jan B.
    Jende, Johann M. E.
    Kurz, Felix T.
    Kender, Zoltan
    Treede, Rolf-Detlef
    Schuh-Hofer, Sigrid
    Nawroth, Peter P.
    Bendszus, Martin
    Kopf, Stefan
    [J]. DIABETES, 2020, 69 (03) : 436 - 447
  • [10] An interpretable rule-based diagnostic classification of diabetic nephropathy among type 2 diabetes patients
    Huang, Guan-Mau
    Huang, Kai-Yao
    Lee, Tzong-Yi
    Weng, Julia Tzu-Ya
    [J]. BMC BIOINFORMATICS, 2015, 16