Predicting Diabetes Mellitus With Machine Learning Techniques

被引:367
|
作者
Zou, Quan [1 ,2 ]
Qu, Kaiyang [1 ]
Luo, Yamei [3 ]
Yin, Dehui [3 ]
Ju, Ying [4 ]
Tang, Hua [5 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[2] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu, Sichuan, Peoples R China
[3] Southwest Med Univ, Sch Med Informat & Engn, Luzhou, Peoples R China
[4] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China
[5] Southwest Med Univ, Sch Basic Med, Dept Pathophysiol, Luzhou, Peoples R China
关键词
diabetes mellitus; random forest; decision tree; neural network; machine learning; feature ranking; RANDOM FOREST; FEATURE-SELECTION; DIAGNOSIS; CLASSIFICATION; EXTRACTION; TOOL;
D O I
10.3389/fgene.2018.00515
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Diabetes mellitus is a chronic disease characterized by hyperglycemia. It may cause many complications. According to the growing morbidity in recent years, in 2040, the world's diabetic patients will reach 642 million, which means that one of the ten adults in the future is suffering from diabetes. There is no doubt that this alarming figure needs great attention. With the rapid development of machine learning, machine learning has been applied to many aspects of medical health. In this study, we used decision tree, random forest and neural network to predict diabetes mellitus. The dataset is the hospital physical examination data in Luzhou, China. It contains 14 attributes. In this study, five-fold cross validation was used to examine the models. In order to verity the universal applicability of the methods, we chose some methods that have the better performance to conduct independent test experiments. We randomly selected 68994 healthy people and diabetic patients' data, respectively as training set. Due to the data unbalance, we randomly extracted 5 times data. And the result is the average of these five experiments. In this study, we used principal component analysis (PCA) and minimum redundancy maximum relevance (mRMR) to reduce the dimensionality. The results showed that prediction with random forest could reach the highest accuracy (ACC = 0.8084) when all the attributes were used.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Predicting complications of diabetes mellitus using advanced machine learning algorithms
    Ljubic, Branimir
    Hai, Ameen Abdel
    Stanojevic, Marija
    Diaz, Wilson
    Polimac, Daniel
    Pavlovski, Martin
    Obradovic, Zoran
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (09) : 1343 - 1351
  • [22] A COMPREHENSIVE ANALYSIS OF MACHINE LEARNING TECHNIQUES FOR INCESSANT PREDICTION OF DIABETES MELLITUS
    Reddy, Shiva Shankar
    Sethi, Nilambar
    Rajender, R.
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2020, 13 (01): : 1 - 22
  • [23] Predicting Diabetes Mellitus With Machine Learning Techniques Using Multi-Criteria Decision Making
    Juneja, Abhinav
    Juneja, Sapna
    Kaur, Sehajpreet
    Kumar, Vivek
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2021, 11 (02) : 38 - 52
  • [24] Optimization of an Analysis Method for Diabetes Prediction Using Classical and Ensemble Machine Learning Techniques
    Naranjo, Edison
    Arguero, Berenice
    Hurtado, Remigio
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 3, 2024, 1013 : 527 - 536
  • [25] Diabetes prediction using machine learning and explainable AI techniques
    Tasin, Isfafuzzaman
    Nabil, Tansin Ullah
    Islam, Sanjida
    Khan, Riasat
    HEALTHCARE TECHNOLOGY LETTERS, 2023, 10 (1-2) : 1 - 10
  • [26] Machine Learning Techniques for Predicting Metamaterial Microwave Absorption Performance: A Comparison
    Jain, Prince
    Chhabra, Himanshu
    Chauhan, Urvashi
    Prakash, Krishna
    Samant, Piyush
    Singh, Dhiraj Kumar
    Soliman, Mohamed S.
    Islam, Mohammad Tariqul
    IEEE ACCESS, 2023, 11 : 128774 - 128783
  • [27] A Comparison of Feature Selection and Forecasting Machine Learning Algorithms for Predicting Glycaemia in Type 1 Diabetes Mellitus
    Rodriguez-Rodriguez, Ignacio
    Rodriguez, Jose-Victor
    Woo, Wai Lok
    Wei, Bo
    Pardo-Quiles, Domingo-Javier
    APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 20
  • [28] Diabetes Mellitus Affected Patients Classification and Diagnosis through Machine Learning Techniques
    Mercaldo, Francesco
    Nardone, Vittoria
    Santone, Antonella
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 : 2519 - 2528
  • [29] COMPARISON OF MACHINE LEARNING TECHNIQUES FOR PREDICTING NLR PROTEINS
    Nadia
    Gandotra, Ekta
    Kumar, Narendra
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2023, 35 (02):
  • [30] Predicting performance of swimmers using machine learning techniques
    Guerra-Salcedo, Cesar M.
    Janek, Libor
    Perez-Ortega, Joaquin
    Pazos-Rangel, Rodolfo A.
    WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 3, 2005, : 146 - 148