Predicting Diabetes Mellitus With Machine Learning Techniques

Cited by: 367
Authors
Zou, Quan [1, 2]
Qu, Kaiyang [1]
Luo, Yamei [3]
Yin, Dehui [3]
Ju, Ying [4]
Tang, Hua [5]
Affiliations
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[2] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu, Sichuan, Peoples R China
[3] Southwest Med Univ, Sch Med Informat & Engn, Luzhou, Peoples R China
[4] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China
[5] Southwest Med Univ, Sch Basic Med, Dept Pathophysiol, Luzhou, Peoples R China
Keywords
diabetes mellitus; random forest; decision tree; neural network; machine learning; feature ranking; RANDOM FOREST; FEATURE-SELECTION; DIAGNOSIS; CLASSIFICATION; EXTRACTION; TOOL;
DOI
10.3389/fgene.2018.00515
Chinese Library Classification (CLC) number
Q3 [Genetics];
Discipline classification code
071007; 090102;
Abstract
Diabetes mellitus is a chronic disease characterized by hyperglycemia, and it can cause many complications. Given the rising morbidity of recent years, the number of diabetic patients worldwide is projected to reach 642 million by 2040, which means that one in ten adults will have diabetes. This alarming figure demands serious attention. With its rapid development, machine learning has been applied to many aspects of medicine and health care. In this study, we used decision tree, random forest, and neural network models to predict diabetes mellitus. The dataset consists of hospital physical examination data from Luzhou, China, and contains 14 attributes. Five-fold cross-validation was used to evaluate the models. To verify the general applicability of the methods, we selected the better-performing ones for independent test experiments. We randomly selected 68994 records each from healthy people and diabetic patients as the training set. Because the data were imbalanced, we repeated the random extraction five times and report the average of the five experiments. We used principal component analysis (PCA) and minimum redundancy maximum relevance (mRMR) to reduce the dimensionality. The results showed that random forest achieved the highest accuracy (ACC = 0.8084) when all the attributes were used.
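The evaluation protocol described in the abstract (balanced random extraction repeated five times, five-fold cross-validation, averaged accuracy) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' code: the data are synthetic, all function names are hypothetical, and a nearest-centroid classifier stands in for the decision tree, random forest, and neural network models used in the study. It also assumes that one class simply outnumbers the other and is undersampled to match.

```python
import random
from statistics import mean


def balanced_sample(larger, smaller, rng):
    """Undersample the larger class so both classes have equal size."""
    return rng.sample(larger, len(smaller)) + list(smaller)


def k_fold_indices(n, k, rng):
    """Shuffle the indices 0..n-1 and split them into k disjoint folds."""
    idx = list(range(n))
    rng.shuffle(idx)
    return [idx[i::k] for i in range(k)]


def fit_centroids(rows):
    """Stand-in model (assumption): one mean feature vector per class."""
    sums, counts = {}, {}
    for x, y in rows:
        acc = sums.setdefault(y, [0.0] * len(x))
        for j, v in enumerate(x):
            acc[j] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [s / counts[y] for s in acc] for y, acc in sums.items()}


def predict(centroids, x):
    """Assign x to the class whose centroid is closest."""
    def dist2(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda y: dist2(centroids[y]))


def cross_val_accuracy(rows, k, rng):
    """Mean accuracy over k folds, training on the other k-1 folds."""
    scores = []
    for fold in k_fold_indices(len(rows), k, rng):
        held_out = set(fold)
        train = [r for i, r in enumerate(rows) if i not in held_out]
        model = fit_centroids(train)
        correct = sum(predict(model, rows[i][0]) == rows[i][1] for i in fold)
        scores.append(correct / len(fold))
    return mean(scores)


def repeated_balanced_cv(larger, smaller, repeats=5, k=5, seed=0):
    """Average the k-fold accuracy over several balanced random draws."""
    rng = random.Random(seed)
    return mean(
        cross_val_accuracy(balanced_sample(larger, smaller, rng), k, rng)
        for _ in range(repeats)
    )


def make_rows(n, center, label, rng):
    """Synthetic records with 14 attributes (hypothetical data)."""
    return [([rng.gauss(center, 1.0) for _ in range(14)], label)
            for _ in range(n)]


data_rng = random.Random(42)
larger_class = make_rows(500, 0.0, 0, data_rng)
smaller_class = make_rows(100, 1.0, 1, data_rng)
acc = repeated_balanced_cv(larger_class, smaller_class)
print(f"mean accuracy over 5 balanced draws: {acc:.3f}")
```

Averaging over several balanced draws reduces the dependence of the reported accuracy on which records happen to be sampled from the larger class.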
Pages: 10
Related papers
50 in total
  • [31] An Analysis of various Machine Learning Techniques for Predicting Diabetes in its Early Stages
    Durga, P.
    Sudhakar, T.
    JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 2030 - 2038
  • [32] Predicting factors for survival of breast cancer patients using machine learning techniques
    Mogana Darshini Ganggayah
    Nur Aishah Taib
    Yip Cheng Har
    Pietro Lio
    Sarinder Kaur Dhillon
    BMC Medical Informatics and Decision Making, 19
  • [33] Predicting Employee Attrition using Machine Learning
    Alduayj, Sarah S.
    Rajpoot, Kashif
    PROCEEDINGS OF THE 2018 13TH INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY (IIT), 2018, : 93 - 98
  • [34] Metabolic Syndrome and Development of Diabetes Mellitus: Predictive Modeling Based on Machine Learning Techniques
    Perveen, Sajida
    Shahbaz, Muhammad
    Keshavjee, Karim
    Guergachi, Aziz
    IEEE ACCESS, 2019, 7 : 1365 - 1375
  • [35] Predicting Diabetes Mellitus Using Machine Learning and Optical Character Recognition
    Silva, W. A. J. R.
    Shirantha, H. M. K.
    Balalla, L. J. M. V. N.
    Ranasinghe, R. A. D. V. K.
    Kuruwitaarachchi, N.
    Kasthurirathna, D.
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [36] Evaluation of predisposing factors of Diabetes Mellitus post Gestational Diabetes Mellitus using Machine Learning Techniques
    Krishnan, Devi R.
    Menakath, Gayathri P.
    Radhakrishnan, Anagha
    Himavarshini, Yarrangangu
    Aparna, A.
    Mukundan, Kaveri
    Pathinarupothi, Rahul Krishnan
    Alangot, Bithin
    Mahankali, Sirisha
    Maddipati, Chakravarthy
    2019 17TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2019, : 81 - 85
  • [37] Assessing Advanced Machine Learning Techniques for Predicting Hospital Readmission
    Alajmani, Samah
    Jambi, Kamal
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (02) : 377 - 384
  • [38] Predicting human liver microsomal stability with machine learning techniques
    Sakiyama, Yojiro
    Yuki, Hitomi
    Moriya, Takashi
    Hattori, Kazunari
    Suzuki, Misaki
    Shimada, Kaoru
    Honma, Teruki
    JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2008, 26 (06) : 907 - 915
  • [39] Predicting Heating Load in Energy-Efficient Buildings Through Machine Learning Techniques
    Moayedi, Hossein
    Dieu Tien Bui
    Dounis, Anastasios
    Lyu, Zongjie
    Foong, Loke Kok
    APPLIED SCIENCES-BASEL, 2019, 9 (20):
  • [40] Predicting Breast Cancer Leveraging Supervised Machine Learning Techniques
    Aamir, Sanam
    Rahim, Aqsa
    Aamir, Zain
    Abbasi, Saadullah Farooq
    Khan, Muhammad Shahbaz
    Alhaisoni, Majed
    Khan, Muhammad Attique
    Khan, Khyber
    Ahmad, Jawad
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022