Optimizing diabetes classification with a machine learning-based framework

Cited by: 5
Authors
Feng, Xin [1, 2, 3]
Cai, Yihuai [1]
Xin, Ruihao [4, 5, 6]
Affiliations
[1] Jilin Inst Chem Technol, Sch Sci, Jilin 130000, Peoples R China
[2] Jilin Univ, Coll Chem, State Key Lab Inorgan Synth & Preparat Chem, Changchun 130012, Peoples R China
[3] Jilin Univ, Sch Publ Hlth, Dept Epidemiol & Biostat, Changchun 130012, Peoples R China
[4] Jilin Inst Chem Technol, Coll Informat & Control Engn, Jilin 130000, Peoples R China
[5] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[6] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
Keywords
Diabetes diagnoses; Machine learning; GAN
DOI
10.1186/s12859-023-05467-x
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Background: Diabetes is a metabolic disorder usually caused by insufficient secretion of insulin from the pancreas or by insensitivity of cells to insulin, resulting in chronically elevated blood sugar levels. Patients usually present with frequent urination, thirst, and hunger. If left untreated, diabetes can lead to complications that affect essential organs and can endanger life. An intelligent diagnosis framework for diabetes is therefore needed.
Results: This paper proposes a machine learning-based diabetes classification framework, the machine learning optimized GAN. The framework combines several methods to address the data-quality problems encountered during analysis: a joint mean and median filling method for missing values, the cap method for outlier processing, and SMOTEENN to mitigate sample imbalance. It then applies the proposed Diabetes Classification Model based on Generative Adversarial Network and uses logistic regression for detailed feature analysis. The framework is evaluated on both the PIMA dataset and a diabetes dataset obtained from the GEO database. The model achieved a binary classification accuracy of 96.27%, a three-class (tertiary) classification accuracy of 99.31%, a precision and F1 score of 0.9698, a recall of 0.9698, and an AUC of 0.9702.
Conclusion: The experimental results show that the proposed framework can classify diabetes accurately and offers new ideas for the intelligent diagnosis of diabetes.
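The abstract names the preprocessing steps but does not specify them, so the following is only a minimal sketch of what mean/median filling and outlier capping could look like. The rule for choosing between mean and median (a simple skew check), the 1.5 x IQR fences, the function names, and the sample readings are all assumptions made for illustration, not the authors' method. The SMOTEENN resampling and GAN stages are not shown.

```python
import statistics


def fill_missing(values, skew_threshold=0.2):
    """Fill None entries with the column mean or median.

    Assumption (not from the paper): use the median when the column looks
    skewed, judged by |mean - median| / stdev, and the mean otherwise.
    """
    observed = [v for v in values if v is not None]
    mean = statistics.fmean(observed)
    median = statistics.median(observed)
    spread = statistics.pstdev(observed)
    skewed = spread > 0 and abs(mean - median) / spread > skew_threshold
    fill = median if skewed else mean
    return [fill if v is None else v for v in values]


def cap_outliers(values, k=1.5):
    """Clip values to the fences [Q1 - k*IQR, Q3 + k*IQR] (assumed rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [min(max(v, lower), upper) for v in values]


# Hypothetical glucose readings with one missing value and one extreme value.
glucose = [85, 90, None, 95, 100, 300]
filled = fill_missing(glucose)
capped = cap_outliers(filled)
print(filled)
print(capped)
```

In this sketch the extreme reading pulls the mean well above the median, so the missing entry is filled with the median, and the extreme reading is then clipped to the upper fence rather than dropped, which keeps the sample count unchanged before any resampling.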
Pages: 20
Journal
BMC Bioinformatics, 24