Classification and prediction of spinal disease based on the SMOTE-RFE- XGBoost model

被引:10
|
作者
Zhang, Biao [1 ]
Dong, Xinyan [2 ]
Hu, Yuwei [2 ]
Jiang, Xuchu [2 ]
Li, Gongchi [3 ]
机构
[1] Liaocheng Univ, Sch Comp Sci, Liaocheng, Shandong, Peoples R China
[2] Zhongnan Univ Econ & Law, Sch Stat & Math, Wuhan, Hubei, Peoples R China
[3] Huazhong Univ Sci & Technol, Union Hosp, Tongji Med Coll, Wuhan, Hubei, Peoples R China
关键词
Spinal disorders; Feature selection; XGBoost; Machine learning; Classification prediction;
D O I
10.7717/peerj-cs.1280
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spinal diseases are killers that cause long-term disturbance to people with complex and diverse symptoms and may cause other conditions. At present, the diagnosis and treatment of the main diseases mainly depend on the professional level and clinical experience of doctors, which is a breakthrough problem in the field of medicine. This article proposes the SMOTE-RFE-XGBoost model, which takes the physical angle of human bone as the research index for feature selection and classification model construction to predict spinal diseases. The research process is as follows: two groups of people with normal and abnormal spine conditions are taken as the research objects of this article, and the synthetic minority oversampling technique (SMOTE) algorithm is used to address category imbalance. Three methods, least absolute shrinkage and selection operator (LASSO), tree-based feature selection, and recursive feature elimination (RFE), are used for feature selection. Logistic regression (LR), support vector machine (SVM), parsimonious Bayes, decision tree (DT), random forest (RF), gradient boosting tree (GBT), extreme gradient boosting (XGBoost), and ridge regression models are used to classify the samples, construct single classification models and combine classification models and rank the feature importance. According to the accuracy and mean square error (MSE) values, the SMOTE-RFE-XGBoost combined model has the best classification, with accuracy, MSE and F1 values of 97.56%, 0.1111 and 0.8696, respectively. The importance of four indicators, lumbar slippage, cervical tilt, pelvic radius and pelvic tilt, was higher.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Prediction and analysis model for ground peak acceleration based on XGBoost and SHAP
    Qi W.
    Sun R.
    Zheng T.
    Qi J.
    Yantu Gongcheng Xuebao/Chinese Journal of Geotechnical Engineering, 2023, 45 (09): : 1934 - 1943
  • [42] An Interpretable Prediction Model for Acute Kidney Injury Based on XGBoost and SHAP
    LUO Yan
    WANG Cong
    YE Wenling
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (01) : 27 - 38
  • [43] Photovoltaic power prediction based on combined XGBoost-LSTM model
    Tan H.
    Yang Q.
    Xing J.
    Huang K.
    Zhao S.
    Hu H.
    Taiyangneng Xuebao/Acta Energiae Solaris Sinica, 2022, 43 (08): : 75 - 81
  • [44] Respiratory infectious disease prediction model integrating XGboost algorithm and knowledge graph
    Qiu, Youchun
    WIENER KLINISCHE WOCHENSCHRIFT, 2023, 135 : S815 - S815
  • [45] Hybrid XGBoost model with hyperparameter tuning for prediction of liver disease with better accuracy
    Dalal, Surjeet
    Onyema, Edeh Michael
    Malik, Amit
    WORLD JOURNAL OF GASTROENTEROLOGY, 2022, 28 (46) : 6551 - 6563
  • [46] Hybrid XGBoost model with hyperparameter tuning for prediction of liver disease with better accuracy
    Surjeet Dalal
    Edeh Michael Onyema
    Amit Malik
    World Journal of Gastroenterology, 2022, 28 (46) : 6551 - 6563
  • [47] Intelligent classification model for railway signal equipment fault based on SMOTE and ensemble learning
    Yang, Lianbao
    Li, Ping
    Xue, Rui
    Ma, Xiaoning
    Li, Xinqin
    Wang, Zhe
    2018 INTERNATIONAL JOINT CONFERENCE ON MATERIALS SCIENCE AND MECHANICAL ENGINEERING, 2018, 383
  • [48] An hybrid soft attention based XGBoost model for classification of poikilocytosis blood cells
    Prasenjit Dhar
    K. Suganya Devi
    Satish Kumar Satti
    P. Srinivasan
    Evolving Systems, 2024, 15 : 523 - 539
  • [49] SMALF: miRNA-disease associations prediction based on stacked autoencoder and XGBoost
    Liu, Dayun
    Huang, Yibiao
    Nie, Wenjuan
    Zhang, Jiaxuan
    Deng, Lei
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [50] XGBoost Based Strategic Consumers Classification Model on E-commerce Platform
    Du, Mengjin
    Yu, Zhuchao
    Wang, Teng
    Wang, Xueying
    Jiang, Xihao
    2020 6TH INTERNATIONAL CONFERENCE ON E-BUSINESS AND APPLICATIONS (ICEBA 2020), 2020, : 48 - 53