Classification and prediction of spinal disease based on the SMOTE-RFE- XGBoost model

被引:10
|
作者
Zhang, Biao [1 ]
Dong, Xinyan [2 ]
Hu, Yuwei [2 ]
Jiang, Xuchu [2 ]
Li, Gongchi [3 ]
机构
[1] Liaocheng Univ, Sch Comp Sci, Liaocheng, Shandong, Peoples R China
[2] Zhongnan Univ Econ & Law, Sch Stat & Math, Wuhan, Hubei, Peoples R China
[3] Huazhong Univ Sci & Technol, Union Hosp, Tongji Med Coll, Wuhan, Hubei, Peoples R China
关键词
Spinal disorders; Feature selection; XGBoost; Machine learning; Classification prediction;
D O I
10.7717/peerj-cs.1280
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spinal diseases are killers that cause long-term disturbance to people with complex and diverse symptoms and may cause other conditions. At present, the diagnosis and treatment of the main diseases mainly depend on the professional level and clinical experience of doctors, which is a breakthrough problem in the field of medicine. This article proposes the SMOTE-RFE-XGBoost model, which takes the physical angle of human bone as the research index for feature selection and classification model construction to predict spinal diseases. The research process is as follows: two groups of people with normal and abnormal spine conditions are taken as the research objects of this article, and the synthetic minority oversampling technique (SMOTE) algorithm is used to address category imbalance. Three methods, least absolute shrinkage and selection operator (LASSO), tree-based feature selection, and recursive feature elimination (RFE), are used for feature selection. Logistic regression (LR), support vector machine (SVM), parsimonious Bayes, decision tree (DT), random forest (RF), gradient boosting tree (GBT), extreme gradient boosting (XGBoost), and ridge regression models are used to classify the samples, construct single classification models and combine classification models and rank the feature importance. According to the accuracy and mean square error (MSE) values, the SMOTE-RFE-XGBoost combined model has the best classification, with accuracy, MSE and F1 values of 97.56%, 0.1111 and 0.8696, respectively. The importance of four indicators, lumbar slippage, cervical tilt, pelvic radius and pelvic tilt, was higher.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Lithological Classification by Hyperspectral Images Based on a Two-Layer XGBoost Model, Combined with a Greedy Algorithm
    Lin, Nan
    Fu, Jiawei
    Jiang, Ranzhe
    Li, Genjun
    Yang, Qian
    REMOTE SENSING, 2023, 15 (15)
  • [32] A Novel PCA-Firefly Based XGBoost Classification Model for Intrusion Detection in Networks Using GPU
    Bhattacharya, Sweta
    Krishnan, Siva Rama S.
    Maddikunta, Praveen Kumar Reddy
    Kaluri, Rajesh
    Singh, Saurabh
    Gadekallu, Thippa Reddy
    Alazab, Mamoun
    Tariq, Usman
    ELECTRONICS, 2020, 9 (02)
  • [33] An hybrid soft attention based XGBoost model for classification of poikilocytosis blood cells
    Prasenjit Dhar
    K. Suganya Devi
    Satish Kumar Satti
    P. Srinivasan
    Evolving Systems, 2024, 15 : 523 - 539
  • [34] SMALF: miRNA-disease associations prediction based on stacked autoencoder and XGBoost
    Liu, Dayun
    Huang, Yibiao
    Nie, Wenjuan
    Zhang, Jiaxuan
    Deng, Lei
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [35] XGBoost Based Strategic Consumers Classification Model on E-commerce Platform
    Du, Mengjin
    Yu, Zhuchao
    Wang, Teng
    Wang, Xueying
    Jiang, Xihao
    2020 6TH INTERNATIONAL CONFERENCE ON E-BUSINESS AND APPLICATIONS (ICEBA 2020), 2020, : 48 - 53
  • [36] Metabolic syndrome prediction model using Bayesian optimization and XGBoost based on traditional Chinese medicine features
    Zheng, Jianhua
    Zhang, Zihao
    Wang, Jinhe
    Zhao, Ruolin
    Liu, Shuangyin
    Yang, Gaolin
    Liu, Zhengjie
    Deng, Zhengyuan
    HELIYON, 2023, 9 (12)
  • [37] An hybrid soft attention based XGBoost model for classification of poikilocytosis blood cells
    Dhar, Prasenjit
    Devi, K. Suganya
    Satti, Satish Kumar
    Srinivasan, P.
    EVOLVING SYSTEMS, 2024, 15 (02) : 523 - 539
  • [38] SMALF: miRNA-disease associations prediction based on stacked autoencoder and XGBoost
    Dayun Liu
    Yibiao Huang
    Wenjuan Nie
    Jiaxuan Zhang
    Lei Deng
    BMC Bioinformatics, 22
  • [39] Prediction model for missed abortion of patients treated with IVF-ET based on XGBoost: a retrospective study
    Yuan, Guanghui
    Lv, Bohan
    Du, Xin
    Zhang, Huimin
    Zhao, Mingzi
    Liu, Yingxue
    Hao, Cuifang
    PEERJ, 2023, 11 : 25 - 25
  • [40] XGBoost-Based Framework for Smoking-Induced Noncommunicable Disease Prediction
    Davagdorj, Khishigsuren
    Van Huy Pham
    Theera-Umpon, Nipon
    Ryu, Keun Ho
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (18) : 1 - 22