"WarfarinSeer": a predictive tool based on SMOTE random forest to improve warfarin dose prediction in Chinese patients

被引:0
|
作者
Tao, Yanyun [1 ]
Zhang, Yuzhen [2 ]
机构
[1] Soochow Univ, Inst Intelligence Struct & Syst, Suzhou 215137, Peoples R China
[2] Soochow Univ, Affiliated Hosp 1, Cardiol, Suzhou 215006, Peoples R China
来源
PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM) | 2018年
基金
中国国家自然科学基金;
关键词
warfarin dose prediction; machine learning; oversampling; SMOTE; random forest; CARTS;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Warfarin daily dosage prediction for a specific patient is difficult. To solve the data imbalance and improve the predictive accuracy on Warfarin daily dosage, we develop a dosage predictive tool called "WarfarinSeer", which is based on Synthetic Minority Oversampling Technique-Random Forest (SMOTE-RF) model. In STMOE-RF, STMOE is adopted to oversample the data, which have rare genotypes (i.e., *1/*3 and *3/*3 for CYP2C9, AG and GG for VKORCI), to produce new samples by using k-Nearest Neighbor. Random forest produces a group of trees by training them on minority and the masses as well as different combinations of features. It makes uses of correlation of tress to improve the generalization of ensemble model. In the experiment, six machine learning methods and three conventional Warfarin predictive models are as comparators to "WarfarinSeer". A dataset of 589 Han Chinese patients is collected from the data of The First Affiliated Hospital of Soochow University and an open source data of International Warfarin Pharmacogenetics Consortium (IWPC) for training and test. Results showed that "WarfarinSeer" present the highest accuracy on the prediction of warfarin dose in terms of R-squared (R-2) and mean squared error (mse).
引用
收藏
页码:1022 / 1026
页数:5
相关论文
共 27 条
  • [1] Evolutionary synthetic minority oversampling technique with random forest for warfarin dose prediction in Chinese patients
    Tao, Yanyun
    Wang, Kaixin
    Zhang, Yuzhen
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 2514 - 2520
  • [2] Evolutionary learning-based modeling for warfarin dose prediction in Chinese
    Tao, Yanyun
    Zhang, Yuzhen
    Jiang, Bin
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017, : 1380 - 1386
  • [3] An Ensemble Model With Clustering Assumption for Warfarin Dose Prediction in Chinese Patients
    Tao, Yanyun
    Chen, Yenming J.
    Xue, Ling
    Xie, Cheng
    Jiang, Bin
    Zhang, Yuzhen
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (06) : 2642 - 2654
  • [4] Swarm ANN/SVR-Based Modeling Method for Warfarin Dose Prediction in Chinese
    Tao, Yanyun
    Xiang, Dan
    Zhang, Yuzhen
    Jiang, Bin
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2017, PT II, 2017, 10386 : 351 - 358
  • [5] A Post-Hoc Interpretable Ensemble Model to Feature Effect Analysis in Warfarin Dose Prediction for Chinese Patients
    Zhang, Yuzhen
    Xie, Cheng
    Xue, Ling
    Tao, Yanyun
    Yue, Guoqi
    Jiang, Bin
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (02) : 840 - 851
  • [6] Prediction of Wind Turbine Blades Icing Based on MBK-SMOTE and Random Forest in Imbalanced Data Set
    Ge, Yangming
    Yue, Dong
    Chen, Lei
    2017 IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2017,
  • [7] Research on tool wear prediction based on the random forest optimized by NGO algorithm
    Cheng, Yaonan
    Zhou, Shilong
    Xue, Jing
    Lu, Mengda
    Gai, Xiaoyu
    Guan, Rui
    MACHINING SCIENCE AND TECHNOLOGY, 2024, 28 (04) : 523 - 546
  • [8] SMOTE-NC and gradient boosting imputation based random forest classifier for predicting severity level of covid-19 patients with blood samples
    Gok, Elif Ceren
    Olgun, Mehmet Onur
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (22) : 15693 - 15707
  • [9] Hybrid Prediction Model for Type 2 Diabetes and Hypertension Using DBSCAN-Based Outlier Detection, Synthetic Minority Over Sampling Technique (SMOTE), and Random Forest
    Ijaz, Muhammad Fazal
    Alfian, Ganjar
    Syafrudin, Muhammad
    Rhee, Jongtae
    APPLIED SCIENCES-BASEL, 2018, 8 (08):
  • [10] SMOTE-NC and gradient boosting imputation based random forest classifier for predicting severity level of covid-19 patients with blood samples
    Elif Ceren Gök
    Mehmet Onur Olgun
    Neural Computing and Applications, 2021, 33 : 15693 - 15707