The effect of resampling techniques on the performances of machine learning clinical risk prediction models in the setting of severe class imbalance: development and internal validation in a retrospective cohort

被引：0

作者：

Ke, Janny Xue Chen ^{[1
,2
,3
]}

DhakshinaMurthy, Arunachalam ^{[4
]}

George, Ronald B. ^{[5
]}

Branco, Paula ^{[6
]}

机构：

[1] Department of Anesthesia, St. Paul’s Hospital, Providence Health Care, 1081 Burrard Street, Vancouver, V6Z1Y6, BC

[2] Department of Anesthesiology, Pharmacology and Therapeutics, University of British Columbia, Vancouver, BC

[3] Perioperative Medicine, Dalhousie University, Halifax, NS

[4] School of Computer Science, Carleton University, Ottawa, ON

[5] Mount Sinai Hospital, University of Toronto, Toronto, ON

[6] School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, ON

来源：

Discover Artificial Intelligence | 2024年 / 4卷 / 01期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Anesthesiology; Class imbalance; Machine learning; Predictive modeling; Resampling; Risk prediction;

D O I：

10.1007/s44163-024-00199-0

中图分类号：

学科分类号：

摘要：

Purpose: The availability of population datasets and machine learning techniques heralded a new era of sophisticated prediction models involving a large number of routinely collected variables. However, severe class imbalance in clinical datasets is a major challenge. The aim of this study is to investigate the impact of commonly-used resampling techniques in combination with commonly-used machine learning algorithms in a clinical dataset, to determine whether combination(s) of these approaches improve upon the original multivariable logistic regression with no resampling. Methods: We previously developed and internally validated a multivariable logistic regression 30-day mortality prediction model in 30,619 patients using preoperative and intraoperative features. Using the same dataset, we systematically evaluated and compared model performances after application of resampling techniques [random under-sampling, near miss under-sampling, random oversampling, and synthetic minority oversampling (SMOTE)] in combination with machine learning algorithms (logistic regression, elastic net, decision trees, random forest, and extreme gradient boosting). Results: We found that in the setting of severe class imbalance, the impact of resampling techniques on model performance varied by the machine learning algorithm and the evaluation metric. Existing resampling techniques did not meaningfully improve area under receiving operating curve (AUROC). The area under the precision recall curve (AUPRC) was only increased by random under-sampling and SMOTE for decision trees, and oversampling and SMOTE for extreme gradient boosting. Importantly, some combinations of algorithm and resampling technique decreased AUROC and AUPRC compared to no resampling. Conclusion: Existing resampling techniques had a variable impact on models, depending on the algorithms and the evaluation metrics. Future research is needed to improve predictive performances in the setting of severe class imbalance. © The Author(s) 2024.

引用

共 41 条

[1]

Nepogodiev D., Et al., Global burden of postoperative death, Lancet, 393, (2019)

[2]

Moonesinghe S.R., Mythen M.G., Das P., Rowan K.M., Grocott M.P.W., Risk stratification tools for predicting morbidity and mortality in adult patients undergoing major surgery: qualitative systematic review, Anesthesiology, 119, 4, pp. 959-981, (2013)

[3]

Wong D.J.N., Harris S., Sahni A., Bedford J.R., Cortes L., Shawyer R., Et al., Developing and validating subjective and objective risk-assessment measures for predicting mortality after major surgery: an international prospective cohort study, PLOS Med, 17, 10, (2020)

[4]

Sigakis M.J.G., Bittner E.A., Wanderer J.P., Validation of a risk stratification index and risk quantification index for predicting patient outcomesin-hospital mortality, 30-day mortality, 1-year mortality, and length-of-stay, Anesthesiol J Am Soc Anesthesiol, 119, 3, pp. 525-540, (2013)

[5]

Lee C.K., Hofer I., Gabel E., Baldi P., Cannesson M., Development and validation of a deep neural network model for prediction of postoperative in-hospital mortality, Anesthesiology, 129, 4, pp. 649-662, (2018)

[6]

Hill B.L., Brown R., Gabel E., Rakocz N., Lee C., Cannesson M., Et al., An automated machine learning-based model predicts postoperative mortality using readily-extractable preoperative electronic health record data, Br J Anaesth, 123, 6, pp. 877-886, (2019)

[7]

Fritz B.A., Cui Z., Zhang M., He Y., Chen Y., Kronzer A., Et al., Deep-learning model for predicting 30-day postoperative mortality, Br J Anaesth, 123, 5, pp. 688-695, (2019)

[8]

Ke J.X.C., McIsaac D.I., George R.B., Branco P., Cook E.F., Beattie W.S., Et al., Postoperative mortality risk prediction that incorporates intraoperative vital signs: development and internal validation in a historical cohort, Can J Anesth, (2022)

[9]

Kazemi P., Lau F., Simpao A.F., Williams R.J., Matava C., The state of adoption of anesthesia information management systems in Canadian academic anesthesia departments: A survey, Can J Anaesth J Can Anesth, (2021)

[10]

Megahed F.M., Chen Y.J., Megahed A., Ong Y., Altman N., Krzywinski M., The class imbalance problem, Nat Methods, 18, 11, pp. 1270-1272, (2021)

← 1 2 3 4 5 →