Prediction of Prolonged Length of Hospital Stay After Cancer Surgery Using Machine Learning on Electronic Health Records: Retrospective Cross-sectional Study

被引:14
作者
Jo, Yong-Yeon [1 ]
Han, JaiHong [2 ]
Park, Hyun Woo [1 ]
Jung, Hyojung [1 ]
Lee, Jae Dong [1 ]
Jung, Jipmin [3 ]
Cha, Hyo Soung [3 ]
Sohn, Dae Kyung [4 ]
Hwangbo, Yul [1 ]
机构
[1] Natl Canc Ctr, Healthcare AI Team, 323 Ilsan Ro, Goyang 10408, South Korea
[2] Natl Canc Ctr, Dept Surg, Goyang, South Korea
[3] Natl Canc Ctr, Canc Data Ctr, Natl Canc Control Inst, Goyang, South Korea
[4] Natl Canc Ctr, Ctr Colorectal Canc, Res Inst & Hosp, Goyang, South Korea
关键词
postoperative length of stay; cancer surgery; machine learning; electronic health records; OF-STAY; RISK-FACTORS; COMPLICATIONS; CARE; COST;
D O I
10.2196/23147
中图分类号
R-058 [];
学科分类号
摘要
Background: Postoperative length of stay is a key indicator in the management of medical resources and an indirect predictor of the incidence of surgical complications and the degree of recovery of the patient after cancer surgery. Recently, machine learning has been used to predict complex medical outcomes, such as prolonged length of hospital stay, using extensive medical information. Objective: The objective of this study was to develop a prediction model for prolonged length of stay after cancer surgery using a machine learning approach. Methods: In our retrospective study, electronic health records (EHRs) from 42,751 patients who underwent primary surgery for 17 types of cancer between January 1, 2000, and December 31, 2017, were sourced from a single cancer center. The EHRs included numerous variables such as surgical factors, cancer factors, underlying diseases, functional laboratory assessments, general assessments, medications, and social factors. To predict prolonged length of stay after cancer surgery, we employed extreme gradient boosting classifier, multilayer perceptron, and logistic regression models. Prolonged postoperative length of stay for cancer was defined as bed-days of the group of patients who accounted for the top 50% of the distribution of bed-days by cancer type. Results: In the prediction of prolonged length of stay after cancer surgery, extreme gradient boosting classifier models demonstrated excellent performance for kidney and bladder cancer surgeries (area under the receiver operating characteristic curve [AUC] >0.85). A moderate performance (AUC 0.70-0.85) was observed for stomach, breast, colon, thyroid, prostate, cervix uteri, corpus uteri, and oral cancers. For stomach, breast, colon, thyroid, and lung cancers, with more than 4000 cases each, the extreme gradient boosting classifier model showed slightly better performance than the logistic regression model, although the logistic regression model also performed adequately. We identified risk variables for the prediction of prolonged postoperative length of stay for each type of cancer, and the importance of the variables differed depending on the cancer type. After we added operative time to the models trained on preoperative factors, the models generally outperformed the corresponding models using only preoperative variables. Conclusions: A machine learning approach using EHRs may improve the prediction of prolonged length of hospital stay after primary cancer surgery. This algorithm may help to provide a more effective allocation of medical resources in cancer surgery.
引用
收藏
页数:10
相关论文
共 26 条
  • [1] Identifying modifiable and non-modifiable risk factors associated with prolonged length of stay after hysterectomy for uterine cancer
    Agrawal, Surbhi
    Chen, Ling
    Tergas, Ana I.
    Hou, June Y.
    St Clair, Caryn M.
    Ananth, Cande V.
    Hershman, Dawn L.
    Wright, Jason D.
    [J]. GYNECOLOGIC ONCOLOGY, 2018, 149 (03) : 545 - 553
  • [2] [Anonymous], 2002, American Joint Committee on Cancer (AJCC) cancer staging manual
  • [3] [Anonymous], 2015, AJCC Cancer Staging Manual
  • [4] The Korea Cancer Big Data Platform (K-CBP) for Cancer Research
    Cha, Hyo Soung
    Jung, Jip Min
    Shin, Seob Yoon
    Jang, Young Mi
    Park, Phillip
    Lee, Jae Wook
    Chung, Seung Hyun
    Choi, Kui Son
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2019, 16 (13)
  • [5] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [6] Risk Factors Associated With Perioperative Complications and Prolonged Length of Stay After Laparoscopic Adrenalectomy
    Chen, Yufei
    Scholten, Anouk
    Chomsky-Higgins, Kathryn
    Nwaogu, Iheoma
    Gosnell, Jessica E.
    Seib, Carolyn
    Shen, Wen T.
    Suh, Insoo
    Duh, Quan-Yang
    [J]. JAMA SURGERY, 2018, 153 (11) : 1036 - 1041
  • [7] Risk factors for prolonged length of stay after major elective surgery
    Collins, TC
    Daley, J
    Henderson, WH
    Khuri, SK
    [J]. ANNALS OF SURGERY, 1999, 230 (02) : 251 - 259
  • [8] Desai RJ, 2019, JAMA NETW OPEN, V2, DOI [10.1001/jamanetworkopen.2019.18962, 10.1001/jamanetworkopen.2019.10626]
  • [9] Association of Use of an Intravascular Microaxial Left Ventricular Assist Device vs Intra-aortic Balloon Pump With In-Hospital Mortality and Major Bleeding Among Patients With Acute Myocardial Infarction Complicated by Cardiogenic Shock
    Dhruva, Sanket S.
    Ross, Joseph S.
    Mortazavi, Bobak J.
    Hurley, Nathan C.
    Krumholz, Harlan M.
    Curtis, Jeptha P.
    Berkowitz, Alyssa
    Masoudi, Frederick A.
    Messenger, John C.
    Parzynski, Craig S.
    Ngufor, Che
    Girotra, Saket
    Amin, Amit P.
    Shah, Nilay D.
    Desai, Nihar R.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2020, 323 (08): : 734 - 745
  • [10] Gohil Rohit, 2014, Br J Med Med Res, V4, P481