Predicting decompression surgery by applying multimodal deep learning to patients' structured and unstructured health data

被引:14
作者
Jujjavarapu, Chethan [1 ]
Suri, Pradeep [2 ,3 ]
Pejaver, Vikas [4 ,5 ]
Friedly, Janna [2 ,3 ]
Gold, Laura S. [2 ,6 ]
Meier, Eric [2 ,7 ,8 ]
Cohen, Trevor [1 ]
Mooney, Sean D. [1 ]
Heagerty, Patrick J. [7 ,8 ]
Jarvik, Jeffrey G. [2 ,6 ,9 ,10 ]
机构
[1] Univ Washington, Sch Med, Dept Biomed Informat & Med Educ, Box 358047, Seattle, WA 98195 USA
[2] Univ Washington, Clin Learning Evidence & Res Ctr, 4333 Brooklyn Ave NE, Seattle, WA 98105 USA
[3] Univ Washington, Dept Rehabil Med, 1959 NE Pacific St, Seattle, WA 98195 USA
[4] Icahn Sch Med Mt Sinai, Inst Genom Hlth, New York, NY 10029 USA
[5] Icahn Sch Med Mt Sinai, Dept Genet & Genom Sci, New York, NY 10029 USA
[6] Univ Washington, Dept Radiol, 1959 NE Pacific St, Seattle, WA 98195 USA
[7] Univ Washington, Dept Biostat, Box 357232, Seattle, WA 98195 USA
[8] Univ Washington, Ctr Biomed Stat, Seattle, WA USA
[9] Univ Washington, Dept Neurol Surg, 1959 NE Pacific St, Seattle, WA 98195 USA
[10] Univ Washington, Dept Hlth Serv, Box 357660, Seattle, WA 98195 USA
基金
美国国家卫生研究院;
关键词
Lower back pain; Lumbar spinal stenosis; Lumbar disc herniation; Deep learning; Generalizability; Multimodal; Machine learning; Decompression surgery; Prediction; Classification; LOW-BACK-PAIN; LUMBAR DISC HERNIATION; NONOPERATIVE TREATMENT; SPINAL STENOSIS; CARE; TRAJECTORIES; DISABILITY; INJECTIONS; FEATURES; ADULTS;
D O I
10.1186/s12911-022-02096-x
中图分类号
R-058 [];
学科分类号
摘要
Background Low back pain (LBP) is a common condition made up of a variety of anatomic and clinical subtypes. Lumbar disc herniation (LDH) and lumbar spinal stenosis (LSS) are two subtypes highly associated with LBP. Patients with LDH/LSS are often started with non-surgical treatments and if those are not effective then go on to have decompression surgery. However, recommendation of surgery is complicated as the outcome may depend on the patient's health characteristics. We developed a deep learning (DL) model to predict decompression surgery for patients with LDH/LSS.Materials and method We used datasets of 8387 and 8620 patients from a prospective study that collected data from four healthcare systems to predict early (within 2 months) and late surgery (within 12 months after a 2 month gap), respectively. We developed a DL model to use patients' demographics, diagnosis and procedure codes, drug names, and diagnostic imaging reports to predict surgery. For each prediction task, we evaluated the model's performance using classical and generalizability evaluation. For classical evaluation, we split the data into training (80%) and testing (20%). For generalizability evaluation, we split the data based on the healthcare system. We used the area under the curve (AUC) to assess performance for each evaluation. We compared results to a benchmark model (i.e. LASSO logistic regression).Results For classical performance, the DL model outperformed the benchmark model for early surgery with an AUC of 0.725 compared to 0.597. For late surgery, the DL model outperformed the benchmark model with an AUC of 0.655 compared to 0.635. For generalizability performance, the DL model outperformed the benchmark model for early surgery. For late surgery, the benchmark model outperformed the DL model.Conclusions For early surgery, the DL model was preferred for classical and generalizability evaluation. However, for late surgery, the benchmark and DL model had comparable performance. Depending on the prediction task, the balance of performance may shift between DL and a conventional ML method. As a result, thorough assessment is needed to quantify the value of DL, a relatively computationally expensive, time-consuming and less interpretable method.
引用
收藏
页数:14
相关论文
共 69 条
[1]   Lumbar Disc Herniation [J].
Amin R.M. ;
Andrade N.S. ;
Neuman B.J. .
Current Reviews in Musculoskeletal Medicine, 2017, 10 (4) :507-516
[2]   Epidemiological features of chronic low-back pain [J].
Andersson, GBJ .
LANCET, 1999, 354 (9178) :581-585
[3]   Feasibility and Assessment of a Machine Learning-Based Predictive Model of Outcome After Lumbar Decompression Surgery [J].
Andre, Arthur ;
Peyrou, Bruno ;
Carpentier, Alexandre ;
Vignaux, Jean-Jacques .
GLOBAL SPINE JOURNAL, 2022, 12 (05) :894-908
[4]   Fostering reproducibility and generalizability in machine learning for clinical prediction modeling in spine surgery [J].
Azad, Tej D. ;
Ehresman, Jeff ;
Ahmed, Ali Karim ;
Staartjes, Victor E. ;
Lubelski, Daniel ;
Stienen, Martin N. ;
Veeravagu, Anand ;
Ratliff, John K. .
SPINE JOURNAL, 2021, 21 (10) :1610-1616
[5]   Radiology report annotation using intelligent word embeddings: Applied to multi-institutional chest CT cohort [J].
Banerjee, Imon ;
Chen, Matthew C. ;
Lungren, Matthew P. ;
Rubin, Daniel L. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 77 :11-20
[6]  
Bird Steven., 2009, Natural language processing with Python: analyzing text with the natural language toolkit
[7]   Predicting survival from microarray data -: a comparative study [J].
Bovelstad, H. M. ;
Nygard, S. ;
Storvold, H. L. ;
Aldrin, M. ;
Borgan, O. ;
Frigessi, A. ;
Lingjaerde, O. C. .
BIOINFORMATICS, 2007, 23 (16) :2080-2087
[8]   ICD-9-CM to ICD-10-CM Codes: What? Why? How? [J].
Cartwright, Donna J. .
ADVANCES IN WOUND CARE, 2013, 2 (10) :588-592
[9]   Deep learning and alternative learning strategies for retrospective real-world clinical data [J].
Chen, David ;
Liu, Sijia ;
Kingsbury, Paul ;
Sohn, Sunghwan ;
Storlie, Curtis B. ;
Habermann, Elizabeth B. ;
Naessens, James M. ;
Larson, David W. ;
Liu, Hongfang .
NPJ DIGITAL MEDICINE, 2019, 2 (1)
[10]   Trajectories and predictors of the long-term course of low back pain: cohort study with 5-year follow-up [J].
Chen, Ying ;
Campbell, Paul ;
Strauss, Victoria Y. ;
Foster, Nadine E. ;
Jordan, Kelvin P. ;
Dunn, Kate M. .
PAIN, 2018, 159 (02) :252-260