Development of a natural language processing algorithm for the detection of spinal metastasis based on magnetic resonance imaging reports

Times cited: 0
Authors
Mostafa, Evan [1]
Hui, Aaron [2]
Aasman, Boudewijn [2]
Chowdary, Kamlesh [2]
Mani, Kyle [2]
Mardakhaev, Edward [2]
Zampolin, Richard [2]
Blumfield, Einat [2]
Berman, Jesse [2]
Ramos, Rafael De La Garza [3]
Fourman, Mitchell [1]
Yassari, Reza [3]
Eleswarapu, Ananth [1]
Mirhaji, Parsa [2]
Affiliations
[1] Montefiore Med Ctr, Dept Orthopaed Surg, 111 E 210th St, Bronx, NY 10467 USA
[2] Albert Einstein Coll Med, 1300 Morris Pk Ave, Bronx, NY 10461 USA
[3] Montefiore Med Ctr, Dept Neurol Surg, 111 E 210th St, Bronx, NY 10467 USA
Source
NORTH AMERICAN SPINE SOCIETY JOURNAL | 2024 / Volume 19
Keywords
Spine; Metastatic; Cancer; MRI; Processing; Algorithm; Language;
DOI
10.1016/j.xnsj.2024.100513
Chinese Library Classification (CLC) number
R74 [Neurology and Psychiatry];
Subject classification number
Abstract
Background: Metastasis to the spinal column is a common complication of malignancy and can cause pain and neurologic injury. An automated system that identifies and refers patients with spinal metastases can help overcome barriers to timely treatment. We describe the training, optimization, and validation of a natural language processing (NLP) algorithm that identifies vertebral metastasis and metastatic epidural cord compression (MECC) from radiology reports of spinal MRIs.
Methods: Reports from patients with spine MRI studies performed between January 1, 2008 and April 14, 2019 were reviewed by a team of radiologists to assess for the presence of cancer and to generate a labeled dataset for model training. Impression sections were extracted from the reports using regular expressions, converted to lower case, and stripped of all nonalphabetic characters. The resulting text was tokenized and vectorized using the doc2vec algorithm, and the vectors were used to train a neural network to predict the likelihood of spinal tumor or MECC. For each report, the model output a score from 0 to 1 based on its impression section. We then obtained 111 MRI reports from outside the training set, 92 manually labeled negative and 19 labeled with MECC, to test the model's performance.
Results: A total of 37,579 radiology reports were reviewed, of which 36,676 were labeled negative and 903 were labeled with MECC. We chose a score cutoff of 0.02 for a positive result to optimize for a low false negative rate. At this threshold, the model achieved 100% sensitivity with a false positive rate of 2.2%.
Conclusions: The NLP model described predicts the presence of spinal tumor and MECC in spine MRI reports with high accuracy. We plan to implement the algorithm in our EMR to allow faster referral of these patients to appropriate specialists, with the aim of reducing morbidity and increasing survival.
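The text-cleaning and thresholding steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The "IMPRESSION:" heading pattern, the function names, the choice to replace nonalphabetic characters with spaces (so that words stay separable), and the inclusive ">=" comparison at the cutoff are all assumptions made for illustration. The doc2vec vectorization and the neural network are omitted; scores are taken as given.

```python
import re

# Hypothetical pattern: assumes each report introduces its impression with an
# "IMPRESSION:" heading and that the section runs to the end of the report.
IMPRESSION_RE = re.compile(r"impression\s*:(.*)", re.IGNORECASE | re.DOTALL)


def extract_impression(report):
    """Return the text after the impression heading, or '' if none is found."""
    match = IMPRESSION_RE.search(report)
    return match.group(1) if match else ""


def normalize(text):
    """Lower-case the text, drop nonalphabetic characters, split into tokens.

    Assumption: nonalphabetic characters are replaced by spaces rather than
    deleted outright, so adjacent words are not fused together.
    """
    letters_only = re.sub(r"[^a-z\s]", " ", text.lower())
    return letters_only.split()


def flag_positive(score, cutoff=0.02):
    """Flag a report as positive when its model score reaches the cutoff."""
    return score >= cutoff


def sensitivity_and_fpr(scores, labels, cutoff=0.02):
    """Compute sensitivity and false positive rate at a given score cutoff.

    `labels` holds True for reports labeled positive, False for negative.
    """
    tp = fn = fp = tn = 0
    for score, label in zip(scores, labels):
        predicted = flag_positive(score, cutoff)
        if label and predicted:
            tp += 1
        elif label:
            fn += 1
        elif predicted:
            fp += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return sensitivity, fpr
```

As a worked check of the arithmetic, 2 false positives among 92 negative reports would give 2/92, or about 2.2%, which is consistent with the reported false positive rate, although the abstract does not state the underlying counts.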
Pages: 6
Related papers
17 in total
  • [1] Suspected malignant cord compression - improving time to diagnosis via a 'hotline': a prospective audit
    Allan, L.
    Baker, L.
    Dewar, J.
    Eljamel, S.
    Grant, R. M.
    Houston, J. G.
    McLeay, T.
    Munro, A. J.
    Levack, P.
    [J]. BRITISH JOURNAL OF CANCER, 2009, 100 (12) : 1867 - 1872
  • [2] A systematic review of natural language processing applied to radiology reports
    Casey, Arlene
    Davidson, Emma
    Poon, Michael
    Dong, Hang
    Duma, Daniel
    Grivas, Andreas
    Grover, Claire
    Suarez-Paniagua, Victor
    Tobin, Richard
    Whiteley, William
    Wu, Honghan
    Alex, Beatrice
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [3] Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports
    Dewaswala, Nakeya
    Chen, David
    Bhopalwala, Huzefa
    Kaggal, Vinod C.
    Murphy, Sean P.
    Bos, J. Martijn
    Geske, Jeffrey B.
    Gersh, Bernard J.
    Ommen, Steve R.
    Araoz, Philip A.
    Ackerman, Michael J.
    Arruda-Olson, Adelaide M.
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [4] Durán JM, 2021, J MED ETHICS, V47, P329, DOI [10.1136/medethics-2020-106820, 10.1136/medethics-2021-107531]
  • [5] Facilitators and barriers to implementing clinical care pathways
    Evans-Lacko, Sara
    Jarrett, Manuela
    McCrone, Paul
    Thornicroft, Graham
    [J]. BMC HEALTH SERVICES RESEARCH, 2010, 10
  • [6] Natural language processing for automated quantification of bone metastases reported in free-text bone scintigraphy reports
    Groot, Olivier Q.
    Bongers, Michiel E. R.
    Karhade, Aditya V.
    Kapoor, Neal D.
    Fenn, Brian P.
    Kim, Jason
    Verlaan, J. J.
    Schwab, Joseph H.
    [J]. ACTA ONCOLOGICA, 2020, 59 (12) : 1455 - 1460
  • [7] Analysis of factors delaying the surgical treatment of patients with neurological deficits in the course of spinal metastatic disease
    Guzik, Grzegorz
    [J]. BMC PALLIATIVE CARE, 2018, 17
  • [8] Natural Language Processing in Spine Surgery: A Systematic Review of Applications, Bias, and Reporting Transparency
    Huang, Bonnie B.
    Huang, Jonathan
    Swong, Kevin N.
    [J]. WORLD NEUROSURGERY, 2022, 167 : 156 - +
  • [9] Hui AT, 2021, Ethics Biol, Eng Med, V12
  • [10] Natural language processing and machine learning algorithm to identify brain MRI reports with acute ischemic stroke
    Kim, Chulho
    Zhu, Vivienne
    Obeid, Jihad
    Lenert, Leslie
    [J]. PLOS ONE, 2019, 14 (02):