Natural language processing for automated quantification of bone metastases reported in free-text bone scintigraphy reports

被引:18
作者
Groot, Olivier Q. [1 ,2 ]
Bongers, Michiel E. R. [1 ]
Karhade, Aditya V. [1 ]
Kapoor, Neal D. [1 ]
Fenn, Brian P. [1 ]
Kim, Jason [1 ]
Verlaan, J. J. [2 ]
Schwab, Joseph H. [1 ]
机构
[1] Harvard Med Sch, Massachusetts Gen Hosp, Orthopaed Oncol Serv, Dept Orthopaed Surg, 55 Fruit St, Boston, MA 02114 USA
[2] Univ Utrecht, Univ Med Ctr Utrecht, Dept Orthopaed Surg, Utrecht, Netherlands
基金
美国国家卫生研究院;
关键词
D O I
10.1080/0284186X.2020.1819563
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background The widespread use of electronic patient-generated health data has led to unprecedented opportunities for automated extraction of clinical features from free-text medical notes. However, processing this rich resource of data for clinical and research purposes, depends on labor-intensive and potentially error-prone manual review. The aim of this study was to develop a natural language processing (NLP) algorithm for binary classification (single metastasis versus two or more metastases) in bone scintigraphy reports of patients undergoing surgery for bone metastases. Material and methods Bone scintigraphy reports of patients undergoing surgery for bone metastases were labeled each by three independent reviewers using a binary classification (single metastasis versus two or more metastases) to establish a ground truth. A stratified 80:20 split was used to develop and test an extreme-gradient boosting supervised machine learning NLP algorithm. Results A total of 704 free-text bone scintigraphy reports from 704 patients were included in this study and 617 (88%) had multiple bone metastases. In the independent test set (n = 141) not used for model development, the NLP algorithm achieved an 0.97 AUC-ROC (95% confidence interval [CI], 0.92-0.99) for classification of multiple bone metastases and an 0.99 AUC-PRC (95% CI, 0.99-0.99). At a threshold of 0.90, NLP algorithm correctly identified multiple bone metastases in 117 of the 124 who had multiple bone metastases in the testing cohort (sensitivity 0.94) and yielded 3 false positives (specificity 0.82). At the same threshold, the NLP algorithm had a positive predictive value of 0.97 and F1-score of 0.96. Conclusions NLP has the potential to automate clinical data extraction from free text radiology notes in orthopedics, thereby optimizing the speed, accuracy, and consistency of clinical chart review. Pending external validation, the NLP algorithm developed in this study may be implemented as a means to aid researchers in tackling large amounts of data.
引用
收藏
页码:1455 / 1460
页数:6
相关论文
共 38 条
  • [11] Using Natural Language Processing to Improve Efficiency of Manual Chart Abstraction in Research: The Case of Breast Cancer Recurrence
    Carrell, David S.
    Halgrim, Scott
    Diem-Thy Tran
    Buist, Diana S. M.
    Chubak, Jessica
    Chapman, Wendy W.
    Savova, Guergana
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2014, 179 (06) : 749 - 758
  • [12] Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports
    Chen, Po-Hao
    Zafar, Hanna
    Galperin-Aizenberg, Maya
    Cook, Tessa
    [J]. JOURNAL OF DIGITAL IMAGING, 2018, 31 (02) : 178 - 184
  • [13] Chen W, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE RCAR), P22, DOI 10.1109/RCAR.2016.7783995
  • [14] Collins GS, 2015, ANN INTERN MED, V162, P55, DOI [10.1016/j.eururo.2014.11.025, 10.7326/M14-0697, 10.1016/j.jclinepi.2014.11.010, 10.1002/bjs.9736, 10.1038/bjc.2014.639, 10.1186/s12916-014-0241-z, 10.1136/bmj.g7594, 10.7326/M14-0698]
  • [15] Interrater Reliability and Accuracy of Clinicians and Trained Research Assistants Performing Prospective Data Collection in Emergency Department Patients With Potential Acute Coronary Syndrome
    Cruz, Carlos O.
    Meshberg, Emily B.
    Shofer, Frances S.
    McCusker, Christine M.
    Chang, Anna Marie
    Hollander, Judd E.
    [J]. ANNALS OF EMERGENCY MEDICINE, 2009, 54 (01) : 1 - 7
  • [16] Automated Extraction of Grade, Stage, and Quality Information From Transurethral Resection of Bladder Tumor Pathology Reports Using Natural Language Processing
    Glaser, Alexander P.
    Jordan, Brian J.
    Cohen, Jason
    Desai, Anuj
    Silberman, Philip
    Meeks, Joshua J.
    [J]. JCO CLINICAL CANCER INFORMATICS, 2018, 2 : 1 - 8
  • [17] High Risk of Symptomatic Venous Thromboembolism After Surgery for Spine Metastatic Bone Lesions: A Retrospective Study
    Groot, Olivier Q.
    Ogink, Paul T.
    Pereira, Nuno Rei Paulino
    Ferrone, Marco L.
    Harris, Mitchell B.
    Lozano-Calderon, Santiago A.
    Schoenfeld, Andrew J.
    Schwab, Joseph H.
    [J]. CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2019, 477 (07) : 1674 - 1686
  • [18] High Risk of Venous Thromboembolism After Surgery for Long Bone Metastases: A Retrospective Study of 682 Patients
    Groot, Olivier Q.
    Ogink, Paul T.
    Janssen, Stein J.
    Pereira, Nuno Rui Paulino
    Lozano-Calderon, Santiago
    Raskin, Kevin
    Hornicek, Francis
    Schwab, Joseph H.
    [J]. CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2018, 476 (10) : 2052 - 2061
  • [19] Complications After Surgical Management of Proximal Femoral Metastasis: A Retrospective Study of 417 Patients
    Janssen, Stein J.
    Kortlever, Joost T. P.
    Ready, John E.
    Raskin, Kevin A.
    Ferrone, Marco L.
    Hornicek, Francis J.
    Lozano-Calderon, Santiago A.
    Schwab, Joseph H.
    [J]. JOURNAL OF THE AMERICAN ACADEMY OF ORTHOPAEDIC SURGEONS, 2016, 24 (07) : 483 - 494
  • [20] External validation of the SORG 90-day and 1-year machine learning algorithms for survival in spinal metastatic disease
    Karhade, Aditya, V
    Ahmed, Ali K.
    Pennington, Zach
    Chara, Alejandro
    Schilling, Andrew
    Thio, Quirina C. B. S.
    Ogink, Paul T.
    Sciubba, Daniel M.
    Schwab, Joseph H.
    [J]. SPINE JOURNAL, 2020, 20 (01) : 14 - 21