Automatic Lung-RADS™ classification with a natural language processing system

被引:15
作者
Beyer, Sebastian E. [1 ]
McKee, Brady J. [1 ]
Regis, Shawn M. [2 ]
McKee, Andrea B. [2 ]
Flacke, Sebastian [1 ]
El Saadawi, Gilan [3 ]
Wald, Christoph [1 ]
机构
[1] Lahey Hosp & Med Ctr, Dept Radiol, 41 Mall Rd, Burlington, MA 01805 USA
[2] Lahey Hosp & Med Ctr, Dept Radiat Oncol, Burlington, MA USA
[3] MModal, CMIO, Imaging Solut, Pittsburgh, PA USA
关键词
CT lung screening (CTLS); Lung-RADS (TM) (LR); natural language processing (NLP); LUNG-CANCER; INFORMATION; IDENTIFICATION; QUALITY;
D O I
10.21037/jtd.2017.08.13
中图分类号
R56 [呼吸系及胸部疾病];
学科分类号
摘要
Background: Our aim was to train a natural language processing (NLP) algorithm to capture imaging characteristics of lung nodules reported in a structured CT report and suggest the applicable Lung-RADS (TM) (LR) category. Methods: Our study included structured, clinical reports of consecutive CT lung screening (CTLS) exams performed from 08/2014 to 08/2015 at an ACR accredited Lung Cancer Screening Center. All patients screened were at high-risk for lung cancer according to the NCCN Guidelines (R). All exams were interpreted by one of three radiologists credentialed to read CTLS exams using LR using a standard reporting template. Training and test sets consisted of consecutive exams. Lung screening exams were divided into two groups: three training sets (500, 120, and 383 reports each) and one final evaluation set (498 reports). NLP algorithm results were compared with the gold standard of LR category assigned by the radiologist. Results: The sensitivity/specificity of the NLP algorithm to correctly assign LR categories for suspicious nodules (LR 4) and positive nodules (LR 3/4) were 74.1%/98.6% and 75.0%/98.8% respectively. The majority of mismatches occurred in cases where pulmonary findings were present not currently addressed by LR. Misclassifications also resulted from the failure to identify exams as follow-up and the failure to completely characterize part-solid nodules. In a sub-group analysis among structured reports with standardized language, the sensitivity and specificity to detect LR 4 nodules were 87.0% and 99.5%, respectively. Conclusions: An NLP system can accurately suggest the appropriate LR category from CTLS exam findings when standardized reporting is used.
引用
收藏
页码:3114 / +
页数:11
相关论文
共 23 条
  • [1] Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening
    Aberle, Denise R.
    Adams, Amanda M.
    Berg, Christine D.
    Black, William C.
    Clapp, Jonathan D.
    Fagerstrom, Richard M.
    Gareen, Ilana F.
    Gatsonis, Constantine
    Marcus, Pamela M.
    Sicks, JoRean D.
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2011, 365 (05) : 395 - 409
  • [2] [Anonymous], 2015, HEALTHCARE DATA ANAL
  • [3] Natural Language Processing Technologies in Radiology Research and Clinical Applications
    Cai, Tianrun
    Giannopoulos, Andreas A.
    Yu, Sheng
    Kelil, Tatiana
    Ripley, Beth
    Kumamaru, Kanako K.
    Rybicki, Frank J.
    Mitsouras, Dimitrios
    [J]. RADIOGRAPHICS, 2016, 36 (01) : 176 - 191
  • [4] Systematic review: Impact of health information technology on quality, efficiency, and costs of medical care
    Chaudhry, Basit
    Wang, Jerome
    Wu, Shinyi
    Maglione, Margaret
    Mojica, Walter
    Roth, Elizabeth
    Morton, Sally C.
    Shekelle, Paul G.
    [J]. ANNALS OF INTERNAL MEDICINE, 2006, 144 (10) : 742 - 752
  • [5] Chen Elizabeth S, 2010, AMIA Annu Symp Proc, V2010, P101
  • [6] Discerning Tumor Status from Unstructured MRI Reports-Completeness of Information in Existing Reports and Utility of Automated Natural Language Processing
    Cheng, Lionel T. E.
    Zheng, Jiaping
    Savova, Guergana K.
    Erickson, Bradley J.
    [J]. JOURNAL OF DIGITAL IMAGING, 2010, 23 (02) : 119 - 132
  • [7] Automated Identification of Patients With Pulmonary Nodules in an Integrated Health System Using Administrative Health Plan Data, Radiology Reports, and Natural Language Processing
    Danforth, Kim N.
    Early, Megan I.
    Ngan, Sharon
    Kosco, Anne E.
    Zheng, Chengyi
    Gould, Michael K.
    [J]. JOURNAL OF THORACIC ONCOLOGY, 2012, 7 (08) : 1257 - 1262
  • [8] Ferrucci D., 2004, Natural Language Engineering, V10, P327, DOI 10.1017/S1351324904003523
  • [9] A GENERAL NATURAL-LANGUAGE TEXT PROCESSOR FOR CLINICAL RADIOLOGY
    FRIEDMAN, C
    ALDERSON, PO
    AUSTIN, JHM
    CIMINO, JJ
    JOHNSON, SB
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1994, 1 (02) : 161 - 174
  • [10] Jain NL, 1997, J AM MED INFORM ASSN, P829