Assessment of Deep Natural Language Processing in Ascertaining Oncologic Outcomes From Radiology Reports

Cited by: 110

Authors:

Kehl, Kenneth L. [1,2,3]

Elmarakeby, Haitham [3]

Nishino, Mizuki [4]

Van Allen, Eliezer M. [3]

Lepisto, Eva M. [1,3,5]

Hassett, Michael J. [1,3]

Johnson, Bruce E. [2,3]

Schrag, Deborah [1,3]

Affiliations:

[1] Dana Farber Canc Inst, Div Populat Sci, 450 Brookline Ave, Boston, MA 02115 USA

[2] Dana Farber Canc Inst, Thorac Oncol Program, Boston, MA 02115 USA

[3] Dana Farber Canc Inst, Dept Med Oncol, Boston, MA 02115 USA

[4] Dana Farber Canc Inst, Dept Imaging, Boston, MA 02115 USA

[5] Dana Farber Canc Inst, Dept Informat, Boston, MA 02115 USA

Source:

JAMA ONCOLOGY | 2019 / Volume 5 / Issue 10

Keywords:

NEURAL-NETWORKS; LUNG-CANCER; ERA;

DOI:

10.1001/jamaoncol.2019.1800

Chinese Library Classification (CLC) number:

R73 [Oncology];

Discipline classification code:

100214;

Abstract:

Key Points

Question: Can deep natural language processing of radiologic reports be used to measure real-world oncologic outcomes, including disease progression and response to therapy?

Findings: In a cohort study of 2406 patients with lung cancer, the findings suggested that deep learning models may estimate human curations of the presence of active cancer, cancer worsening/progression, and cancer improvement/response in radiologic reports with good discrimination (area under the receiver operating characteristic curve, >0.90). Statistically significant associations between these end points and overall survival were observed.

Meaning: Deep natural language processing may be able to extract clinically relevant oncologic end points from radiologic reports.

Importance: A rapid learning health care system for oncology will require scalable methods for extracting clinical end points from electronic health records (EHRs). Outside of clinical trials, end points such as cancer progression and response are not routinely encoded into structured data.

Objective: To determine whether deep natural language processing can extract relevant cancer outcomes from radiologic reports, a ubiquitous but unstructured EHR data source.

Design, Setting, and Participants: A retrospective cohort study evaluated 1112 patients who underwent tumor genotyping for a diagnosis of lung cancer and participated in the Dana-Farber Cancer Institute PROFILE study from June 26, 2013, to July 2, 2018.

Exposures: Patients were divided into curation and reserve sets. Human abstractors applied a structured framework to radiologic reports for the curation set to ascertain the presence of cancer and changes in cancer status over time (ie, worsening/progressing vs improving/responding). Deep learning models were then trained to capture these outcomes from report text and subsequently evaluated in a 10% held-out test subset of curation patients. Cox proportional hazards regression models compared human and machine curations of disease-free survival, progression-free survival, and time to improvement/response in the curation set, and measured associations between report classification and overall survival in the curation and reserve sets.

Main Outcomes and Measures: The primary outcome was area under the receiver operating characteristic curve (AUC) for deep learning models; secondary outcomes were time to improvement/response, disease-free survival, progression-free survival, and overall survival.

Results: A total of 2406 patients were included (mean [SD] age, 66.5 [10.8] years; 1428 female [59.7%]; 2170 [90.2%] white). Radiologic reports (n = 14230) were manually reviewed for 1112 patients in the curation set. In the test subset (n = 109), deep learning models identified the presence of cancer, improvement/response, and worsening/progression with accurate discrimination (AUC >0.90). Machine and human curation yielded similar measurements of disease-free survival (hazard ratio [HR] for machine vs human curation, 1.18; 95% CI, 0.71-1.95); progression-free survival (HR, 1.11; 95% CI, 0.71-1.71); and time to improvement/response (HR, 1.03; 95% CI, 0.65-1.64). Among 15000 additional reports for 1294 reserve set patients, algorithm-detected cancer worsening/progression was associated with decreased overall survival (HR for mortality, 4.04; 95% CI, 2.78-5.85), and improvement/response was associated with increased overall survival (HR, 0.41; 95% CI, 0.22-0.77).

Conclusions and Relevance: Deep natural language processing appears to speed curation of relevant cancer outcomes and facilitate rapid learning from EHR data.

This cohort study examines the use of deep natural language processing in extraction of information on cancer outcomes from the medical records of patients with lung cancer.
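Illustrative note on the primary outcome: the abstract reports discrimination as AUC, which can be read as the probability that a randomly chosen positive report (for example, one a human curator labeled as showing progression) receives a higher model score than a randomly chosen negative report. The following is a minimal sketch of that pairwise definition, not the authors' code; the function name `roc_auc` and the toy labels and scores are hypothetical and are not data from the study.

```python
def roc_auc(labels, scores):
    """AUC via the pairwise (Mann-Whitney) definition.

    Returns the fraction of positive/negative pairs in which the positive
    example has the higher score; tied scores count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = curator marked the outcome present, 0 = absent.
labels = [1, 0, 1, 1, 0, 0]
scores = [0.92, 0.10, 0.75, 0.40, 0.35, 0.55]

auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, so the reported AUC >0.90 indicates that model scores ranked human-labeled positives above negatives in the large majority of pairs.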

Citation:

Pages: 1421-1429

Page count: 9
