Document-level classification of CT pulmonary angiography reports based on an extension of the ConText algorithm

被引:81
作者
Chapman, Brian E. [1 ]
Lee, Sean [3 ]
Kang, Hyunseok Peter [2 ]
Chapman, Wendy W. [1 ]
机构
[1] Univ Calif San Diego, Div Biomed Informat, Dept Med, La Jolla, CA 92093 USA
[2] Stanford Univ, Sch Med, Biomed Informat Program, Stanford, CA 94305 USA
[3] Univ Pittsburgh, Sch Med, Pittsburgh, PA USA
关键词
Medical language processing; ConText; Pulmonary emboli; Computed tomography; CTPA; TEXT; PNEUMONIA; INFORMATION; RADIOLOGY; EVENTS; SYSTEM;
D O I
10.1016/j.jbi.2011.03.011
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper we describe an application called peFinder for document-level classification of CT pulmonary angiography reports. peFinder is based on a generalized version of the ConText algorithm, a simple text processing algorithm for identifying features in clinical report documents. peFinder was used to answer questions about the disease state (pulmonary emboli present or absent), the certainty state of the diagnosis (uncertainty present or absent), the temporal state of an identified pulmonary embolus (acute or chronic), and the technical quality state of the exam (diagnostic or not diagnostic). Gold standard answers for each question were determined from the consensus classifications of three human annotators. peFinder results were compared to naive Bayes' classifiers using unigrams and bigrams. The sensitivities (and positive predictive values) for peFinder were 0.98(0.83), 0.86(0.96), 0.94(0.93), and 0.60(0.90) for disease state, quality state, certainty state, and temporal state respectively, compared to 0.68(0.77), 0.67(0.87), 0.62(0.82), and 0.04(0.25) for the naive Bayes' classifier using unigrams, and 0.75(0.79), 0.52(0.69), 0.59(0.84), and 0.04(0.25) for the naive Bayes' classifier using bigrams. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:728 / 737
页数:10
相关论文
共 43 条
[1]   Computed Tomography Pulmonary Angiography: An Assessment of the Radiology Report [J].
Abujudeh, Hani H. ;
Kaewlai, Rathachai ;
Farsad, Khashayar ;
Orr, Esther ;
Gilman, Matthew ;
Shepard, Jo-Anne O. .
ACADEMIC RADIOLOGY, 2009, 16 (11) :1309-1315
[2]  
Aronis JM, 1999, J AM MED INFORM ASSN, P658
[3]  
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[4]   Malpractice issues in radiology - Pitfalls of the vague radiology report [J].
Berlin, L .
AMERICAN JOURNAL OF ROENTGENOLOGY, 2000, 174 (06) :1511-1518
[5]   Creating a text classifier to detect radiology reports describing mediastinal findings associated with inhalational anthrax and other disorders [J].
Chapman, WW ;
Cooper, GF ;
Hanbury, P ;
Chapman, BE ;
Harrison, LH ;
Wagner, MM .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2003, 10 (05) :494-503
[6]   A simple algorithm for identifying negated findings and diseases in discharge summaries [J].
Chapman, WW ;
Bridewell, W ;
Hanbury, P ;
Cooper, GF ;
Buchanan, BG .
JOURNAL OF BIOMEDICAL INFORMATICS, 2001, 34 (05) :301-310
[7]   Quantifying the characteristics of unambiguous chest radiography reports in the context of pneumonia [J].
Chapman, WW ;
Fiszman, M ;
Frederick, PR ;
Chapman, BE ;
Haug, PJ .
ACADEMIC RADIOLOGY, 2001, 8 (01) :57-66
[8]   Classifying free-text triage chief complaints into syndromic categories with natural language processing [J].
Chapman, WW ;
Christensen, LM ;
Wagner, MM ;
Haug, PJ ;
Ivanov, O ;
Dowling, JN ;
Olszewski, RT .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 33 (01) :31-40
[9]  
Christensen LM., 2002, P ACL 02 WORKSHOP NA, P29
[10]  
CHRISTENSEN LM, 2009, BIONLP 09, P19