Description of a Rule-based System for the i2b2 Challenge in Natural Language Processing for Clinical Data

Cited by: 17
Authors
Childs, Lois C. [1]
Enelow, Robert [2]
Simonsen, Lone [2]
Heintzelman, Norris H. [1]
Kowalski, Kimberly M. [1]
Taylor, Robert J. [2]
Affiliations
[1] Lockheed Martin Inc, Valley Forge, PA USA
[2] SAGE Analyt LLC, Bethesda, MD USA
Keywords
PATIENT SMOKING STATUS
DOI
10.1197/jamia.M3083
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The Obesity Challenge, sponsored by Informatics for Integrating Biology and the Bedside (i2b2), a National Center for Biomedical Computing, asked participants to build software systems that could "read" a patient's clinical discharge summary and replicate the judgments of physicians in evaluating the presence or absence of obesity and 15 comorbidities. The authors describe their methodology and discuss the results of applying Lockheed Martin's rule-based natural language processing (NLP) capability, ClinREAD. They tailored ClinREAD with medical domain expertise so that it assigned default judgments based on the most probable results as defined in the ground truth. The system then used rules to collect evidence similar to the evidence that the human judges likely relied upon, and applied a logic module to weigh the strength of all evidence collected to arrive at final judgments. The Challenge results suggest that rule-based systems guided by human medical expertise are capable of solving complex problems in machine processing of medical text.
Pages: 571-575
Page count: 5