Applying Natural Language Processing Toolkits to Electronic Health Records - An Experience Report

被引:8
作者
Barrett, Neil [1 ]
Weber-Jahnke, Jens H. [1 ]
机构
[1] Univ Victoria, Dept Comp Sci, Victoria, BC V8W 3P6, Canada
来源
ADVANCES IN INFORMATION TECHNOLOGY AND COMMUNICATION IN HEALTH | 2009年 / 143卷
关键词
natural language processing; NLP; medical language processing; MLP; toolkits; i2b2;
D O I
10.3233/978-1-58603-979-0-441
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
A natural language challenge devised by Informatics for Integrating Biology and the Bedside (i2b2) was to analyze free-text health data to construct a multi-class, multi-label classification system focused on obesity and its comorbidities. This report presents a case study in which a natural language processing (NLP) toolkit, called NLTK, was used in the challenge. This report provides a brief review of NLP in the context of EHR applications, briefly surveys and contrasts some existing NLP toolkits, and reports on our experiences with the i2b2 case study. Our efforts uncovered issues including the lack of human annotated physician notes for use as NLP training data, differences between conventional free-text and medical notes, and potential hardware and software limitations affecting future projects.
引用
收藏
页码:441 / 446
页数:6
相关论文
共 4 条
[1]   Automated encoding of clinical documents based on natural language processing [J].
Friedman, C ;
Shagina, L ;
Lussier, Y ;
Hripcsak, G .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2004, 11 (05) :392-402
[2]  
Jurafsky D., 2009, Speech and Language Processing, DOI DOI 10.1162/JMLR.2003.3.4-5.993
[3]  
Nugues P.M., 2006, INTRO LANGUAGE PROCE
[4]  
O'Grady William., 2004, Contemporary Linguistic Analysis: An Introduction, V5th