Tracking medical students' clinical experiences using natural language processing
Cited by: 20
Authors:
Denny, Joshua C. [1,2]
Bastarache, Lisa [2]
Sastre, Elizabeth Ann [1]
Spickard, Anderson, III [1,2]
Affiliations:
[1] Vanderbilt Univ, Med Ctr, Dept Med, Div Gen Internal Med, Nashville, TN 37235 USA
[2] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, Eskind Biomed Lib, Nashville, TN 37232 USA
Natural language processing;
Medical education;
Concept identification;
Education portfolios;
Competency assessment;
Experience tracking;
UMLS;
PRIMARY-CARE;
SCHOOL CURRICULUM;
RADIOLOGY REPORTS;
EDUCATION;
PERFORMANCE;
DOCUMENTS;
CLERKSHIP;
RECORD;
DOI:
10.1016/j.jbi.2009.02.004
Chinese Library Classification:
TP39 [Computer applications];
Discipline classification codes:
081203 ;
0835 ;
Abstract:
Graduate medical students must demonstrate competency in clinical skills. Current tracking methods rely on either manual effort or simple electronic entry to record clinical experience. We evaluated automated methods to locate 10 institution-defined core clinical problems in three medical students' clinical notes (n = 290). Each note was processed with section header identification algorithms and the KnowledgeMap concept identifier to locate Unified Medical Language System (UMLS) concepts. The best-performing automated search strategies accurately classified documents containing primary discussions of the core clinical problems, with an area under the receiver operating characteristic curve of 0.90-0.94. Recall and precision for UMLS concept identification were 0.91 and 0.92, respectively. Of the individual note sections, concepts found within the chief complaint, history of present illness, and assessment and plan were the strongest predictors of relevance. This automated method of tracking can provide detailed, pertinent reports of clinical experience without requiring additional work from medical trainees. The coupling of section header identification and concept identification holds promise for other natural language processing tasks, such as clinical research or phenotype identification. (C) 2009 Elsevier Inc. All rights reserved.
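The abstract reports three evaluation metrics: recall, precision, and area under the ROC curve. As a rough illustration of how such figures are typically computed, and not the authors' actual evaluation code, the sketch below uses hypothetical function names, placeholder concept identifiers, and made-up scores. It assumes concept identification is scored against a gold-standard set of concepts, and that each note receives a numeric relevance score for a given core clinical problem.

```python
def precision_recall(predicted, gold):
    """Precision and recall of predicted concept IDs against a gold-standard set."""
    predicted, gold = set(predicted), set(gold)
    true_pos = len(predicted & gold)
    precision = true_pos / len(predicted) if predicted else 0.0
    recall = true_pos / len(gold) if gold else 0.0
    return precision, recall


def roc_auc(scores, labels):
    """AUC as the probability that a relevant note outscores a non-relevant one.

    Ties between a relevant and a non-relevant note count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one relevant and one non-relevant note")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical data: placeholder concept IDs and invented relevance scores.
found = ["concept_a", "concept_b", "concept_c", "concept_d"]
gold = ["concept_a", "concept_b", "concept_c", "concept_e"]
precision, recall = precision_recall(found, gold)

note_scores = [0.9, 0.8, 0.3, 0.1]   # one score per note
note_labels = [1, 0, 1, 0]           # 1 = note primarily discusses the problem
auc = roc_auc(note_scores, note_labels)
```

The pairwise AUC here is quadratic in the number of notes, which is acceptable for a collection of a few hundred documents; a rank-based computation would be preferable for larger sets.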