Evaluating temporal relations in clinical text: 2012 i2b2 Challenge

被引:259
作者
Sun, Weiyi [1 ]
Rumshisky, Anna [2 ]
Uzuner, Ozlem [3 ]
机构
[1] SUNY Albany, Dept Informat, Albany, NY 12222 USA
[2] Univ Massachusetts, Dept Comp Sci, Lowell, MA USA
[3] SUNY Albany, Dept Informat Studies, Albany, NY 12222 USA
关键词
clinical language processing; sharedtask challenges; temporal reasoning; natural language processing; medical language processing; SYSTEM; INFORMATION; EXTRACTION; UMLS;
D O I
10.1136/amiajnl-2013-001628
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background The Sixth Informatics for Integrating Biology and the Bedside (i2b2) Natural Language Processing Challenge for Clinical Records focused on the temporal relations in clinical narratives. The organizers provided the research community with a corpus of discharge summaries annotated with temporal information, to be used for the development and evaluation of temporal reasoning systems. 18 teams from around the world participated in the challenge. During the workshop, participating teams presented comprehensive reviews and analysis of their systems, and outlined future research directions suggested by the challenge contributions. Methods The challenge evaluated systems on the information extraction tasks that targeted: (1) clinically significant events, including both clinical concepts such as problems, tests, treatments, and clinical departments, and events relevant to the patient's clinical timeline, such as admissions, transfers between departments, etc; (2) temporal expressions, referring to the dates, times, durations, or frequencies phrases in the clinical text. The values of the extracted temporal expressions had to be normalized to an ISO specification standard; and (3) temporal relations, between the clinical events and temporal expressions. Participants determined pairs of events and temporal expressions that exhibited a temporal relation, and identified the temporal relation between them. Results For event detection, statistical machine learning (ML) methods consistently showed superior performance. While ML and rule based methods seemed to detect temporal expressions equally well, the best systems overwhelmingly adopted a rule based approach for value normalization. For temporal relation classification, the systems using hybrid approaches that combined ML and heuristics based methods produced the best results.
引用
收藏
页码:806 / 813
页数:8
相关论文
共 44 条
[1]  
[Anonymous], 2010, Proceedings of the 5th International Workshop on Semantic Evaluation
[2]  
[Anonymous], 2010, Proceedings of the 5th International Workshop on Semantic Evaluation
[3]  
[Anonymous], TIMEML ANNOTATION GU
[4]  
[Anonymous], 86012004 ISO
[5]  
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[6]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[7]  
Brown P. F., 1992, Computational Linguistics, V18, P467
[8]  
Chang AX, 2012, LANGUAGE RESOURCES E
[9]  
Chang Y-C, 2013, J BIOMEDICA IN PRESS
[10]   A la Recherche du Temps Perdu: extracting temporal relations from medical text in the 2012 i2b2 NLP challenge [J].
Cherry, Colin ;
Zhu, Xiaodan ;
Martin, Joel ;
de Bruijn, Berry .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (05) :843-848