A la Recherche du Temps Perdu: extracting temporal relations from medical text in the 2012 i2b2 NLP challenge

被引:14
作者
Cherry, Colin [1 ]
Zhu, Xiaodan [1 ]
Martin, Joel [1 ]
de Bruijn, Berry [1 ]
机构
[1] Natl Res Council Canada, Ottawa, ON K1A 0R6, Canada
关键词
information extraction; temporal reasoning; natural language processing; relation extraction; clinical text;
D O I
10.1136/amiajnl-2013-001624
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective An analysis of the timing of events is critical for a deeper understanding of the course of events within a patient record. The 2012 i2b2 NLP challenge focused on the extraction of temporal relationships between concepts within textual hospital discharge summaries. Materials and methods The team from the National Research Council Canada (NRC) submitted three system runs to the second track of the challenge: typifying the time-relationship between pre-annotated entities. The NRC system was designed around four specialist modules containing statistical machine learning classifiers. Each specialist targeted distinct sets of relationships: local relationships, sectime'-type relationships, non-local overlap-type relationships, and non-local causal relationships. Results The best NRC submission achieved a precision of 0.7499, a recall of 0.6431, and an F1 score of 0.6924, resulting in a statistical tie for first place. Post hoc improvements led to a precision of 0.7537, a recall of 0.6455, and an F1 score of 0.6954, giving the highest scores reported on this task to date. Discussion and conclusions Methods for general relation extraction extended well to temporal relations, and gave top-ranked state-of-the-art results. Careful ordering of predictions within result sets proved critical to this success.
引用
收藏
页码:843 / 848
页数:6
相关论文
共 16 条
[1]  
Aho A. V., 1972, SIAM Journal on Computing, V1, P131, DOI 10.1137/0201008
[2]  
[Anonymous], 2006, PROC 5 INT C LANGUAG
[3]  
[Anonymous], P M ASS COMP LING AC
[4]   An overview of MetaMap: historical perspective and recent advances [J].
Aronson, Alan R. ;
Lang, Francois-Michel .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (03) :229-236
[5]  
Berger AL, 1996, COMPUT LINGUIST, V22, P39
[6]   Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010 [J].
de Bruijn, Berry ;
Cherry, Colin ;
Kiritchenko, Svetlana ;
Martin, Joel ;
Zhu, Xiaodan .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) :557-562
[7]  
Fan RE, 2008, J MACH LEARN RES, V9, P1871
[8]   ALGORITHM-97 - SHORTEST PATH [J].
FLOYD, RW .
COMMUNICATIONS OF THE ACM, 1962, 5 (06) :345-345
[9]  
Mani I, 2006, COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, P753
[10]  
McClosky David, 2006, P HUMAN LANGUAGE TEC, P152, DOI [10.3115/1220835.1220855, DOI 10.3115/1220835.1220855]